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The purpose of this book is to give you a working knowledge of the big ideas of statistics and 
of the methods used in solving statistical problems. Because data always come from a real- 
world context, doing statistics means more than just manipulating data. The Practice of 
Statistics (TPS), Sixth Edition, is full of data. Each set of data has some brief background to 
help you understand where the data come from. We deliberately chose contexts and data sets in 
the examples and exercises to pique your interest. 

TPS 6e is designed to be easy to read and easy to use. This book is written by current high 
school AP® Statistics teachers, for high school students. We aimed for clear, concise 
explanations and a conversational approach that would encourage you to read the book. We 
also tried to enhance both the visual appeal and the book’s clear organization in the layout of 
the pages. 

Be sure to take advantage of all that TPS 6e has to offer. You can learn a lot by reading the 
text, but you will develop deeper understanding by doing the Activities and Projects and 
answering the Check Your Understanding questions along the way. The walkthrough guide on 
pages xiv—xx gives you an inside look at the important features of the text. 

You learn statistics best by doing statistical problems. This book offers many different 
types of problems for you to tackle. 


e Section Exercises include paired odd- and even-numbered problems that test the same skill 
or concept from that section. There are also some multiple-choice questions to help prepare 
you for the AP® Statistics exam. Recycle and Review exercises at the end of each exercise 
set involve material you studied in preceding sections. 


e Chapter Review Exercises consist of free-response questions aligned to specific learning 
targets from the chapter. Go through the list of learning targets summarized in the Chapter 
Review and be sure you can say of each item on the list, “I can do that.” Then prove it by 
solving some problems. 


e The AP® Statistics Practice Test at the end of each chapter will help you prepare for in- 
class exams. Each test has about 10 multiple-choice questions and 3 free-response problems, 
very much in the style of the AP® Statistics exam. 


e Finally, the Cumulative AP® Practice Tests after Chapters 4, 7, 10, and 12 provide 
challenging, cumulative multiple-choice and free-response questions like those you might 
find on a midterm, final, or the AP® Statistics exam. 


The main ideas of statistics, like the main ideas of any important subject, took a long time 
to discover and thus take some time to master. The basic principle of learning them is to be 
persistent. Once you put it all together, statistics will help you make informed decisions based 
on data in your daily life. 


TPS and AP® Statistics 


The Practice of Statistics (TPS) was the first book written specifically for the Advanced 
Placement (AP®) Statistics course. Like the previous five editions, TPS 6e is organized to 
closely follow the AP® Statistics Course Description. Every item on the College Board’s 
“Topic Outline” is covered thoroughly in the text. Visit the book’s website at 
highschool.bfwpub.com/tps6e for a detailed alignment guide. The few topics in the book that 
go beyond the AP® Statistics syllabus are marked with an asterisk (*). 

Most importantly, TPS 6e is designed to prepare you for the AP® Statistics exam. The 
author team has been involved in the AP® Statistics program since its early days. We have 
more than 40 years’ combined experience teaching AP® Statistics and grading the AP® exam! 
Both of us have served as Question Leaders for more than 10 years, helping to write scoring 
rubrics for free-response questions. Including our Content Advisory Board and Supplements 
Team (page vi), we have extensive knowledge of how the AP® Statistics exam is developed 
and scored. 

TPS 6e will help you get ready for the AP® Statistics exam throughout the course by: 

e Using terms, notation, formulas, and tables consistent with those found on the AP® 
Statistics exam. Key terms are shown in bold in the text, and they are defined in the 
Glossary. Key terms also are cross-referenced in the Index. See page F-1 to find “Formulas 
for the AP® Statistics Exam,” as well as Tables A, B, and C in the back of the book for 
reference. 

¢ Following accepted conventions from AP® Statistics exam rubrics when presenting 
model solutions. Over the years, the scoring guidelines for free-response questions have 
become fairly consistent. We kept these guidelines in mind when writing the solutions that 
appear throughout TPS 6e. For example, the four-step State-Plan—Do—Conclude process that 
we use to complete inference problems in Chapters 8—12 closely matches the four-point AP® 
scoring rubrics. 

e Including AP® Exam Tips in the margin where appropriate. We place exam tips in the 
margins as “on-the-spot” reminders of common mistakes and how to avoid them. These tips 
are collected and summarized in the About the AP® Exam and AP® Exam Tips appendix. 

e Providing over 1600 AP®-style exercises throughout the book. Each chapter contains a 
mix of free-response and multiple-choice questions that are similar to those found on the 
AP® Statistics exam. At the start of each Chapter Wrap-Up, you will find a FRAPPY (Free 
Response AP® Problem, Yay!). Each FRAPPY gives you the chance to solve an AP®-style 
free-response problem based on the material in the chapter. After you finish, you can view 


and critique two example solutions from the book’s Student Site 
(highschool.bfwpub.com/tps6e). Then you can score your own response using a rubric 
provided by your teacher. 


Turn the page for a tour of the text. See how to use the book to realize success in the course and 
on the AP® Statistics exam. 


READ THE TEXT and use the book’s features to help 
you grasp the big ideas. 


Section 3.1 Scatterplots and Correlation \y 153 


| SECTION 3.1 | Scatterplots and Correlation 


LEARNING TARGETS 03y the end of the section, you should be able to: 


Read the LEARNING 
TARGETS at the 
beginning of each 
section. Focus on 


* Distinguish between explanatory and response * Interpret the correlation, 


variables for quantitative data, * Understand the basic properties of correlation, 
* Make a scatterplot to display the relationship including how the correlation is influenced by 


between two quantitative variables, outliers. 

+ Describe the direction, form, and strength of * Distinguish correlation from causation, 
a relationship displayed in a scatterplot and 
identify unusual features. 


mastering these skills 
and concepts as you 
work through the 
chapter. 


Mast statistical studies examine data on more than one variable. Wortunately, 
analysis of relationships between two variables builds on the same tools we 
used to analyze one variable, The principles that guide our work also remain 
the same: 

¢ Plot the data, then look for overall patterns and departures from those patterns, 
¢ Add numerical summaries. 


¢ When there’s a regular overall pattern, use a simplified model to deseribe it 


It’s important to 
Explanatory and Response Variables learn the language of 
win a sdesteiace’ : statistics. Take note of 
In the “Candy grab” activity, the number of candies is the response variable. the green DEFINITION 


Hand span is the explanatory variable because we anticipate that knowing a stu- 


dent's hand span will help us predict the number of candies that student can grab. | Boxes that explain 
important vocabulary. 


DEFINITION Response variable, Explanatory variable Flip back to them to 


A response variable measures an outcome of a study. An explanatory variable . 
may help predict or explain changes in a response variable. ela key terms and 
their definitions, or turn 


to the Glossary/Glosario 


Scan the margins 
for the green notes, 


It is easiest to identify explanatory and response variables when we initially 


You will often see explanatory . : : ; . 
which represent the variables called independent specify the values of one variable to see how it affects another variable. For | at the back of the book. 
Hunt, ” variables and response variables instance, to study the effect of alcohol on body temperature, researchers gave 
voice of the teacher called dependent variables. Because several different amounts of alcohol to mice, Then they measured the change in 
giving helpful hints for the words nasnaktidale wii each mouse’s body temperature 15 minutes later, In this case, amount of alcohol 
. is the explanatory variable, and change in body temperature is the response vari- 
Statistics, we won't use them here, | ; R 3 ri 3 
being successful in the able. When we don't specify the values of either variable before collecting the 
course. data, there may or may not be a clear explanatory variable. 


HOW TO DESCRIBE A SCATTERPLOT 


When you are asked to describe ‘To describe a scatterplot, make sure to address the following four characteristics 
the association shown in a in the context of the data: 


ts the afi cso TIPS. as give egelened yntaiaed ¢ Direction: A scatterplot can show a positive association, nega Look for the boxes 
advice on how to be successful on with the green bands. 


h Ape and strength of the association, i A | \ i f li f 
t e Statistics exam. along with any unusual fea- © Porm: / scatterp lot can show a linear form or a nonlinear fo 

: Some explain how to 
Jem. This means that you need form is nonlinear make graphs or set 


tures, in the context of the prob- form is linear if the overall pattern follows a straight line, Ot 
to use both variable names in ¢ Strength: A scatterplot can show a weak, moderate, or strong f : 
your description. An association is strong if the points don’t deviate much from’ up calculations, while 
identified. An association is weak if the points deviate quite a others recap important 


association, or no association. 


Watch for CAUTION ICONS. 
They alert you to common 

mistakes that students often 
make. 


concepts. 


Few relationships are linear for all values of the explanatory variable. Don’t 
o make predictions using values of x that are much larger or much smaller than 
those that actually appear in your data. 


LEARN STATISTICS BY DOING STATISTICS 


Every chapter begins with a hands-on 
ACTIVITY that introduces the content of 

the chapter. Many of these activities involve 
collecting data and drawing conclusions from 
the data. In other activities, you'll use dynamic 
applets to explore statistical concepts. 


Yea eke Me Guess the correlation ¢€ 


Correlation Guessing Game button to see a “random” scatterplot. As 


ACTIVITY Bennie 


In this activity, you will investigate if students with a larger hand span can grab 
more candy than students with a smaller hand span, 


1, Measure the span of your dominant hand to the nearest half 
centimeter (cm). Hand span is the distance from the tip of the thumb 
to the tip of the pinkie finger on your fully stretched-out hand, 


2. One student at a time, go to the front of the class and use your 
dominant hand to grab as many candies as possible from the 
container. You must grab the candies with your fingers pointing 
down (no scooping!) and hold the candies for 2 seconds before 
counting them. After counting, put the candy back into the 
contaimer, 


3. On the board, record your hand span and number of candies in 
a table with the following headings: 


1 Hand span (em) Number of candies 


4. While other students record their yalues on the board, copy 

the table onto a piece of paper and make a graph. Begin by 
constructing a set of coordinate axes. Label the horizontal axis 
the vertical axis "Number of candies.” 
scale for each axis and plot each point 
le as accurately as you can on the graph. 


ll you about the relationship between 
of candies? Summarize your observa- 
oO, 


In this activity, we will have a class competition to see who can best guess the 
correlation, 


1. Load the Guess the Correlation applet at www.rossmanchance.com/ 
applets. 


2. The teacher will press the “New Sample” 


© Edit/Paste Data 


Chapters 1, 3, 4, 10, and 12 
conclude with a CHAPTER 


an opportunity to think like a 


of interest. The other two 


Collect Data, Analyze Data, 
Interpret Results. 


Number of points: [25 100 


Correlation guess; fe] > 


PROJECT. Three of the projects 
(Chapters 1, 3, and 10) provide 


statistician by analyzing larger 
data sets with multiple variables 


(Chapters 4 and 12) are longer- 
term projects that require you to 
engage in the statistical problem- 
solving process: Ask Questions, 


a class, try to guess the correlation. Type 
Data the guess in the “Correlation guess” box 
and press “Check Guess” to see how the 


Chapter 3 Project Investigating Relationships 
in Baseball 


What is a better predictor of the number of wins fora baseball 5, Because the number of wins a team has is dependent on 
team, the number of runs scored by the team or the number both how many runs they score and how many runs they 
of mins they allow the other team to score? What variables can allow, we can use a combination of both variables to pre- 
we use to predict the number of runs a team scores? To predict dict the number of wins, Add a column in your data table 
the number of runs it allows the other team to score? In this for a new variable, run differential. Fill in the values using 
project, you will use technology to help answer these questions the formula WIG — RAG 

by exploring a lange set of data from Major League Baseball 6, Create a seatterplot to investigate the relationship between 


run differential and wins. Then calculate the equation of 
Part 1 the least-squares regression line, the standard deviation of 
the residuals, and r°. 


. “ Tes ata 2012-2016" Excel fi , 

1. Download the “MLB ‘Team Data 2012-2016" Excel file Is run differential a better predictor than the variable you 
from the book's website, along with the “Glossary for MLB chose in Question 4? Explain your reasoning, 
Tear Data file,” which explains each of the variables 
included in the data set.* Import the data inte th 


s 


cal software package you prefer 


2. Create a scatterplot to investigate the relationship] 
runs scored per game (R/G) and wins (W), Th 
late the equation of the least-squares regression 
standard deviation of the residuals, and r*. Note: 
the section for hitting statistics and W is in the s¢ 


CHECK YOUR UNDERSTANDING 


In Exercises 3 and 7, we asked you to make and describe a scatterplot for the 
hiker data shown in the table. 


Body weight (Ib) 120 187 109 103 131 165 158 116 
Backpack weight (Ib) 26 30 26 24 29 35 31 28 


pilching statistics. 


1. Caleulate the equation of the least-squares regression line. 


2. Make a residual plot for the linear model in Question 1 


CHECK YOUR UNDERSTANDING questions 
appear throughout the section. They help clarify 
definitions, concepts, and procedures. Be sure 
to check your answers in the back of the book. 


3. What does the residual plot indicate about the appropriateness of the linear model? 
Explain your answer 


EXAMPLES: Model statistical problems and how to 


solve them 


Read through each EXAMPLE, and then try out the concept yourself 
by working the FOR PRACTICE, TRY exercise in the Section Exercises. 


Need extra help? Examples and exercises 
marked with the PLAY ICON .e) are supported 
by short video clips prepared by experienced 
AP® Statistics teachers. The video guides you 
through each step in the example and solution 
and provides additional explanation when you 
need it. 


Old Faithful and fertility 
Describing a scatterplot 


PROBLEM: Describe the relationship in each of the 


following contexts. 


(a) The scatterplot on the left shows the relationship between 
the duration (in minutes} of an eruption and the interval 
of time until the next eruption (in mimuates) of Old Faithful 
during a particular month, 


EigshotaiStockGatty images 


(b) The scatterplot on the right shows the relationship between the average income (gross domestic product 
per person, in dollars) and fertility rate (number of children per woman) in 187 countries." 


Example: Old Faithful and fertility 


Describe the relationship in each of the following contexts. 


— s+} rt ; dal The scattergiog on the Jef chows the 
i« oo? * | eeletionstip Between the duration [in 
ze oegeee mirartes) of an eruptinn and the ietervat 
: Sas ok PS 

= 


of time Unts the next eruption (iy 


«| ag: - mminaites) of Otd Faithful uring a 
1 “e particuar month, 


Interval (min) 
ze 
. 
. 
ie 
S60 
. 
fee 
: 
oe pg 
“ot . 
. 
Fertility rate 
enw ewe 


1 2 3 | 5 0 10,000 80,000 120,00 160,000 
Duration (min) Average income ($) 


SOLUTION: 


(a) There is a strong, positive linear relationship between the duration 
of an eruption and the interval of time until the next eruption. There 
are two main clusters of points: one cluster has durations around 


| Caffeine and pulse rates 


between average , 
isa potential o How random assignment works 
fertility rate aro 


PROBLEM: A total of 20 students have agreed to participate 
in an experiment comparing the effects of caffeinated cola 
and caffeine-free cola on pulse rates, Deseribe how you would 
randomly assign 10 students to each of the two treatments: 


(a) Using 20 identical slips of paper 


THE VOICE OF THE TEACHER. 
Study the worked examples 
The SOLUTION SOLUTION: and pay special attention to 


(b) Using technology 
(c) Using Table D 


is presented ina (a) On 10 slips of paper, write the letter "A"; on the remaining When describing a method of random the carefully placed “Teacher 

; 10 slips, write the letter “B.” Shuffle the slips of paper and assignment, don't stop after creating the | Fe 
special font and hand out one slip of paper to each volunteer. Students who get _| groups. Make sure to identify which group Talk” comment boxes that 
models the style an “A’ slip receive the cola with caffeine and students who get _| gets which treatment. _ | guide you step by step through 


a“B" slip receive the cola without caffeine. 


steps, and language | (») Label each student with a different integer from 1 to 20, Then | When using a random number generator ora) | te Solution. These comments 

that you should use randomly generate 10 different integers from 1 to 20. The table of random digits to assign treatments, offer lots of good advice—as if 
R Students with these labels receive the cola with caffeine. The make sure to account for the possibility of A ¢ F 

to earn full credit on remaining 10 students receive the cola without caffeine. repeated numbers wien describing your method, | YOU teacher is working directly 

the AP® Statistics (c) Label each student with a different integer from 011 to 20. Go toaline of Table D and read two-digit groups | With you to solve a problem. 


moving from left to right. The first: 10 different labels between 011 and 20 identify the 10 students who 
exam. receive cola with caffeine. The remaining 10 students receive the caffeine-free cola. |gnore groups of digits 
from 21 to 00, 


| FOR PRACTICE, TRY EXERCISE 63 | PRACTICE, TRY EXERCISE 63 


The blue page number icon next 

, ‘: 63. Layoffs and “survivor guilt” Workers who survive a 
to an exercise points you back POPE layott of other employees at their location may suffer 

‘ from “survivor guilt” A study of survivor guilt and its 

to the page on which the model = effects used as subjects 120 students who were offered 
example appears. an opportunity to carn extra course credit by doing, 
proofreading, Each subject worked in the same cubi- 
cle as another student, who was an accomplice of the 
experimenters. Ata break midway through the work, 


EXERCISES: Practice makes perfect! 


Start by reading the SECTION SUMMARY to be sure that 


you understand the big ideas and key concepts. 


Section 3.1 | Summary 


¢ A scatterplot displays the relationship between two quantitative variables 
measured on the same individuals. Mark values of one yariable on the hori- 
zontal axis (x axis} and values of the other variable on the vertical axis (y axis). 
Plot each individual's data as a point on the graph, 


Practice! Work the EXERCISES 
assigned by your teacher. Compare 
your answers to those in the 


© If we think that a variable x may help predict, explain, or even cause changes 
in another variable y, we call x an explanatory variable and y a response vari- 
able, Always plot the explanatory variable on the x axis of a scatterplot. Plot 
the response variable on the y axis. 


Solutions appendix at the back of the 
book. Short solutions to the exercises 
numbered in red are found in the 
appendix. 


¢ When describing a scatterplot, look for an overall pattern (direction, form, 
strength) a - : - a ae ‘ 
answer in ¢ 


Section 3.1 | Exercises 


1. Coral reefs and cell phones Identify the explanatory Track and Field team," Describe the relationship 
pgfE@ variable and the response variable for the following between height and weight for these athletes 


Most of the exercises are paired, meaning that S relationships, if possible. Explain your reasoning. 
odd and even-numbered exercises test the fir cc reeernn prann 
same skill or concept. If you answer an assigned (b) The number of text messages sent and the number of 
exercise incorrectly, try to figure out your mistake. phone calls made in a sample of 100 students 


Then see if you can solve the paired exercise. . Teenagers and corn yield Identify the explanatory 


variable and the response variable for the following 
relationships, if possible. Explain your reasoning, 


*  Directi 


hm 
Weight (Ib) 


{a) ‘The height and arm span of a sample of 50 teenagers 


(b) ‘The yield of com in bushels per acre and the amount @ 62 68 66 & % 72 1 76 78 BO 82 
Look for ICONS that appear of rain in the growing season Height (in.) 
3. Heavy backpacks Ninth-grade students at the Webb §.. StarbucksThe scutterplot st » relat 

dsviek sort Bs atterplot shows the relationship 
next to selected EXERCISES. poke Schools go ona backpacking trip each fall. Students are between the amount of fat (in grams) and number 
They will guide you to eS - pay a at ea are 2” ~—s tek of calories in products sold at Starbucks,'' Describe 

roma hat. Before leaving, students and their backpacks the relationship between fat and calories for these 
e the Example that models are weighed. ‘The data here are from one hiking group. products 


Make a scatterplot by hand that shows how backpack 


the CXEICISE. : weight relates to body weight 
videos that provide step-by- Body weight (Ib) 120 187 109 103 191 165 158 116 
step instructions for solving Backpack weight (Ib) 26 30 26 24 29 35 91 28 rs 
the exercise. . Putting success How well do professional golfers putt 


from various distances to the hole? The data show 


as 


Calories 
s 


Multiple Choice: Select the best answer for Exercises 71-78 

71. Which of the following is not a characteristic of the 

Exercise: Heavy Backpacks least-squares regression line? 

(a) The slope of the least-squares regression line is always 
between —1 and 1 


of putts maq 


Mahe # scatterplot by hand that shows how backpack weight relates to, 

ody weight. 

(b) The least-squares regression line always goes through 
the point (%, y). 


(c) The least-squares regression line minimizes the sum of 
squared residuals, 


(d) The slope of the least-squares regression line Recycle and Review 

always have the same sign as the correlation 79, Fuel economy (2.2) In its recent Fuel Economy Guide, 
(ce) The least-squares regression line is not resistant the Environmental Protection Agency (EPA) gives 
data on 1152 vehicles. There are a number of outliers, 
mainly vehicles with very poor gas mileage or hybrids 
with very good gas mileage. IF we ignore the outliers, 
however, the combined city and highway gas mileage 
of the other 1120 or so vehicles is approximately Nor- 
mal with mean 18,7 miles per gallon (mpg) and stan- 
dard deviation 4.3 mpg. 


Various types of problems in the Section Exercises let you 


practice solving many different types of questions, including 
AP®-style multiple-choice and free-response. The Recycle 
and Review exercises refer back to concepts and skills learned SEIS LN ae inies DAN eee ak Thon ia Oe 


The Chevrolet Malibu with a four-eylinder engine 
has a combined gas mileage of 25 mpg. What percent 


Malibu? 
in an earlier section, noted in purple after the problem title. 


REVIEW and PRACTICE for quizzes and tests and the 
AP® STATISTICS EXAM 


Study the CHAPTER REVIEW to be sure that you 
understand the key concepts in each section. 


Chapter 3 Wrap-Up —-“—- 


Chapter 3 Review 


Section 3.1; Scatterplots and Correlation scatterplot is the appropriate type of geaph to investigate rela- 
In this section, you learned how to explore the relationship —_ tionships between two quantitative variables. To describe 
between two quantitative variables, As with distributions of a scatterplot, be sure to discuss four characteristics: diree- 
a single variable, the first step is always to make a graph. A tion, form, strength, and unusual features. The direction of 


Use the WHAT DID YOU LEARN? 
table that directs you to examples and 
exercises to verify your mastery of 

each LEARNING TARGET. 


What Did You Learn? 


Related Example Relevant Chapter Review 
Learning Target Section on Page(s) Exercise(s) 


Distinguish between explanatory and response variables for 

(quantitative data, 31 154 R34 7 

Make a scatterplot to display the relationship between two 31 158 Asa SUMMARY TAB ES in 
quantitative variables = " se Chapters 8-11 review 
Describe the direction, form, and strength of a retationship displayed 44 158 R31. RB2 imp : details of 


ina scatleinlot and identity unusual features. 
Interpret the correlation 3.1 162 R33 


Understand the basic properties of correlation, including how the 
correlation is influenced by outliers. 


Distinguish correlation from causation. 3 
Make predictions using regression lines, keeping in mind the 


inference procedure, including 
conditions and formulas. 


Comparing significance tests for a proportion and a mean 


31 165, 169 R31, R3.2 


dangers of extrapolation Significance test for p Significance test for y« 
- Name One-sample 2 test for One-sample ftest for je 
(Tl-83/84) {1-PropZTest) {TTest) 
Formula " p-Pm sa X~ Wo 
[kl — py) a5 
Von Va 


P-vaiue from f distribution vath df =n — 4 
¢ Random: The data come from a random sample from 


P-value from standard Normal distribution 
* Random: The data come trom a random sample from 


Chapter 3 Review Exercises 


the population of interest, the population of interest. 
10%: When sampling without replacement, 10%: When sampling without replacement, 
These exercises are designed to help you review the important — {c)_ Point B is A< 0.108, n<0.10N. 
ideas and methods of the chapter. point have * Large Counts Both ig and n(1 — pa)are at least 10,» Normal/Large Sample: The population has a Normal 
: squares regi That is, the expected number of successes and the distribution or the sample size is large (9 = 30). If the 
R3.1 Born to be old? Is there a relationship between the ihe reaidaal expected number ct failures in the sample are both at population distribution has unknown shape and 7 < 30, 
2 gestational period (time from conception to birth) of saat least 10. use 2 graph of the sample data to assess the Normality of 


an animal and its average life span? The figure shows —-R3.2. Penguins! 
a scatterplot of the gestational period and average life a relation 
span for 43 species of animals.*” to seek fe vey 
For all but the shallowest dives, there is an association 
between x = depth (in meters) and y = dive duration 


the population. Do not use ¢ procedures if the graph shows 
strong skewness or outliers, 


os °B (in minutes} that is different for each penguing 
os ° study gives a scatterplot for one penguin titled [ 
3 0 Relation of Dive Duration {y} to Depth (x).” Thd . ae > 
= . terplot shows an association that is positive, Li Review Exercise: Late bloomers? 
§ > ome e and strong, . 
ai . octal ; 3 P ae (b) Use technology to calculate the correlation and 
4 o. 5 “th 2 (a) Explain the meaning of the term positive associati¢ the equation of the least-squares regression line. 


hs this context. 

kle the CHAPTER REVIEW EXERCISES 
ww 2% 1 for practice in solving problems that test 
concepts from throughout the chapter. Need 
‘more help or just want additional insights 
before you take the practice test? Watch the 
Chapter Review Exercise Videos. 


Interpret the slope and y intercept of the line in this 
setting. 


term finear associatior 


terin strong association) The correlation is r = —0.85. 


The equation of the LSRL is 9 = 33.12 — 4.69x, 
where 9 represents the predicted number of days and 
x represents the average March temperature, 


Chapter 3 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question, 


73.41 A school guidance counselor examines how many 
extracurricular activities students participate in and 
their grade point average. The guidance counselor 
says, “The evidence indicates that the correlation 
between the number of extracurricular activities a 
student participates in and his or her grade pomt aver- 
age is close to 0.” Which of the following is the most 
appropriate conclusion? 


Each chapter concludes with an 
AP® STATISTICS PRACTICE TEST. 


Questions T3.3-T3.5 refer to the following setting 
Scientists examined the activity level of 7 fish at different 
temperatures. lish activity was rated on a scale of 0 (no 
activity) to 100 (maximal activity). The temperature 

was measured in degrees Celsius. A computer 

regression printout and a residual plot are provided. 
Notice that the horizontal axis on the residual plot is 
labeled “Fitted value,” which means the same thing as 
“predicted value.” 


This test includes about 10 AP®- 
style multiple-choice questions and 
3 free-response questions. 


(a) Students involved in many extracurricular activities 
tend to be students with poor grades, 


(b) Students with good grades tend to be students who are . 

not involved in many extracurricular activities. Sa 
({c) Students involyed in many extracurricular activities are 3 255 ¥ 

° 
just as likely to get good grades as bad grades, 3 no Ni 
7) 

(d) Students with good grades tend to be students who are a _.; 

involved in many extracurricular activities, _ 

ai 


(e) No conclusion should be made based on the correlation 
without looking at a scatterplot of the data. 


Cumulative AP® Practice Test 1 


Section |: Muttiple Choice Choose the best answer for Questions AP|, [-AP1.14. 


AP1.2 A child is #0) inches tall, which places her at the 90th 
percentile of all children of similar age. The heights 
for children of this age form an approximately Nor- 
mal distribution with a mean of 38 inches. Based on 
this information, what is the standard deviation of 
the heights ofall children of this age? 


(a) 0.20 inch 
(b) O31 inch 
(c}) 0.65 inch 
(d) 1.21 inches 


AP1.1 You look at real estate ads for houses in Sarasota, 
Florida. Many houses have prices from $200.0) to 
$400,000. The few houses on the water, however, 
have prices up to $15 million. Which of the fol- 
lowing statements best describes the distribution of 
home prices in Sarasota? 


Four CUMULATIVE AP® PRACTICE 
TESTS simulate the real exam. 
They are placed after Chapters 4, 
7, 10, and 12. The tests expand in 


{a) ‘he distribution is most likely skewed to the left, and 
the mean is greater than the median 


(b) The distribution is most likely skewed to the left, and 
the mean is less than the median 


length and content coverage as you 
work through the book. The last test 
models a full AP® Statistics exam. 


(ce) The distribution is roughly symmetric with a few high out 


liers, and the mean is approximately equal to the median, (e) 1.56 inches 


(d) The distribution is most likely skewed to the right, and 
the mean is greater than the median. 


(e) The distribution is most likely skewed to the right, and 
the mean is less than the median 


PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics (a) 
exam free response questions. Your task is to generate a com- 
plete, concise respanse in 15 minutes 


Briefly describe the association shown in the 
scatterplot, 

‘The equation of the least-squares regression line for 
these data is ¥ = 180.8 + 15,8x, Interpret the slope 
of the line in the context of the study 

) Calculate and interpret the residual for the flower 
that had 2 tablespoons of sugar and looked fresh for 
204 hours 


Directions: Show all your work. Indicate clearly the meth- 
ods you use, because you will he scored on the correctness 
of your methods as well as on the accuracy and complete- 
ness of your results and explanations 


Learn how to answer free 


‘Two statistics students went ta a flower shop and ran- 
domly selected 12 carnations, When they got home, the 
students prepared 12 identical vases with exactly the same 
amount of water in each vase. They put one tablespoon of 
sugar in 3 vases, two tablespoons of sugar in 3 vases, and 
three tablespoons of sugar in 3 vases. In the remaming 3 
vases, they put no sugar. After the vases were prepared, 


Suppose that another group of students conducted 
a similar experiment using 12 Mowers, but included 
different varieties in addition to carnations. Would 
you expect the value ofr for the second group's data 
to be greater than, less than, or about the same as the 
value ofr for the first geoup’s data? Explain. 


response questions successfully 
by working the FRAPPY!—the 
Free Response AP® Problem, 
Yay!—that begins the Chapter 


the students randomly assigned | camation to each vase 
and abserved how many hours each flower continued to 
look fresh, A scatterplot of the data is shown below 


After you finish, you can view two example solutions on the 
book's website (highschool.bfwpub.com/tps6e). Determine 
whether you think each solution is “complete,” “substantial,” 
“developing,” or “minimal.” if the solution is not complete, what 
improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric, Score 
your response and note what, if anything, you would do differently 
fp improve your own score 


Wrap-Up in every chapter. 


Use TECHNOLOGY to discover and analyze 


SMe Cleats esit:tes) COMPUTING NUMERICAL SUMMARIES 


Use technology as a tool for discovery and analysis. 
TECHNOLOGY CORNERS give step-by-step 
instructions for using the TI-83/84 calculator. 
Instructions for the Tl-Nspire and other calculators are 


on the book’s Student Site (highschool.bfwpub.com/ 
tps6e) and in the e-Book platform. 


TI-Nspire and other technology instructions are on the book's website at 
highschool.bfwpub.com/tps6e. 


NORMAL FLOAT AUTO REAL RADIAN MP 


it 


Let's find numerical summaries for the boys’ shoes data from the 
example on page 6+. We'll start by showing you how to compute 
summary statistics on the T1-83/84 and then look at output from 
computer software. 

L One-variable statistics on the TI-83/84 

1. Enter the data in list L1. 
2. Find the summary statistics for the shoe data. 


© Press (CALC); choose 1-VarStats. 
OS 2.55 or later: In the dialog box, press [2nd] [1] (L1) and 


[ENTER] to specify LI as the List. Leave FreqList blank. ieuwy. agieaseee 
Arrow down to Calculate and press [ENTER]. ox=9. 183000599 
28 


Older OS: Press [2nd] [7] (L1) and [ENTER]. 


* Press [-¥] to see the rest of the one-variable statistics. 


II, Output from statistical software We used Minitab statistical software to 
calculate descriptive statistics for the boys’ shoes data. Minitab allows you 


Sx=9. 421559822 
ox=9. 183000599 


Technology Corner videos are also available 
to walk you through the key strokes needed to 
perform each analysis. 


184 Pin CE MORTAL FLOAT AUTO SEAL SADIAN NP n 


to choose which numerical summaries are included in the output. i= Lotl..OFF 
he 2 8 
Descriptive Statistics: Shoes 2:Plot2..0F f 
. 
Variable N Mean StDev Minimum Q1 Median Q3 Maximum Ssaona 3:Plots.OfF 
Shoes 20 11.65 9.42 4.00 6.25 9.00 11.75 38.00 moaoog 4:Plotsoff 
- S:PlotsOn 


Note: The TI-83/84 gives the first and third quartiles of the boys’ shoes distribution as Q; = 6.5 and Q3 = 11.5. 
Minitab reports that Q; = 6.25 and Q; = 11.75. What happened? Minitab and some other software use 
slightly different rules for locating quartiles. Results from the various rules are usually close to each other. 


lating quartiles as they may affect more than just the JOR. 
Although the Technology 


Comers focus on the TI-83/84 
graphing calculator, output 
from multiple programs— 
including Minitab and JMP—is 
used in the book’s Examples 
and Exercises to help you 
become familiar with reading 
and interpreting many 
different kinds of statistical 
summaries. 


3.2 Technology Comers 


Find the Technology Comers easily by 
TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tpsse. consulting the summary table at the 
9. Calculating least-squares regression lines Page 184 end of each section or the complete 


10. Making residual plots Page 187 || table at the back of the book. 


fl, Re ER ee ° "| Read, practice, access the 
ie ee "°°" Sshe’ | resources, and do homework 
fad Practice. =| assignments online with the new 
= Statistic Online Homework and e-Book 
3 wie tr | Platform that may be purchased 


ry ae 
trem PD rvaed nee sesser | to enhance your learning 


sagan mma.) €XPerience. 


Tere are re uncoming sapigurvents oF events 


Sapling Learring Resources 


Does listening to music while studying help or hinder learning? If an athlete fails a drug test, 
how sure can we be that she took a banned substance? Does having a pet help people live 
longer? How well do SAT scores predict college success? Do most people recycle? Which of 
two diets will help obese children lose more weight and keep it off? Can a new drug help 
people quit smoking? How strong is the evidence for global warming? 

These are just a few of the questions that statistics can help answer. But what is statistics? 
And why should you study it? 


istockphoto 


Data are usually numbers, but they are not “just numbers.” Data are numbers with a context. 
The number 10.5, for example, carries no information by itself. But if we hear that a family 
friend’s new baby weighed 10.5 pounds at birth, we congratulate her on the healthy size of the 
child. The context engages our knowledge about the world and allows us to make judgments. 
We know that a baby weighing 10.5 pounds is quite large, and that a human baby is unlikely to 
weigh 10.5 ounces or 10.5 kilograms. The context makes the number meaningful. 

In your lifetime, you will be bombarded with data and statistical information. Poll results, 
television ratings, music sales, gas prices, unemployment rates, medical study outcomes, and 
standardized test scores are discussed daily in the media. Using data effectively is a large and 
growing part of most professions. A solid understanding of statistics will enable you to make 
sound, data-based decisions in your career and everyday life. 


It is tempting to base conclusions on your own experiences or the experiences of those you 


know. But our experiences may not be typical. In fact, the incidents that stick in our memory 
are often the unusual ones. 


Do Cell Phones Cause Brain Cancer? 


Italian businessman Innocente Marcolini developed a brain tumor at age 60. He also talked on a 
cellular phone up to 6 hours per day for 12 years as part of his job. Mr. Marcolini’s physician 
suggested that the brain tumor may have been caused by cell-phone use. So Mr. Marcolini 
decided to file suit in the Italian court system. A court ruled in his favor in October 2012. 


Bloomberg via Getty Images 


Several statistical studies have investigated the link between cell-phone use and brain 
cancer. One of the largest was conducted by the Danish Cancer Society. Over 350,000 residents 
of Denmark were included in the study. Researchers compared the brain-cancer rate for the 
cell-phone users with the rate in the general population. The result: no statistical difference in 
brain-cancer rates. In fact, most studies have produced similar conclusions. In spite of the 
evidence, many people (like Mr. Marcolini) are still convinced that cell phones can cause brain 
cancer. 

In the public’s mind, the compelling story wins every time. A statistically literate person 
knows better. Data are more reliable than personal experiences because they systematically 
describe an overall picture, rather than focus on a few incidents. 


Are You Kidding Me? 


The famous advice columnist Ann Landers once asked her readers, “If you had it to do over 
again, would you have children?” A few weeks later, her column was headlined “70% OF 
PARENTS SAY KIDS NOT WORTH IT.” Indeed, 70% of the nearly 10,000 parents who 
wrote in said they would not have children if they could make the choice again. Do you believe 
that 70% of all parents regret having children? 


istockphoto 


You shouldn’t. The people who took the trouble to write to Ann Landers are not 
representative of all parents. Their letters showed that many of them were angry with their 
children. All we know from these data is that there are some unhappy parents out there. A 
statistically designed poll, unlike Ann Landers’s appeal, targets specific people chosen in a way 
that gives all parents the same chance to be asked. Such a poll showed that 91% of parents 
would have children again. 

Where data come from matters a lot. If you are careless about how you get your data, you 
may announce 70% “No” when the truth is close to 90% “Yes.” 


Who Talks More—Women or Men? 


According to Louann Brizendine, author of The Female Brain, women say nearly 3 times as 
many words per day as men. Skeptical researchers devised a study to test this claim. They used 
electronic devices to record the talking patterns of 396 university students from Texas, Arizona, 
and Mexico. The device was programmed to record 30 seconds of sound every 12.5 minutes 
without the carrier’s knowledge. What were the results? 

According to a published report of the study in Scientific American, “Men showed a slightly 
wider variability in words uttered. .. . But in the end, the sexes came out just about even in the 
daily averages: women at 16,215 words and men at 15,669.”2 When asked where she got her 
figures, Brizendine admitted that she used unreliable sources.2 

The most important information about any statistical study is how the data were 
produced. Only carefully designed studies produce results that can be trusted. 


Yogi Berra, a famous New York Yankees baseball player known for his unusual quotes, had 
this to say: “You can observe a lot just by watching.” That’s a motto for learning from data. A 


carefully chosen graph is often more instructive than a bunch of numbers. 


Do People Live Longer in Wealthier Countries? 


The Gapminder website, www.gapminder.org, provides loads of data on the health and well- 
being of the world’s inhabitants. The graph below displays some data from Gapminder.4 The 
individual points represent all the world’s nations for which data are available. Each point 
shows the income per person and life expectancy for one country, along with the region (color 
of point) and population (size of point). 


Hongqi Zhang/Alamy 


We expect people in richer countries to live longer. The overall pattern of the graph does 
show this, but the relationship has an interesting shape. Life expectancy rises very quickly as 
personal income increases and then levels off. People in very rich countries like the United 
States live no longer than people in poorer but not extremely poor nations. In some less wealthy 
countries, people live longer than in the United States. Several other nations stand out in the 
graph. What’s special about each of these countries? 
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Individuals vary. Repeated measurements on the same individual vary. Chance outcomes—like 
spins of a roulette wheel or tosses of a coin—vary. Almost everything varies over time. 
Statistics provides tools for understanding variation. 


Have Most Students Cheated on a Test? 


Commercial Eye/Getty Images 


Researchers from the Josephson Institute were determined to find out. So they surveyed about 
23,000 students from 100 randomly selected schools (both public and private) nationwide. The 
question was: “How many times have you cheated during a test at school in the past year?” 
Fifty-one percent said they had cheated at least once.2 

If the researchers had asked the same question of all high school students, would exactly 
51% have answered “Yes”? Probably not. If the Josephson Institute had selected a different 
sample of about 23,000 students to respond to the survey, they would probably have gotten a 
different estimate. Variation is everywhere! 

Fortunately, statistics provides a description of how the sample results will vary in relation 
to the actual population percent. Based on the sampling method that this study used, we can say 
that the estimate of 51% is very likely to be within 1% of the true population value. That is, we 
can be quite confident that between 50% and 52% of all high school students would say that 
they have cheated on a test. 

Because variation is everywhere, conclusions are uncertain. Statistics gives us a 
language for talking about uncertainty that is understood by statistically literate people 
everywhere. 


Chapter 1 bata Analysis 


AfriPics.com/Alamy Stock Photo 


Introduction Statistics: The Science and Art of Data 


Section 1.1 Analyzing Categorical Data 


INTRODUCTION Statistics: The Science and Art of Data 


LEARNING TARGETS By the end of the section, you should be able to: 


e Identify the individuals and variables in a set of data. 


e Classify variables as categorical or quantitative. 


We live in a world of data. Every day, the media report poll results, outcomes of medical 
studies, and analyses of data on everything from stock prices to standardized test scores to 
global warming. The data are trying to tell us a story. To understand what the data are saying, 
you need to learn more about statistics. 


DEFINITION Statistics 


Statistics is the science and art of collecting, analyzing, and drawing conclusions 
from data. 


A solid understanding of statistics will help you make good decisions based on data in your 
daily life. 


Organizing Data 


Every year, the U.S. Census Bureau collects data from over 3 million households as part of the 
American Community Survey (ACS). The table displays some data from the ACS in a recent 
year. 


Rudy Sulgan/Corbis 
Documentary/Getty Images 


Number of Time indwelling Response Household Internet 
Household Region people (years) mode income access? 
A425 Midwest 5 2-4 Internet 52,000 Yes 


936459 West 4 2-4 Mail 40,500 Yes 


50055 Northeast 2 10-19 Internet 481,000 Yes 
592934 West 4 2-4 Phone 230,800 No 
545854 ~—= South 9 2-4 Phone 33,800 Yes 
809928 South 2 30+ Internet 59,500 Yes 
110157 Midwest 1 5-9 Internet 80,000 Yes 
999347 ~~ South 1 <1 Mail 8,400 No 


Most data tables follow this format—each row describes an individual and each column 
holds the values of a variable. 


DEFINITION Individual, Variable 


An individual is an object described in a set of data. Individuals can be people, 
animals, or things. 


A variable is an attribute that can take different values for different individuals. 


Sometimes the individuals in a data set are called cases or observational units. 


For the American Community Survey data set, the individuals are households. The 
variables recorded for each household are region, number of people, time in current dwelling, 
survey response mode, household income, and whether the dwelling has Internet access. 
Region, time in dwelling, response mode, and Internet access status are categorical variables. 
Number of people and household income are quantitative variables. 


Note that household is not a variable. The numbers in the household column of the data 
table are just labels for the individuals in this data set. Be sure to look for a column of labels— 
names, numbers, or other identifiers—in any data table you encounter. 


DEFINITION Categorical variable, Quantitative variable 


A categorical variable assigns labels that place each individual into a particular 
group, called a category. 


A quantitative variable takes number values that are quantities—counts or 
measurements. 


og Not every variable that takes number values is quantitative. Zip code is one 


example. Although zip codes are numbers, they are neither counts of anything, nor 
measurements of anything. They are simply labels for a regional location, making zip code a 
categorical variable. Some variables—such as gender, race, and occupation—are categorical by 
nature. Time in dwelling from the ACS data set is also a categorical variable because the values 
are recorded as intervals of time, such as 2—4 years. If time in dwelling had been recorded to 


the nearest year for each household, this variable would be quantitative. 


To make life simpler, we sometimes refer to categorical data or quantitative data instead of 
identifying the variable as categorical or quantitative. 


EXAMPLE | Census At School (>) 
Individuals and Variables 


Garry Black/Alamy 


PROBLEM: Census At School is an international project that collects data about primary 
and secondary school students using surveys. Hundreds of thousands of students from 
Australia, Canada, Ireland, Japan, New Zealand, South Africa, South Korea, the United 
Kingdom, and the United States have taken part in the project. Data from the surveys are 
available online. We used the site’s “Random Data Selector” to choose 10 Canadian students 
who completed the survey in a recent year. The table displays the data. 


Number of Wrist 

languages Height circumference Preferred 
Province Gender spoken Handedness _ (cm) (mm) communication 
Saskatchewan Male ib Right 175.0 180 In person 
Ontario Female 1 Right 162.5 160 In person 
Alberta Male 1 Right 178.0 174 Facebook 
Ontario Male 2 Right 169.0 160 Cell phone 
Ontario Female Z Right 166.0 65 In person 
Nunavut Male 1 Right 168.5 160 Text messaging 
Ontario Female Al Right 166.0 165 Cell phone 
Ontario Male 4 Left 157.5 147 Text messaging 
Ontario Female Z Right 150.5 187 Text messaging 
Ontario Female 1 Right 171.0 180 Text messaging 


a. Identify the individuals in this data set. 


b. What are the variables? Classify each as categorical or quantitative. 
SOLUTION: 


a. 10 randomly selected Canadian students who participated in the Census At School 


S urvey. 


a | 


We'll see in Chapter 4 why choosing at random, as we did in this example, is a 


good idea. 


b. Categorical: Province, gender, handedness, preferred communication method 


Quantitative: Number of languages spoken, height (cm), wrist circumference (mm) 


There is at least one suspicious value in the data table. We doubt that the girl who 
is 166 cm tall really has a wrist circumference of 65 mm (about 2.6 inches). 
Always look to be sure the values make sense! 


FOR PRACTICE, TRY EXERCISE 1 


The proper method of data analysis depends on whether a variable is categorical or 
quantitative. For that reason, it is important to distinguish these two types of variables. The 
type of data determines what kinds of graphs and which numerical summaries are appropriate. 


AP® EXAM TIP 


If you learn to distinguish categorical from quantitative variables now, it will pay big 


rewards later. You will be expected to analyze categorical and quantitative variables 
correctly on the AP® exam. 


ANALYZING DATA A variable generally takes values that vary (hence the name variable!). 
Categorical variables sometimes have similar counts in each category and sometimes don’t. For 
instance, we might have expected similar numbers of males and females in the Census At 
School data set. But we aren’t surprised to see that most students are right-handed. Quantitative 
variables may take values that are very close together or values that are quite spread out. We 
call the pattern of variation of a variable its distribution. 


DEFINITION Distribution 


The distribution of a variable tells us what values the variable takes and how 
often it takes those values. 


Let’s return to the data for the sample of 10 Canadian students from the preceding example. 
Figure 1.1(a) shows the distribution of preferred communication method for these students in a 
bar graph. We can see how many students chose each method from the heights of the bars: cell 


phone (2), Facebook (1), in person (3), text messaging (4). Figure 1.1(b) shows the distribution 
of number of languages spoken in a dotplot. We can see that 6 students speak one language, 3 
students speak two languages, and 1 student speaks four languages. 
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FIGURE 1.1 (a) Bar graph showing the distribution of preferred communication method for the 
sample of 10 Canadian students. (b) Dotplot showing the distribution of number of languages 
spoken by these students. 


Section 1.1 begins by looking at how to describe the distribution of a single categorical 
variable and then examines relationships between categorical variables. Sections 1.2 and 1.3 
and all of Chapter 2 focus on describing the distribution of a quantitative variable. Chapter 3 
investigates relationships between two quantitative variables. In each case, we begin with 
graphical displays, then add numerical summaries for a more complete description. 


HOW TO ANALYZE DATA 


e Begin by examining each variable by itself. Then move on to study relationships among 
the variables. 


e Start with a graph or graphs. Then add numerical summaries. 


CHECK YOUR UNDERSTANDING 


\ 


y Jake is a car buff who wants to find out more about the vehicles that his classmates drive. 
He gets permission to go to the student parking lot and record some data. Later, he does 
some Internet research on each model of car he found. Finally, Jake makes a spreadsheet that 
includes each car’s license plate, model, year, color, highway gas mileage, weight, and 
whether it has a navigation system. 


1. Identify the individuals in Jake’s study. 


2. What are the variables? Classify each as categorical or quantitative. 


From Data Analysis to Inference 


Sometimes we’re interested in drawing conclusions that go beyond the data at hand. That’s the 
idea of inference. In the “Census At School” example, 9 of the 10 randomly selected Canadian 
students are right-handed. That’s 90% of the sample. Can we conclude that exactly 90% of the 
population of Canadian students who participated in Census At School are right-handed? No. 


If another random sample of 10 students were selected, the percent who are right-handed 
might not be exactly 90%. Can we at least say that the actual population value is “close” to 
90%? That depends on what we mean by “close.” The following activity gives you an idea of 
how statistical inference works. 


ACTIVITY Hiring discrimination—it just won't fly! 


An airline has just finished training 25 pilots—15 male and 10 female—to become captains. 
Unfortunately, only eight captain positions are available right now. Airline managers 
announce that they will use a lottery to determine which pilots will fill the available 
positions. The names of all 25 pilots will be written on identical slips of paper. The slips will 
be placed in a hat, mixed thoroughly, and drawn out one at a time until all eight captains 
have been identified. 


Choja/Getty Images 


A day later, managers announce the results of the lottery. Of the 8 captains chosen, 5 are 
female and 3 are male. Some of the male pilots who weren’t selected suspect that the lottery 
was not carried out fairly. One of these pilots asks your statistics class for advice about 
whether to file a grievance with the pilots’ union. 

The key question in this possible discrimination case seems to be: Is it plausible 
(believable) that these results happened just by chance? To find out, you and your 
classmates will simulate the lottery process that airline managers said they used. 


1. Your teacher will give you a bag with 25 beads (15 of one color and 10 of another) or 25 
slips of paper (15 labeled “M” and 10 labeled “F”) to represent the 25 pilots. Mix the 
beads/slips thoroughly. Without looking, remove 8 beads/slips from the bag. Count the 
number of female pilots selected. Then return the beads/slips to the bag. 


2. Your teacher will draw and label a number line for a class dotplot. On the graph, plot the 
number of females you got in Step 1. 


3. Repeat Steps 1 and 2 if needed to get a total of at least 40 simulated lottery results for your 
class. 


4. Discuss the results with your classmates. Does it seem plausible that airline managers 
conducted a fair lottery? What advice would you give the male pilot who contacted you? 


Our ability to do inference is determined by how the data are produced. Chapter 4 discusses 
the two main methods of data production—sampling and experiments—and the types of 
conclusions that can be drawn from each. As the activity illustrates, the logic of inference rests 
on asking, “What are the chances?” Probability, the study of chance behavior, is the topic of 
Chapters 5—7. We’ |l introduce the most common inference techniques in Chapters 8-12. 


Introduction Summary 


e Statistics is the science and art of collecting, analyzing, and drawing conclusions from data. 


e A data set contains information about a number of individuals. Individuals may be people, 
animals, or things. For each individual, the data give values for one or more variables. A 
variable describes some characteristic of an individual, such as a person’s height, gender, or 
salary. 

e A categorical variable assigns a label that places each individual in one of several groups, 
such as male or female. A quantitative variable has numerical values that count or measure 
some characteristic of each individual, such as number of siblings or height in meters. 


e The distribution of a variable describes what values the variable takes and how often it 
takes them. 


Introduction Exercises 


The solutions to all exercises numbered in red may be found in the Solutions Appendix. 


1. pg 3 |) Aclass survey Here is a small part of the data set that describes the students in 


an AP® Statistics class. The data come from anonymous responses to a questionnaire filled 
out on the first day of class. 


Grade Children in Homework last night Android or 
Gender level GPA family (min) iPhone? 
F 9 2.3 3 0-14 iPhone 
M 11 3.8 6 15-29 Android 


M 10 3.1 2 15-29 Android 


3. 


F 10 4.0 i 
F 10 3.4 4 
F 10 3.0 3 
M 9 3.9 2 
M 12 3.5 2 


a. Identify the individuals in this data set. 


45-59 
0-14 
30-44 
15-29 
0-14 


b. What are the variables? Classify each as categorical or quantitative. 
2. Coaster craze Many people like to ride roller coasters. Amusement parks try to increase 
attendance by building exciting new coasters. The following table displays data on several 


roller coasters that were opened in a recent year.~ 


Roller coaster Type —_ Height (ft) 
Wildfire Wood 187.0 
Skyline Steel 131.3 
Goliath Wood 165.0 
Helix Steel 134.5 
Banshee Steel 167.0 
Black Hole Steel 22d 


1 


Design 
Sit down 
Inverted 
Sit down 
Sit down 
Inverted 
Sit down 


a. Identify the individuals in this data set. 


Speed (mph) 


70.2 
50.0 
72.0 
62.1 
68.0 
25.5 


b. What are the variables? Classify each as categorical or quantitative. 
Hit movies According to the Internet Movie Database, Avatar is tops based on box-office 
receipts worldwide as of January 2017. The following table displays data on several 
popular movies. Identify the individuals and variables in this data set. Classify each 


variable as categorical or quantitative. 


Movie 
Avatar 


Titanic 

Star Wars: The Force Awakens 
Jurassic World 

Marvel’s The Avengers 

Furious 7 

The Avengers: Age of Ultron 


Harry Potter and the Deathly Hallows: 
Part 2 


Year Rating 
2009 PG- 
13 
1997 PG- 
13 
2015 PG- 
13 
2015 PG- 
13 
2012 PG- 
13 
2015 PG- 
13 
2015 PG- 
13 
2011 PG- 


13 


Time 
(min) 
162 
194 
136 
124 
142 
137 


141 


130 


Genre 
Action 


Drama 


iPhone 
iPhone 
Android 
iPhone 
iPhone 


Duration (sec) 
120 
90 
105 
130 
160 
75 


Box office ($) 
2,783,918,982 


2,207,615,668 


Adventure 2,040,375,795 


Action 


Action 


Action 


Action 


Fantasy 


1,669,164,161 


1,519,479,547 


1,516,246, 709 


1,404, 705,868 


1,328,111,219 


Frozen 2013 PG 108 Animation 1,254,512,386 


Iron Man 3 2013 PG- 129 Action 1,172,805,920 
13 


4. Skyscrapers Here is some information about the tallest buildings in the world as of 
February 2017. Identify the individuals and variables in this data set. Classify each variable 
as Categorical or quantitative. 


Height Year 
Building Country (m) Floors Use completed 
Burj Khalifa United Arab 828.0 163 Mixed 2010 

Emirates 

Shanghai Tower China 632.0 121 Mixed 2014 
Makkah Royal Clock Tower Saudi Arabia 601.0 120 Hotel 2012 
Hotel 
Ping An Finance Center China 599.0 115 Mixed 2016 
Lotte World Tower South Korea 554.5 123 Mixed 2016 
One World Trade Center United States 541.0 104 Office 2013 
Taipei 101 Taiwan 509.0 101 Office 2004 
Shanghai World Financial China 492.0 101 Mixed 2008 
Center 
International Commerce China 484.0 118 Mixed 2010 
Center 
Petronas Tower 1 Malaysia 452.0 88 Office 1998 


5. Protecting wood What measures can be taken, especially when restoring historic wooden 
buildings, to help wood surfaces resist weathering? In a study of this question, researchers 
prepared wooden panels and then exposed them to the weather. Some of the variables 
recorded were type of wood (yellow poplar, pine, cedar); type of water repellent (solvent- 
based, water-based); paint thickness (millimeters); paint color (white, gray, light blue); 
weathering time (months). Classify each variable as categorical or quantitative. 

6. Medical study variables Data from a medical study contain values of many variables for 
each subject in the study. Some of the variables recorded were gender (female or male); age 
(years); race (Asian, Black, White, or other); smoker (yes or no); systolic blood pressure 
(millimeters of mercury); level of calcium in the blood (micrograms per milliliter). Classify 
each variable as categorical or quantitative. 

7. Ranking colleges Popular magazines rank colleges and universities on their “academic 
quality” in serving undergraduate students. Describe two categorical variables and two 
quantitative variables that you might record for each institution. 

8. Social media You are preparing to study the social media habits of high school students. 
Describe two categorical variables and two quantitative variables that you might record for 
each student. 


Multiple Choice Select the best answer. 


Exercises 9 and 10 refer to the following setting. At the Census Bureau website 


www.census.gov, you can view detailed data collected by the American Community Survey. 
The following table includes data for 10 people chosen at random from the more than 1 million 
people in households contacted by the survey. “School” gives the highest level of education 


completed. 


Weight (Ib) Age (years) Travel to work (min) 


187 66 
158 66 
176 54 
339 37 
91 27 
155 18 
213 38 
194 40 
221 18 
193 a 


9. The individuals in this data set are 


a. households. 
b. people. 

c. adults. 

d. 120 variables. 


e. columns. 


10. This data set contains 


a. 7 variables, 2 of which are categorical. 
b. 7 variables, 1 of which is categorical. 
c. 6 variables, 2 of which are categorical. 


d. 6 variables, 1 of which is categorical. 


e. None of these. 


0 
n/a 


School 
Ninth grade 
High school grad 
Assoc. degree 
Assoc. degree 
Some college 
High school grad 
Master’s degree 
High school grad 
High school grad 
Fifth grade 


i 
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Gender Income last year ($) 


24,000 
0 
11,900 
6000 
30,000 
0 
125,000 
800 
2500 

0 


SECTION 1.1 Analyzing Categorical Data 


LEARNING TARGETS By the end of the section, you should be able to: 
e Make and interpret bar graphs for categorical data. 
e Identify what makes some graphs of categorical data misleading. 


e Calculate marginal and joint relative frequencies from a two-way table. 


e Calculate conditional relative frequencies from a two-way table. 


Use bar graphs to compare distributions of categorical data. 


Describe the nature of the association between two categorical variables. 


Here are the data on preferred communication method for the 10 randomly selected Canadian 
students from the example on page 3: 


In person In person Facebook Cell phone In person 


Text messaging Cellphone Text messaging Text messaging Text messaging 


We can summarize the distribution of this categorical variable with a frequency table or a 
relative frequency table. 


DEFINITION Frequency table, Relative frequency table 
A frequency table shows the number of individuals having each value. 


A relative frequency table shows the proportion or percent of individuals having 
each value. 


Some people use the terms frequency distribution and relative frequency distribution instead. 


To make either kind of table, start by tallying the number of times that the variable takes each 
value. ow Note that the frequencies and relative frequencies listed in these tables are not 


data. The tables summarize the data by telling us how many (or what proportion or percent of) 
students in the sample said “Cell phone,” “Facebook,” “In person,” and “Text messaging.” 


Preferred Tally Preferred Frequency Preferred Relative 
method method method frequency 
Cell phone || Cell phone 2 Cell phone 2/10 = 0.20 or 


20% 


Facebook | Facebook 1 Facebook 1/10 = 0.10 or 


10% 
In person [|| In person 3 In person 3/10 = 0.30 or 
30% 
Text messaging ||| Text messaging 4 Text messaging 4/10 = 0.40 or 
40% 


The same process can be used to summarize the distribution of a quantitative variable. Of 
course, it would be hard to make a frequency table or a relative frequency table for quantitative 
data that take many different values, like the ages of people attending a Major League Baseball 
game. We’II look at a better option for quantitative variables with many possible values in 
Section 1.2. 


Displaying Categorical Data: Bar Graphs and Pie Charts 


A frequency table or relative frequency table summarizes a variable’s distribution with 
numbers. To display the distribution more clearly, use a graph. You can make a bar graph or a 
pie chart for categorical data. 


DEFINITION Bar graph, Pie chart 


A bar graph shows each category as a bar. The heights of the bars show the 
category frequencies or relative frequencies. 


A pie chart shows each category as a slice of the “pie.” The areas of the slices 
are proportional to the category frequencies or relative frequencies. 


Bar graphs are sometimes called bar charts. Pie charts are sometimes called circle graphs. 


Figure 1.2 shows a bar graph and a pie chart of the data on preferred communication 
method for the random sample of Canadian students. Note that the percents for each category 
come from the relative frequency table. 


40) 


30 


~ Text 
E Messaging 
= 21) 40% 
a 
10 
Cell phone Facebook — In person Text 
messaging 
(a) Preferred communication method (b) Preferred communication method 


FIGURE 1.2 (a) Bar graph and (b) pie chart of the distribution of preferred communication 
method for a random sample of 10 Canadian students. 


Preferred Relative 
method frequency 
Cell phone 2/10 = 0.20 or 
20% 
Facebook 1/10 = 0.10 or 
10% 
In person 3/10 = 0.30 or 
30% 
Text messaging 4/10 = 0.40 or 
40% 


It is fairly easy to make a bar graph by hand. Here’s how you do it. 


HOW TO MAKE A BAR GRAPH 


Draw and label the axes. Put the name of the categorical variable under the horizontal 
axis. To the left of the vertical axis, indicate whether the graph shows the frequency 
(count) or relative frequency (percent or proportion) of individuals in each category. 


“Scale” the axes. Write the names of the categories at equally spaced intervals under the 
horizontal axis. On the vertical axis, start at 0 and place tick marks at equal intervals until 
you exceed the largest frequency or relative frequency in any category. 


Draw bars above the category names. Make the bars equal in width and leave gaps 
between them. Be sure that the height of each bar corresponds to the frequency or relative 
frequency of individuals in that category. 


Making a graph is not an end in itself. The purpose of a graph is to help us understand the 
data. When looking at a graph, always ask, “What do I see?” We can see from both graphs in 
Figure 1.2 that the most preferred communication method for these students is text messaging. 


EXAMPLE |What’s on the radio? (>) 
Making and interpreting bar graphs 


PROBLEM: Arbitron, the rating service for radio audiences, categorizes U.S. radio stations 
in terms of the kinds of programs they broadcast. The frequency table summarizes the 
distribution of station formats in a recent year.2 


Format Number of stations 
Adult contemporary 2536 
All sports 1274 
Contemporary hits 1012 
Country 2893 
News/talk/information 4077 
Oldies 831 
Religious 3884 
Rock 1636 
Spanish language 878 
Variety 1579 
Other formats 4852 
Total 25,452 


a. Identify the individuals in this data set. 


b. Make a frequency bar graph of the data. Describe what you see. 


SOLUTION: 


a. U.S. radio stations 


Number of stations 
3 


eS eS Ss ON WA so” * a eye 


Ny 
oe iy <i 
ss te eo 
Radio station format 


To make the bar graph: 


e Draw and label the axes. 


e “Scale” the axes. The largest frequency is 4852. So we choose a vertical scale 
from 0 to 5000, with tick marks 500 units apart. 


e Draw bars above the category names. 


On U.S. radio stations, the most frequent ‘formats are Other (4852), 
News/talk/information (4077), and Religious (388.4), while the least frequent are 
Oldies (831), Spanish language (878), and Contemporary hits (1012). 
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Radio station 
format 


Contemp. hits 
4% 


Spanish lang. 
3.4% 


Here is a pie chart of the radio station format data from the preceding example. You can use 
a pie chart when you want to emphasize each category’s relation to the whole. Pie charts are 
challenging to make by hand, but technology will do the job for you. Note that gg a pie chart 
must include all categories that make up a whole, which might mean adding an “other” 
category, as in the radio station example. 


IT DIDN’T HAVE 
ANYTHING USEFUL 
TO SAY SO T MADE 

THIS PIE CHART. 


IT MUST 
OOOH! BE TRUE 
( BECAUSE 


scottadams@aol.com 


OOOH! IT’S PIE. 


370% ©2009Scott Adams, Inc./Dist. by UFS, inc. 


CHECK YOUR UNDERSTANDING 


i. 


y The American Statistical Association sponsors a web-based project that collects data 
about primary and secondary school students using surveys. We used the site’s “Random 


Sampler” to choose 40 U.S. high school students who completed the survey in a recent year.2 
One of the questions asked: 


Which would you prefer to be? Select one. 
Rich Happy Famous Healthy 
Here are the responses from the 40 randomly selected students: 


Famous Healthy Healthy Famous Happy Famous’ Happy’ Happy Famous 


Rich Happy Happy Rich Happy Happy Happy Rich Happy 
Famous Healthy’ Rich Happy Happy Rich Happy Happy Rich 
Healthy Happy Happy Rich Happy Happy Rich Happy Famous 


Famous Happy Happy Happy 


Make a relative frequency bar graph of the data. Describe what you see. 


Graphs: Good and Bad 


Bar graphs are a bit dull to look at. It is tempting to replace the bars with pictures or to use 
special 3-D effects to make the graphs seem more interesting. Don’t do it! Our eyes react to the 
area of the bars as well as to their height. When all bars have the same width, the area (width x 
height) varies in proportion to the height, and our eyes receive the right impression about the 
quantities being compared. 


EXAMPLE |Who buys iMacs? (>) 
Beware the pictograph! 


Justin Sullivan/Getty Images 


PROBLEM: When Apple, Inc., introduced the iMac, the company wanted to know whether 
this new computer was expanding Apple’s market share. Was the iMac mainly being bought 
by previous Macintosh owners, or was it being purchased by first-time computer buyers and 
by previous PC users who were switching over? To find out, Apple hired a firm to conduct a 
survey of 500 iMac customers. Each customer was categorized as a new computer purchaser, 
a previous PC owner, or a previous Macintosh owner. The table summarizes the survey 
results.4 


Previous ownership Count Percent (%) 
None 85 17.0 
re. 60 12.0 
Macintosh 355 71.0 
Total 500 100.0 


a. Below is a clever graph of the data that uses pictures instead of the more traditional bars. 
How is this pictograph misleading? 


400 


300 


200 


Number of buyers 


100 


None Windows Macintosh 


Previous computer 


b. Two possible bar graphs of the data are shown below. Which one could be considered 
deceptive? Why? 
80) 80) 
70 70 
60 
50 


40 


Percent 
t & 
Percent 


30) 
0 LO 


None PC Macintosh None PC Macintosh 


Previous computer Previous computer 


SOLUTION: 


a. The pictograph makes it look like the percentage of Mac buyers who are former Mac 
owners is at least 10 times larger than either of the other two categories, which isn’t 
true. 


In part (a), the heights of the images are correct. But the areas of the images are 


misleading. The Macintosh image is about 6 times as tall as the PC image, but its 
area is about 36 times as large! 


b. The bar graph on the right 1S misleading. By starting the vertical scale at 10 instead of 
o, it looks like the percentage of iMac buyers who previously owned a PC is less than 
half the percentage who are first-time computer buyers, which isn’t true. 
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There are two important lessons to be learned from this example: Oo (1) beware the 


pictograph, and (2) watch those scales. 


Analyzing Data on Two Categorical Variables 


You have learned some techniques for analyzing the distribution of a single categorical 
variable. What should you do when a data set involves two categorical variables? For example, 
Yellowstone National Park staff surveyed a random sample of 1526 winter visitors to the park. 
They asked each person whether he or she belonged to an environmental club (like the Sierra 
Club). Respondents were also asked whether they owned, rented, or had never used a 
snowmobile. The data set looks something like the following: 


Respondent Environmental club? Snowmobile use 
1 No Own 
Z No Rent 
3 Yes Never 
4 Yes Rent 
5 No Never 


franz12/Shutterstock 


The two-way table summarizes the survey responses. 


Environmental club member? 


No Yes 
Never 212 
Snowmobile use Rent T/ 


Own 279 16 


DEFINITION Two-way table 


A two-way table is a table of counts that summarizes data on the relationship 
between two categorical variables for some group of individuals. 


A two-way table is sometimes called a contingency table. 


It’s easier to grasp the information in a two-way table if row and column totals are 
included, like the one shown here. 


Environmental club 


No Yes Total 

Never used 657 

Snowmobile use Snowmobile renter 574 
Snowmobile owner 295 

Total 1221 305 1526 


Now we can quickly answer questions like: 


e What percent of people in the sample are environmental club members? 
305 305 


385 _ pang nnx——= 0.200 = 20.0% 
3051526=0.200=20.0% 1526 700 = 79-9 1596 ‘ 


e What proportion of people in the sample never used a snowmobile? 
_ 657 
JOT 


sx) ——— = 0.431 
6571526=0.431is20 491 1526 


These percents or proportions are known as marginal relative frequencies because they 
are calculated using values in the margins of the two-way table. 


DEFINITION Marginal relative frequency 


A marginal relative frequency gives the percent or proportion of individuals that 
have a specific value for one categorical variable. 


We could call this distribution the marginal distribution of environmental club membership. 


We can compute marginal relative frequencies for the column totals to give the distribution 
of environmental club membership in the entire sample of 1526 park visitors: 


No: 12211526=0.800 or 80.0% Yes: 3051526=0.200 or 20.0% 


: 30! , 
= 0.800 0r 80.0% Yes: ——-= 0.200 or 20.0% 
1526 1526 


22 


0; 


1221 305 

No: ——=0. 0% Yes: 

Oo 1596 0.800 or 80.0% es 1596 

We can compute marginal relative frequencies for the row totals to give the distribution of 
snowmobile use for all the individuals in the sample: 


Never: 6571526=0.431 or 43.1%Rent: 5741526=0.376 or 37.6%Own: 2951526=0.193 or 19.3% 


= 0.200 or 20.0% 


657 
Never: —— = 0.431 or 43.1% 
ST a a 1526 
Never: ——~- = 0.431 or 43.1% 574 
rng Rent : — = 0.376 or 37.6% 
Rent: —-— = 0.376 or 37.6% 1526 
1526 995 
Own : = = 0.193 or 19.3% Own : 1526 = 0.193 or 19.3% 


1526 


We could call this distribution the marginal distribution of snowmobile use. 


Note that we could use a bar graph or a pie chart to display either of these distributions. 


A marginal relative frequency tells you about only one of the variables in a two-way table. 
It won’t help you answer questions like these, which involve values of both variables: 


e What percent of people in the sample are environmental club members and own 
snowmobiles? 
16 


16 8 _ _ goin = 109 
161526<0.010=1.09¢]Rg = 0010 = 1.0% Tagg = 02010 = 1.0% 


e What proportion of people in the sample are not environmental club members and never use 
snowmobiles? 
445 445 


- — 0.292 
4451526=0.2921526 79" 1526 


These percents or proportions are known as joint relative frequencies. 


DEFINITION Joint relative frequency 


A joint relative frequency gives the percent or proportion of individuals that have 
a specific value for one categorical variable and a specific value for another 
categorical variable. 


EXAMPLE |A Titanic disaster (>) 
Calculating marginal and joint relative frequencies 


wT rig iT oe | 
Che New York Cimes. === 
TITANIC SINKS FOUR HOURS AFTER HITTING ICEBERG; 
866 RESCUED BY CARPATHIA, PROBABLY 1250 PERISt 
ISMAY SAFE, MRS, ASTOR MAYBE, NOTED NAMES MISSI: 


Blank Archives/Getty Images 


PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across the Atlantic, struck 
an iceberg and sank. Some passengers got off the ship in lifeboats, but many died. The two- 
way table gives information about adult passengers who survived and who died, by class of 
travel. 


Class of travel 


Third 
Survival status Survived 151 
Died 122 167 476 


a. What proportion of adult passengers on the Titanic survived? 


b. Find the distribution of class of travel for adult passengers on the Titanic using relative 
frequencies. 


c. What percent of adult Titanic passengers traveled in third class and survived? 


SOLUTION: 


Start by finding the marginal totals. 


Class of travel 


Survival status Survived 


a. 4421207=0.366'"" 1207 


| Remember that a distribution lists the possible values of a variable and how often | 


| those values occur. | 


b. 


First: 3191207=0.264=26.4%Second: 2611207=0.216=21.6%Third: 6271207=0.519=51.9% 


¢ 


7m 9 ys Sa 3: ae ; 
First : oT 0.264 = 26.4% First : 707 0.264 = 26.4% 
261 
Second —= 0.216 = 21.6% 261 
act 1207 Second : ———= 0.216 = 21.6% 
Third : °2" — 0.519 = 51.9% 1207 
ee Third: 22" — 0.519 = 51.9% 
urd: 1207 ‘ a ° 0 


Note that the three percentages for class of travel in part (b) do not add to exactly 


100% due to roundoff error. 


1 
= 0.125 = 12.5% = 0.125 = 12.5% 
C. 1511207=0.125=12.5% 1207 
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CHECK YOUR UNDERSTANDING 


q 


y An article in the Journal of the American Medical Association reports the results of a 
study designed to see if the herb St. John’s wort is effective in treating moderately severe 
cases of depression. The study involved 338 patients who were being treated for major 
depression. The subjects were randomly assigned to receive one of three treatments: St. 
John’s wort, Zoloft (a prescription drug), or placebo (an inactive treatment) for an 8-week 
period. The two-way table summarizes the data from the experiment.2 


Treatment 
St. Zoloft | Placebo 
John’s 
wort 
Full 37 
response 
Change in Partial 13 
depression —_ response 
No 66 


response 


1. What proportion of subjects in the study were randomly assigned to take St. John’s wort? 
Explain why this value makes sense. 

2. Find the distribution of change in depression for the subjects in this study using relative 
frequencies. 


3. What percent of subjects took Zoloft and showed a full response? 


Relationships Between Two Categorical Variables 


Let’s return to the data from the Yellowstone National Park survey of 1526 randomly selected 
winter visitors. Earlier, we calculated marginal and joint relative frequencies from the two-way 
table. These values do not tell us much about the relationship between environmental club 
membership and snowmobile use for the people in the sample. 


Environmental club 


Never used 

Snowmobile use Snowmobile renter 
Snowmobile owner 
Total 


We can also use the two-way table to answer questions like: 


e What percent of environmental club members in the sample are snowmobile owners? 
16 


= 0.052 = 0.052 = 5.2% 


16 ee 
16305=0.052=5.2%305 meee S05 


e What proportion of snowmobile renters in the sample are not environmental club members? 
497 

497... ——= 0.866 

497574=0.866574 °° 574 


These percents or proportions are known as conditional relative frequencies. 


DEFINITION Conditional relative frequency 


A conditional relative frequency gives the percent or proportion of individuals 
that have a specific value for one categorical variable among individuals who 
share the same value of another categorical variable (the condition). 


EXAMPLE |A Titanic disaster (>) 
Conditional relative frequencies 


PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across the Atlantic, struck 
an iceberg and sank. Some passengers made it off the ship in lifeboats, but many died. The 
two-way table gives information about adult passengers who survived and who died, by class 
of travel. 


Class of travel 


Survival status Survived 
Died 
Total 


a. What proportion of survivors were third-class passengers? 


b. What percent of first-class passengers survived? 


SOLUTION: 
E 151 
151 _ 9 349——~= 0.342 
a. 151442=0,342442 442 
7 197 
197 _ o 618 = 61.8% >- 7 = 0.618 = 61.8% 
b. 197319=0.618=61.8%319 0.618 = 61.8% 379 - 


Note that a proportion is always a number between 0 and 1, whereas a percent isa 


number between 0 and 100. To get a percent, multiply the proportion by 100. 
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We can study the snowmobile use habits of environmental club members by looking only at 
the “Yes” column in the two-way table. 


Environmental club 
‘Yes\\ | Total 


No / Yes \ 
Never used 


657 

Snowmobile use Snowmobile renter 574 
Snowmobile owner 295 

Total 1221 | \305/ | 1526 


It is easy to calculate the proportions or percents of environmental club members who never 
use, rent, and own snowmobiles: 


Never: 212305=0.695 or 69.5%Rent: 77305=0.252 or 25.2%Own: 16305=0.052 or 5.2% 


212 
Never: ——= 0.695 or 69.5% 


305 
Never: a5 0.695 or 69.5% 77 
Rent: ——= 0.252 or 25.2% 
Rent:———= 0.252 or 25.2% 305 


305 16 
Own: Pa = 0.052 or 5.2% Ow mn: 305 — =; 052 or 5, 2 A 


We could also refer to this distribution as the conditional distribution of snowmobile use among 
environmental club members. 


This is the distribution of snowmobile use among environmental club members. 


We can find the distribution of snowmobile use among the survey respondents who are not 
environmental club members in a similar way. The table summarizes the conditional relative 
frequencies for both groups. 


Snowmobile 


use Not environmental club members En 
Never 445 
cael = -—— = ().364 or 36.4% 
4451221=0.364 or 36.4% i221 9°49 38-4 19907 212305=0.695 or 6 


Rent = 497 
497 _ 6.407 or 40.7% = 0-407 or 40.7% 
A971221=0.407 or 40.7% i221 407 F 49-7 1991 4 


77305=0.252 or 2! 


279 
279 0.229 or 22.9% = 0.229 or 22.9% 
2791221=0.229 or 22.9% 1221 1221 16305=0.052 or 


Own 


AP® EXAM TIP 


When comparing groups of different sizes, be sure to use relative frequencies 
(percents or proportions) instead of frequencies (counts) when analyzing categorical 
data. Comparing only the frequencies can be misleading, as in this setting. There 
are many more people who never use snowmobiles among the non-environmental 


club members in the sample (445) than among the environmental club members 
(212). However, the percentage of environmental club members who never use 
snowmobiles is much higher (69.5% to 36.4%). Finally, make sure to avoid 
statements like “More club members never use snowmobiles” when you mean “A 
greater percentage of club members never use snowmobiles.” 


Figure 1.3 compares the distributions of snowmobile use for Yellowstone National Park 
visitors who are environmental club members and those who are not environmental club 
members with (a) a side-by-side bar graph and (b) a segmented bar graph. Notice that the 
segmented bar graph can be obtained by stacking the bars in the side-by-side bar graph for each 
of the two environmental club membership categories (no and yes). 


Snowmobile Snowmobile 


a use 


Never 
~ ni [| Never 
Rent 
5 Own 5 i pre 
2 . 2 Own 
y = 
a a 
No Yes 
(a) Environmental club membership (b) Environmental club membership 


FIGURE 1.3 (a) Side-by-side bar graph and (b) segmented bar graph displaying the distribution 
of snowmobile use among environmental club members and among non-environmental club 
members from the 1526 randomly selected winter visitors to Yellowstone National Park. 


DEFINITION Side-by side bar graph, Segmented bar graph 


A side-by-side bar graph displays the distribution of a categorical variable for 
each value of another categorical variable. The bars are grouped together based 
on the values of one of the categorical variables and placed side by side. 


A segmented bar graph displays the distribution of a categorical variable as 
segments of a rectangle, with the area of each segment proportional to the 
percent of individuals in the corresponding category. 


Both graphs in Figure 1.3 show a clear association between environmental club 
membership and snowmobile use in this random sample of 1526 winter visitors to Yellowstone 
National Park. The environmental club members were much less likely to rent (25.2% versus 
40.7%) or own (5.2% versus 29.0%) snowmobiles than non-club-members and more likely to 
never use a snowmobile (69.5% versus 36.4%). Knowing whether or not a person in the sample 
is an environmental club member helps us predict that individual’s snowmobile use. 


DEFINITION Association 


There is an association between two variables if knowing the value of one 
variable helps us predict the value of the other. If knowing the value of one 
variable does not help us predict the value of the other, then there is no 
association between the variables. 


What would the graphs in Figure 1.3 look like if there was no association between 
environmental club membership and snowmobile use in the sample? The blue segments would 


be the same height for both the “Yes” and “No” groups. So would the green segments and the 
red segments, as shown in the graph at left. In that case, knowing whether a survey respondent 
is an environmental club member would not help us predict his or her snowmobile use. 


Snowmobile 
use 


[| Own 
fo] Rent 
a Never 


Percent 


No Yes 
Environmental club membership 


Which distributions should we compare? Our goal all along has been to analyze the 
relationship between environmental club membership and snowmobile use for this random 
sample of 1526 Yellowstone National Park visitors. We decided to calculate conditional 
relative frequencies of snowmobile use among environmental club members and among non- 
club-members. Why? Because we wanted to see if environmental club membership helped us 
predict snowmobile use. What if we had wanted to determine whether snowmobile use helps us 
predict whether a person is an environmental club member? Then we would have calculated 
conditional relative frequencies of environmental club membership among snowmobile 
owners, renters, and non-users. In general, you should calculate the distribution of the variable 
that you want to predict for each value of the other variable. 


Can we say that there is an association between environmental club membership and 
snowmobile use in the population of all winter visitors to Yellowstone National Park? Making 
this determination requires formal inference, which will have to wait until Chapter 11. 


EXAMPLE |A Titanic disaster (>) 


Conditional relative frequencies and association 


Ueivers History Archive/Getty Images 
PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across the Atlantic, struck 
an iceberg and sank. Some passengers made it off the ship in lifeboats, but many died. The 
two-way table gives information about adult passengers who survived and who died, by class 
of travel. 


Class of travel 


Survival status Survived 
Died 
Total 


a. Find the distribution of survival status for each class of travel. Make a segmented bar 
graph to compare these distributions. 


b. Describe what the graph in part (a) reveals about the association between class of travel 
and survival status for adult passengers on the Titanic. 


SOLUTION: 
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To make the segmented bar graph: 


e Draw and label the axes. Put class of travel on the horizontal axis and percent 
on the vertical axis. 


“Scale” the axes. Use a vertical scale from 0 to 100%, with tick marks every 
20%. 


Draw bars. Make each bar have a height of 100%. Be sure the bars are equal 
in width and leave spaces between them. Segment each bar based on the 
conditional relative frequencies you calculated. Use different colors or shading 
patterns to represent the two possible statuses—survived and died. Add a key 
to the graph that tells us which color (or shading) represents which status. 


b. ‘Knowing a passenger’s class of travel helps us predict his or her survival status. First 


class had the highest percentage of SUYVIVOYS (61.8%), followed by second class (36.0%), 
and then third class (24.1%). 


FOR PRACTICE, TRY EXERCISE 29 


Bar graphs can be used to compare any set of quantities that can be measured in the same units. 
See Exercises 33 and 34. 


Because the variable “Survival status” has only two possible values, comparing the three 
distributions displayed in the segmented bar graph amounts to comparing the percent of 
passengers in each class of travel who survived. The bar graph in Figure 1.4 shows this 


comparison. Note that the bar heights do not add to 100%, because each bar represents a 
different group of passengers on the Titanic. 
70 


60 


Percent who survived 


First Second Third 
Class of travel 
FIGURE 1.4 Bar graph comparing the percents of passengers who survived among each of the 
three classes of travel on the Titanic. 


We offer a final caution about studying the relationship between two variables: eT" 


association does not imply causation. It may be true that being in a higher class of travel on 
the Titanic increased a passenger’s chance of survival. However, there isn’t always a cause- 
and-effect relationship between two variables even if they are clearly associated. For example, 
a recent study proclaimed that people who are overweight are less likely to die within a few 
years than are people of normal weight. Does this mean that gaining weight will cause you to 
live longer? Not at all. The study included smokers, who tend to be thinner and also much more 
likely to die in a given period than non-smokers. Smokers increased the death rate for the 


normal-weight category, making it appear as if being overweight is better.° The moral of the 
story: beware other variables! 


CHECK YOUR UNDERSTANDING 


i. 


y An article in the Journal of the American Medical Association reports the results of a 
study designed to see if the herb St. John’s wort is effective in treating moderately severe 
cases of depression. The study involved 338 subjects who were being treated for major 
depression. The subjects were randomly assigned to receive one of three treatments: St. 
John’s wort, Zoloft (a prescription drug), or placebo (an inactive treatment) for an 8-week 
period. The two-way table summarizes the data from the experiment. 


Treatment 


Full response 


Change in depression Partial response 16 26 13 
No response 70 56 66 


1. What proportion of subjects who showed a full response took St. John’s wort? 
2. What percent of subjects who took St. John’s wort showed no response? 


3. Find the distribution of change in depression for the subjects receiving each of the three 
treatments. Make a segmented bar graph to compare these distributions. 


4. Describe what the graph in Question 3 reveals about the association between treatment 
and change in depression for these subjects. 


1. Technology Corner | ANALYZING TWO-WAY TABLES 


Statistical software will provide marginal relative frequencies, joint relative frequencies, and 
conditional relative frequencies for data summarized in a two-way table. Here is output from 
Minitab for the data on snowmobile use and environmental club membership. Use the 
information on cell contents at the bottom of the output to help you interpret what each 
value in the table represents. 


Rows: Snowmobile u Columns: Environmental club member? 
No es All 


445 21 657 
67.73 2. 100.00 
36.45 9.5 43.05 
23.16 : 43.05 


497 574 
B6.59 x 100.00 
40.70 5.25 37.61 
32.57 5.05 37.61 


279 1 295 
94.58 5. 100.00 
22.85 5.25 19.33 
18.28 .Os 19,33 

1221 3¢ 1526 
80.01 ° 100.00 
100.00 90.00 100.00 
80.01 : 100.00 


Cell Contentea: 


Section 1.1 Summary 


e The distribution of a categorical variable lists the categories and gives the frequency (count) 
or relative frequency (percent or proportion) of individuals that fall in each category. 


e You can use a pie chart or bar graph to display the distribution of a categorical variable. 
When examining any graph, ask yourself, “What do I see?” 


e Beware of graphs that mislead the eye. Look at the scales to see if they have been distorted 
to create a particular impression. Avoid making graphs that replace the bars of a bar graph 
with pictures whose height and width both change. 


e A two-way table of counts summarizes data on the relationship between two categorical 
variables for some group of individuals. 


e You can use a two-way table to calculate three types of relative frequencies: 


: A marginal relative frequency gives the percent or proportion of individuals that have a 
specific value for one categorical variable. Use the appropriate row total or column total in 
a two-way table when calculating a marginal relative frequency. 


. A joint relative frequency gives the percent or proportion of individuals that have a 
specific value for one categorical variable and a specific value for another categorical 
variable. Use the value from the appropriate cell in the two-way table when calculating a 
joint relative frequency. 


: A conditional relative frequency gives the percent or proportion of individuals that have a 
specific value for one categorical variable among individuals who share the same value of 
another categorical variable (the condition). Use conditional relative frequencies to 
compare distributions of a categorical variable for two or more groups. 


Use a side-by-side bar graph or a segmented bar graph to compare the distribution of a 
categorical variable for two or more groups. 


e There is an association between two variables if knowing the value of one variable helps 
predict the value of the other. To see whether there is an association between two categorical 
variables, find the distribution of one variable for each value of the other variable by 
calculating an appropriate set of conditional relative frequencies. 


1.1 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


1. Analyzing two-way tables 


Section 1.1 Exercises 
11. pg 11 |») Birth days The frequency table summarizes data on the numbers of babies 
born on each day of the week in the United States in a recent week.2 


Day Births 
Sunday 7374 


Monday 11,704 


Tuesday 13,169 
Wednesday 13,038 
Thursday 13,013 
Friday 12,664 
Saturday 8459 


a. Identify the individuals in this data set. 


b. Make a frequency bar graph to display the data. Describe what you see. 


12. Going up? As of 2015, there were over 75,000 elevators in New York City. The 
frequency table summarizes data on the number of elevators of each type.2 


Type Count 
Passenger elevator 66,602 
Freight elevator 4140 
Escalator 2663 
Dumbwaiter 1143 
Sidewalk elevator 943 
Private elevator 252 
Handicap lift 221 
Manlift {3 
Public elevator 45 


a. Identify the individuals in this data set. 


b. Make a frequency bar graph to display the data. Describe what you see. 


13. Buying cameras The brands of the last 45 digital single-lens reflex (SLR) cameras sold 
on a popular Internet auction site are listed here. Make a relative frequency bar graph for 
these data. Describe what you see. 


Canon Sony Canon Nikon Fujifilm 
Nikon Canon Sony Canon Canon 
Nikon Canon Nikon Canon Canon 
Canon Nikon Fujifilm Canon Nikon 
Nikon Canon Canon Canon Canon 
Olympus Canon Canon Canon Nikon 
Olympus Sony Canon Canon Sony 
Canon Nikon Sony Canon Fujifilm 
Nikon Canon Nikon Canon Sony 


14. Disc dogs Here is a list of the breeds of dogs that won the World Canine Disc 
Championships from 1975 through 2016. Make a relative frequency bar graph for these 


data. Describe what you see. 


Whippet 

Whippet 

Whippet 

Mixed breed 
Mixed breed 
Other purebred 
Labrador retriever 
Mixed breed 
Mixed breed 
Border collie 
Mixed breed 
Mixed breed 
Labrador retriever 
Labrador retriever 


Mixed breed 
Australian shepherd 
Border collie 
Australian shepherd 
Mixed breed 
Mixed breed 
Mixed breed 
Border collie 
Border collie 
Australian shepherd 
Border collie 
Australian shepherd 
Border collie 
Mixed breed 


Australian shepherd 
Australian shepherd 
Australian shepherd 
Border collie 

Border collie 
Australian shepherd 
Border collie 

Border collie 

Other purebred 
Border collie 

Border collie 

Border collie 

Mixed breed 
Australian shepherd 


15. Cool car colors The most popular colors for cars and light trucks change over time. Silver 
advanced past green in 2000 to become the most popular color worldwide, then gave way 
to shades of white in 2007. Here is a relative frequency table that summarizes data on the 


colors of vehicles sold worldwide in a recent year.2 


Color Percent of vehicles 
Black 19 
Blue 6 
Brown/beige 

Gray 12 
Green 

Red 

Silver 14 
White 29 
Yellow/gold 3 
Other 7? 


a. What percent of vehicles would fall in the “Other” category? 
b. Make a bar graph to display the data. Describe what you see. 


c. Would it be appropriate to make a pie chart of these data? Explain. 


16. Spam Email spam is the curse of the Internet. Here is a relative frequency table that 


summarizes data on the most common types of spam:12 


Type of spam Percent 
Adult 19 


Financial 20 


Health 7 
Internet 7 
Leisure 6 
Products 25 
Scams 9 
Other 2? 


a. What percent of spam would fall in the “Other” category? 
b. Make a bar graph to display the data. Describe what you see. 


c. Would it be appropriate to make a pie chart of these data? Explain. 


17. Hispanic origins Here is a pie chart prepared by the Census Bureau to show the origin of 
the more than 50 million Hispanics in the United States in 2010.4! About what percent of 
Hispanics are Mexican? Puerto Rican? 


Origin 


American 


18. Which major? About 3 million first-year students enroll in colleges and universities each 
year. What do they plan to study? The pie chart displays data on the percent of first-year 
students who plan to major in several disciplines.42 About what percent of first-year 
students plan to major in business? In social science? 


Major Arts/ 
Humanities 


Technical , Biological 


sciences 


Professional 


Physical sciences Education 


Engineering 


19. pg 12 |») Going to school Students in a high school statistics class were given data 


about the main method of transportation to school for a group of 30 students. They 
produced the pictograph shown. Explain how this graph is misleading. 


Key: 


de = 1 Cyclist 
t@&» = 2 Cars 
Ses, — 7 Bus takers 
& =2 Walkers 


Mode of transport 


20. Social media The Pew Research Center surveyed a random sample of U.S. teens and 


adults about their use of social media. The following pictograph displays some results. 
Explain how this graph is misleading. 


AGE BREAKDOWN (OF SOCIAL MEDIA USERS) 


13-18 19-29 30-49 50-64 65+ 


21. Binge-watching Do you “binge-watch” television series by viewing multiple episodes of 
a series at one sitting? A survey of 800 people who binge-watch were asked how many 
episodes is too many to watch in one viewing session. The results are displayed in the bar 
graph.42 Explain how this graph is misleading. 


30 


Percent 


3to4 5 to6 7to9 10 or There 
more is no 
“too many” 


Number of episodes 


22. Support the court? A news network reported the results of a survey about a controversial 
court decision. The network initially posted on its website a bar graph of the data similar 
to the one that follows. Explain how this graph is misleading. (Note: When notified about 
the misleading nature of its graph, the network posted a corrected version.) 


23. 


24. 


Percent who agree 
ws 
oo 


Democrats Republicans Independents 


pg 15 |») Asmash or a hit? Researchers asked 150 subjects to recall the details of a 


car accident they watched on video. Fifty subjects were randomly assigned to be asked, 
“About how fast were the cars going when they smashed into each other?” For another 50 
randomly assigned subjects, the words “smashed into” were replaced with “hit.” The 
remaining 50 subjects—the control group—were not asked to estimate speed. A week 
later, all subjects were asked if they saw any broken glass at the accident (there wasn’t 


any). The table shows each group’s response to the broken glass question. 4 
Treatment 
“Smashed into” “Hit” Control 
Response Yes 6 
No 44 


a. What proportion of subjects were given the control treatment? 


b. Find the distribution of responses about whether there was broken glass at the accident 


for the subjects in this study using relative frequencies. 


c. What percent of the subjects were given the “smashed into” treatment and said they 


saw broken glass at the accident? 


Superpowers A total of 415 children from the United Kingdom and the United States 
who completed a survey in a recent year were randomly selected. Each student’s country 
of origin was recorded along with which superpower they would most like to have: the 
ability to fly, ability to freeze time, invisibility, superstrength, or telepathy (ability to read 
minds). The data are summarized in the following table. 


Country 
U.S. 
Fly 45 
Freeze time 44 
Superpower Invisibility ot 
Superstrength Zo 


Telepathy | 44 | 66 


a. What proportion of students in the sample are from the United States? 


b. Find the distribution of superpower preference for the students in the sample using 
relative frequencies. 


c. What percent of students in the sample are from the United Kingdom and prefer 
telepathy as their superpower preference? 1 


25. Body image A random sample of 1200 U.S. college students was asked, “What is your 


perception of your own body? Do you feel that you are overweight, underweight, or about 


right?” The two-way table summarizes the data on perceived body image by gender.1& 


Body image 


Gender 
Female Male Total 
About right 855 
Overweight 235 
Underweight 110 
Total 760 440 1200 


a. What percent of respondents feel that their body weight is about right? 


b. What proportion of the sample is female? 


c. What percent of respondents are males and feel that they are overweight or 
underweight? 


26. Python eggs How is the hatching of water python eggs influenced by the temperature of 
the snake’s nest? Researchers randomly assigned newly laid eggs to one of three water 
temperatures: hot, neutral, or cold. Hot duplicates the extra warmth provided by the 
mother python, and cold duplicates the absence of the mother. The two-way table 
summarizes the data on whether or not the eggs hatched.12 


Hatched? 


Water temperature 


Cold Neutral Hot Total 
Yes 129 
No 58 
Total Zt 56 104 187 


a. What percent of eggs were randomly assigned to hot water? 


b. What proportion of eggs in the study hatched? 


c. What percent of eggs in the study were randomly assigned to cold or neutral water and 


hatched? 


27. PQ 17 |») Asmash or a hit Refer to Exercise 23. 
a. What proportion of subjects who said they saw broken glass at the accident received the 
“hit” treatment? 


b. What percent of subjects who received the “smashed into” treatment said they did not 
see broken glass at the accident? 


28. Superpower Refer to Exercise 24. 


a. What proportion of students in the sample who prefer invisibility as their superpower 
are from the United States? 


b. What percent of students in the sample who are from the United Kingdom prefer 
superstrength as their superpower? 


29. pg 20 |») Asmash or a hit Refer to Exercise 23. 
a. Find the distribution of responses about whether there was broken glass at the accident 


for each of the three treatment groups. Make a segmented bar graph to compare these 
distributions. 


b. Describe what the graph in part (a) reveals about the association between response 
about broken glass at the accident and treatment received for the subjects in the study. 


30. Superpower Refer to Exercise 24. 


a. Find the distribution of superpower preference for the students in the sample from each 
country (i.e., the United States and the United Kingdom). Make a segmented bar graph 
to compare these distributions. 


b. Describe what the graph in part (a) reveals about the association between country of 
origin and superpower preference for the students in the sample. 


31. Body image Refer to Exercise 25. 


a. Of the respondents who felt that their body weight was about right, what proportion 
were female? 


b. Of the female respondents, what percent felt that their body weight was about right? 


c. The segmented bar graph displays the distribution of perceived body image by gender. 
Describe what this graph reveals about the association between these two variables for 
the 1200 college students in the sample. 


Body image 


| Underweight 
fe} Overweight 


BR About right 


Relative frequency (%) 


Female Male 


Gender 
32. Python eggs Refer to Exercise 26. 
a. Of the eggs that hatched, what proportion were randomly assigned to hot water? 
b. Of the eggs that were randomly assigned to hot water, what percent hatched? 


c. The segmented bar graph displays the distribution of hatching status by water 
temperature. Describe what this graph reveals about the association between these two 
variables for the python eggs in this experiment. 


Relative frequency (%) 


Cold Neutral Hot 
Water temperature 


33. Far from home A survey asked first-year college students, “How many miles is this 
college from your permanent home?” Students had to choose from the following options: 
5 or fewer, 6 to 10, 11 to 50, 51 to 100, 101 to 500, or more than 500. The side-by-side bar 
graph shows the percentage of students at public and private 4-year colleges who chose 
each option.t® Write a few sentences comparing the distributions of distance from home 
for students from private and public 4-year colleges who completed the survey. 


Type of university 
Private Public 


Percent 


Distance from home (mi) 


34. Popular car colors Favorite car colors may differ among countries. The side-by-side bar 
graph displays data on the most popular car colors in a recent year for North America and 
Asia. Write a few sentences comparing the distributions.12 


Region 
50 


Asia North America 


Percent 


& - sw S 
Ss BE PES SES SHAH PES 


Color 


35. Phone navigation The bar graph displays data on the percent of smartphone owners in 
several age groups who say that they use their phone for turn-by-turn navigation.22 


SSsS3d8 


Percent who use smartphone 
for navigation 
a 


18-29 30-49 50-64 65+ 
Age group (years) 


a. Describe what the graph reveals about the relationship between age group and use of 
smartphones for navigation. 


b. Would it be appropriate to make a pie chart of the data? Explain. 


36. Who goes to movies? The bar graph displays data on the percent of people in several age 
groups who attended a movie in the past 12 months. 


10 - 


Percent who attended a movie 
> 


18-24 25-34 35-44 45-54 55-64 65-74 75 
and older 


Age group (years) 


a. Describe what the graph reveals about the relationship between age group and movie 
attendance. 


b. Would it be appropriate to make a pie chart of the data? Explain. 


37. Marginal totals aren’t the whole story Here are the row and column totals for a two-way 


38. 


39. 


table with two rows and two columns: 
a b 50 
C d 50 


60 40 100 


Find two different sets of counts a, b, c, and d for the body of the table that give these 
same totals. This shows that the relationship between two variables cannot be obtained 
from the two individual distributions of the variables. 


Women and children first? Here’s another table that summarizes data on survival status 
by gender and class of travel on the Titanic: 


Class of travel 


First class Second class Third class 
Survival status Female Male Female Male Female Male 
Survived 
Died 


a. Find the distributions of survival status for males and for females within each class of 
travel. Did women survive the disaster at higher rates than men? Explain. 


b. In an earlier example, we noted that survival status is associated with class of travel. 
First-class passengers had the highest survival rate, while third-class passengers had the 
lowest survival rate. Does this same relationship hold for both males and females in all 
three classes of travel? Explain. 


Simpson’s paradox Accident victims are sometimes taken by helicopter from the 
accident scene to a hospital. Helicopters save time. Do they also save lives? The two-way 
table summarizes data from a sample of patients who were transported to the hospital by 


40. 


helicopter or by ambulance.22 


Method of transport 


Helicopter Ambulance Total 
Died 324 
Survival status Survived 976 
Total 200 1100 1300 
a. What percent of patients died with each method of transport? 
Here are the same data broken down by severity of accident: 
Serious accidents 
Method of transport 
Helicopter Ambulance Total 
Died 108 
Survival status Survived 92 
Total 100 100 200 
Less serious accidents 
Method of transport 
Died 
Survival status Survived 
Total 


b. Calculate the percent of patients who died with each method of transport for the serious 
accidents. Then calculate the percent of patients who died with each method of 
transport for the less serious accidents. What do you notice? 


c. See if you can explain how the result in part (a) is possible given the result in part (b). 


Note: This is an example of Simpson’s paradox, which states that an association between 
two variables that holds for each value of a third variable can be changed or even reversed 
when the data for all values of the third variable are combined. 


Multiple Choice Select the best answer for Exercises 40-43. 


For which of the following would it be inappropriate to display the data with a single pie 
chart? 


a. The distribution of car colors for vehicles purchased in the last month 


b. The distribution of unemployment percentages for each of the 50 states 


c. The distribution of favorite sport for a sample of 30 middle school students 


d. The distribution of shoe type worn by shoppers at a local mall 


e. The distribution of presidential candidate preference for voters in a state 


41. The following bar graph shows the distribution of favorite subject for a sample of 1000 
students. What is the most serious problem with the graph? 


42. 


d. 


e. 


280 


Number of students 


Math Science English Social Foreign Fine 
studies language arts 
Favorite subject 
The subjects are not listed in the correct order. 
This distribution should be displayed with a pie chart. 
The vertical axis should show the percent of students. 


The vertical axis should start at 0 rather than 100. 


The foreign language bar should be broken up by language. 


The Dallas Mavericks won the NBA championship in the 2010—2011 season. The two- 
way table displays the relationship between the outcome of each game in the regular 
season and whether the Mavericks scored at least 100 points. 


Points scored 


100 or more Fewer than 100 Total 
Outcome of game Win 57 
Loss Zo 
Total 82 


Which of the following is the best evidence that there is an association between the 
outcome of a game and whether or not the Mavericks scored at least 100 points? 


The Mavericks won 57 games and lost only 25 games. 


The Mavericks scored at least 100 points in 47 games and fewer than 100 points in only 
35 games. 


The Mavericks won 43 games when scoring at least 100 points and only 14 games 


when scoring fewer than 100 points. 


d. The Mavericks won a higher proportion of games when scoring at least 100 points 
(43/47) than when they scored fewer than 100 points (14/35). 


e. The combination of scoring 100 or more points and winning the game occurred more 
often (43 times) than any other combination of outcomes. 


43. The following partially completed two-way table shows the marginal distributions of 
gender and handedness for a sample of 100 high school students. 


Gender 
Male Female Total 
Right 90 
Dominant hand Left 10 
Total 40 60 100 


If there is no association between gender and handedness for the members of the sample, 
which of the following is the correct value of x? 


a. 20 
b. 30 
c. 36 
d. 45 
e. Impossible to determine without more information. 


Recycle and Review 


44. Hotels (Introduction) A high school lacrosse team is planning to go to Buffalo for a 
three-day tournament. The tournament’s sponsor provides a list of available hotels, along 
with some information about each hotel. The following table displays data about hotel 
options. Identify the individuals and variables in this data set. Classify each variable as 
categorical or quantitative. 


Exercise Internet Distance to Room Roomrate 
Hotel Pool room? (Siday) Restaurants’ site(mi) service? ($/day) 
Comfort Inn Out Y 0.00 1 8.2 Y 149 
Fairfield Inn & In Y 0.00 1 8.3 N 119 
Suites 
Baymont Inn & Out Y 0.00 sl auf Y 60 
Suites 
Chase Suite Out N 15.00 0 15 N 139 
Hotel 
Courtyard In 4 0.00 1 0.2 Dinner 114 


Hilton In Y 10.00 Zz 0.1 Y 156 


Marriott In Y 9.95 2 0.0 Y 145 


SECTION 1.2 Displaying Quantitative Data with Graphs 


LEARNING TARGETS By the end of the section, you should be able to: 
Make and interpret dotplots, stemplots, and histograms of quantitative data. 


Identify the shape of a distribution from a graph. 


Describe the overall pattern (Shape, center, and variability) of a distribution and 
identify any major departures from the pattern (outliers). 


Compare distributions of quantitative data using dotplots, stemplots, and 
histograms. 


To display the distribution of a categorical variable, use a bar graph or a pie chart. How can we 
picture the distribution of a quantitative variable? In this section, we present several types of 
graphs that can be used to display quantitative data. 


Dotplots 


One of the simplest graphs to construct and interpret is a dotplot. 


DEFINITION Dotplot 


A dotplot shows each data value as a dot above its location on a number line. 


Here are data on the number of goals scored in 20 games played by the 2016 U.S. women’s 
soccer team: 


55110 S20 123322142124 53 
Figure 1.5 shows a dotplot of these data. 
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Goals scored 


FIGURE 1.5 Dotplot of goals scored in 20 games by the 2016 U.S. women’s soccer team. 


It is fairly easy to make a dotplot by hand for small sets of quantitative data. 


HOW TO MAKE A DOTPLOT 


e Draw and label the axis. Draw a horizontal axis and put the name of the quantitative 


variable underneath. Be sure to include units of measurement. 


e Scale the axis. Look at the smallest and largest values in the data set. Start the horizontal 
axis at a convenient number equal to or less than the smallest value and place tick marks 
at equal intervals until you equal or exceed the largest value. 


e Plot the values. Mark a dot above the location on the horizontal axis corresponding to 
each data value. Try to make all the dots the same size and space them out equally as you 
stack them. 


Remember what we said in Section 1.1: Making a graph is not an end in itself. When you 
look at a graph, always ask, “What do I see?” From Figure 1.5, we see that the 2016 U.S. 
women’s soccer team scored 4 or more goals in 6/20 = 0.30 or 30% of its games. That’s quite 
an offense! Unfortunately, the team lost to Sweden on penalty kicks in the 2016 Summer 
Olympics. 


EXAMPLE | Give it some gas! (>) 
Making and interpreting dotplots 


Ann Heath 


PROBLEM: The Environmental Protection Agency (EPA) is in charge of determining and 
reporting fuel economy ratings for cars. To estimate fuel economy, the EPA performs tests 
on several vehicles of the same make, model, and year. Here are data on the highway fuel 
economy ratings for a sample of 25 model year 2018 Toyota 4Runners tested by the EPA: 


D2 A ee 2 A 2 ed eno ee a a, ee oe ee Oe OD 2a, 
22,0 22.4 226.9229 6 22,5) 22 A222 2 2 22 6 29 24 


a. Make a dotplot of these data. 
b. Toyota reports the highway gas mileage of its 2018 model year 4Runners as 22 mpg. Do 


these data give the EPA sufficient reason to investigate that claim? 
SOLUTION: 


a. 


a 
as 42.0 22.5 23.0 43.5 
Highway fuel economy rating (mpg) 


To make the dotplot: 


e Draw and label the axis. Note variable name and units in the label. 


e Scale the axis. The smallest value is 21.5 and the largest value is 23.3. So we choose 
a scale from 21.5 to 23.5 with tick marks 0.1 units apart. 


e Plot the values. 


b. No. 23 of the 25 cars tested had an estimated highway fuel economy of 22 mpg OT 


fa) reater. 
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Describing Shape 


When you describe the shape of a dotplot or another graph of quantitative data, focus on the 
main features. Look for major peaks, not for minor ups and downs in the graph. Look for 
clusters of values and obvious gaps. Decide if the distribution is roughly symmetric or clearly 
skewed. 


DEFINITION Symmetric and skewed distributions 


A distribution is roughly symmetric if the right side of the graph (containing the 
half of observations with the largest values) is approximately a mirror image of the 
left side. 


Roughly symmetric 


Skewed to the right 


Skewed to the left 


A distribution is skewed to the right if the right side of the graph is much longer 
than the left side. 


A distribution is skewed to the left if the left side of the graph is much longer than 
the right side. 


We could also describe a distribution with a long tail to the left as “skewed toward negative 
values” or “negatively skewed” and a distribution with a long right tail as “positively skewed.” 
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For ease, we sometimes say “left-skewed” instead of “skewed to the left” and “right- 
skewed” instead of “skewed to the right.” Oo The direction of skewness is toward the long 
tail, not the direction where most observations are clustered. The drawing is a cute but 


corny way to help you keep this straight. To avoid danger, Mr. Starnes skis on the gentler slope 
—in the direction of the skewness. 


EXAMPLE | Quiz scores and die rolls (>) 
Describing shape 


Malerapaso/Getty Images 


PROBLEM: The dotplots display two different sets of quantitative data. Graph (a) shows 
the scores of 21 statistics students on a 20-point quiz. Graph (b) shows the results of 100 
rolls of a 6-sided die. Describe the shape of each distribution. 
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SOLUTION: 


a. The distribution of statistics quiz scores 1s skewed to the left, with a single peak at 20 (a 
perfect score). There are two small gaps at 12 and 16. 


b. The distribution of die rolls is roughly symmetric. It has no clear peak. 


We can describe the shape of the distribution in part (b) as “approximately uniform” 


because the frequencies are about the same for all possible rolls. 
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Some people refer to graphs with a single peak as unimodal, to graphs with two peaks as 
bimodal, and to graphs with more than two clear peaks as multimodal. 


Some quantitative variables have distributions with easily described shapes. But many 
distributions have irregular shapes that are neither symmetric nor skewed. Some distributions 
show other patterns, like the dotplot in Figure 1.6. This graph shows the durations (in minutes) 
of 220 eruptions of the Old Faithful geyser. The dotplot has two distinct clusters and two peaks: 
one at about 2 minutes and one at about 4.5 minutes. When you examine a graph of quantitative 
data, describe any pattern you see as clearly as you can. 


15 2.0 Mea 3.0 3.5 4.0 45 5.0 
Duration (min) 


FIGURE 1.6 Dotplot displaying duration (in minutes) of 220 Old Faithful eruptions. This 
graph has two distinct clusters and two clear peaks. 


Some quantitative variables have distributions with predictable shapes. Many biological 
measurements on individuals from the same species and gender—lengths of bird bills, heights 
of young women—have roughly symmetric distributions. Salaries and home prices, on the 
other hand, usually have right-skewed distributions. There are many moderately priced houses, 
for example, but the few very expensive mansions give the distribution of house prices a strong 
right skew. 


CHECK YOUR UNDERSTANDING 


i. 


y Knoebels Amusement Park in Elysburg, Pennsylvania, has earned acclaim for being an 
affordable, family-friendly entertainment venue. Knoebels does not charge for general 
admission or parking, but it does charge customers for each ride they take. How much do the 
rides cost at Knoebels? The table shows the cost for each ride in a sample of 22 rides ina 
recent year. 


Name Cost 


Merry Mixer $1.50 


Italian Trapeze $1.50 
Satellite $1.50 
Galleon $1.50 
Whipper $1225 
Skooters $175 
Ribbit $1.25 
Roundup $1.50 
Paradrop $1225 
The Phoenix $2.50 
Gasoline Alley $1.75 
Looper $1.75 
Flying Turns $3.00 
Flyer $1.50 
The Haunted Mansion $1.75 
StratosFear $2.00 
Twister $2.50 
Cosmotron $1.75 
Paratrooper $1.50 
Downdraft $1.50 
Rockin’ Tug $1725 
Sklooosh! $175 


1. Make a dotplot of the data. 
2. Describe the shape of the distribution. 


Describing Distributions 


Here is a general strategy for describing a distribution of quantitative data. 


HOW TO DESCRIBE THE DISTRIBUTION OF A QUANTITATIVE 
VARIABLE 


In any graph, look for the overall pattern and for clear departures from that pattern. 


e You can describe the overall pattern of a distribution by its shape, center, and 
variability. 


e An important kind of departure is an outlier, an observation that falls outside the overall 
pattern. 


Variability is sometimes referred to as spread. We prefer variability because students sometimes 


think that spread refers only to the distance between the maximum and minimum value of a 
quantitative data set (the range). There are several ways to measure the variability (spread) of a 
distribution, including the range. 


AP® EXAM TIP 


Always be sure to include context when you are asked to describe a distribution. 
This means using the variable name, not just the units the variable is measured in. 


We will discuss more formal ways to measure center and variability and to identify outliers 
in Section 1.3. For now, just use the median (middle value in the ordered data set) when 
describing center and the minimum and maximum when describing variability. 


Let’s practice with the dotplot of goals scored in 20 games played by the 2016 U.S. 
women’s soccer team. 


° ° 
e _ 
° ° ° ° ° 
eS a | Ce | 
0 1 2 3 4 5 6 7 8 9 10 


Goals scored 


When describing a distribution of quantitative data, don’t forget: Statistical Opinions Can Vary 


(Shape, Outliers, Center, Variability). 


Shape: The distribution of goals scored is skewed to the right, with a single peak at 1 
goal. There is a gap between 5 and 9 goals. 


Outliers: | The games when the team scored 9 and 10 goals appear to be outliers. 
Center: The median is 2 goals scored. 


Variability: The data vary from 1 to 10 goals scored. 


EXAMPLE | Give it some gas! (>) 
Describing a distribution 


Daren Starnes 


PROBLEM: Here is a dotplot of the highway fuel economy ratings for a sample of 25 
model year 2018 Toyota 4Runners tested by the EPA. Describe the distribution. 


.. a ee ° 
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Highway fuel economy rating (mpg) 


SOLUTION: 


S hape: The distribution of highway fuel economy ratings Is roughly symmetric, 
with a single peak at 22.4 mpg. There are two clear gaps: between 21.5 and 
21.9 mpg and between 22.9 and 23.3 mpg. 


Outliers: The cars with 21.5 mpg and 23.3 mpg ratings are possible outliers. 
Center: The median rating {S22 2p mpg. 


Variability: The ratings vary from 21.5 to 23.3 mpg. 
4) oe es) UH8) 


Be sure to include context by discussing the variable of interest, highway fuel 


economy ratings. And give the units of measurement: miles per gallon (mpg). 
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Comparing Distributions 


Some of the most interesting statistics questions involve comparing two or more groups. Which 
of two popular diets leads to greater long-term weight loss? Who texts more—males or 
females? As the following example suggests, you should always discuss shape, outliers, center, 
and variability whenever you compare distributions of a quantitative variable. 


EXAMPLE | Household size: U.K. versus South Africa (>) 
Comparing distributions 


FrankvandenBergh/Getty Images 


PROBLEM: How do the numbers of people living in households in the United Kingdom 
(U.K.) and South Africa compare? To help answer this question, we used Census At 
School’s “Random Data Selector” to choose 50 students from each country. Here are 
dotplots of the household sizes reported by the survey respondents. Compare the 
distributions of household size for these two countries. 


South Africa 


Household size 


Country 


UK 
eecece 
Seecceccesceeeceececs 


0 5 10 15 20 25 30 
Household size 


AP® EXAM TIP 


When comparing distributions of quantitative data, it’s not enough just to list 


values for the center and variability of each distribution. You have to explicitly 
compare these values, using words like “greater than,” “less than,” or “about the 
same as.” 


SOLUTION: 


S hape: The distribution of household size for the UK. sample is roughly symmetric, 
with a single peak at 4 people. The distribution of household size for the 
South Africa sample is skewed to the right, with a single peak at 4 people and 
a clear gap between 15 and 26. 


Outliers: There don’t appear to be any outliers in the U.K. distribution. The South 
African distribution seems to have two outliers: the households with 15 and 


26 people. 
Center: Household sizes for the South African students tend to be larger (median = 6 
people) than for the U.K. students (median = 4 people). 


Variability: The household sizes for the South African students vary more (from 3 to 26 
people) than for the U.K. students (from 2to6 people). 


Don't forget to include context! It isn’t enough to refer to the U.K. distribution or the 


South Africa distribution. You need to mention the variable of interest, household size. 
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Notice that in the preceding example, we discussed the distributions of household size only 
for the two samples of 50 students. We might be interested in whether the sample data give us 
convincing evidence of a difference in the population distributions of household size for South 
Africa and the United Kingdom. We’Il have to wait a few chapters to decide whether we can 
reach such a conclusion, but our ability to make such an inference later will be helped by the 
fact that the students in our samples were chosen at random. 


CHECK YOUR UNDERSTANDING 


6 


y For a Statistics class project, Jonathan and Crystal hosted an ice-cream-eating contest. 
Each student in the contest was given a small cup of ice cream and instructed to eat it as fast 
as possible. Jonathan and Crystal then recorded each contestant’s gender and time (in 
seconds), as shown in the dotplots. Compare the distributions of eating times for males and 


females. 


Female 


Gender 


Time to eat ice cream (sec) 


Stemplots 


Another simple type of graph for displaying quantitative data is a stemplot. 


DEFINITION Stemplot 


A stemplot shows each data value separated into two parts: a stem, which 
consists of all but the final digit, and a /eaf, the final digit. The stems are ordered 
from lowest to highest and arranged in a vertical column. The leaves are arranged 
in increasing order out from the appropriate stems. 


A stemplot is also known as a stem-and-leaf plot. 


Here are data on the resting pulse rates (beats per minute) of 19 middle school students: 


71 104 76 88 78 71 68 86 70 90 74 76 69 68 88 96 68 82 120 


Figure 1.7 shows a stemplot of these data. 


6| 8889 Kev: 812i 

710116668 | sient whose 

9106 resting pulse 
10/4 rate is 82 beats 
11 per minute, 
12/0 


FIGURE 1.7 Stemplot of the resting pulse rates of 19 middle school students. 

According to the American Heart Association, a resting pulse rate above 100 beats per 
minute is considered high for this age group. We can see that 2/19 = 0.105 = 10.5% of these 
students have high resting pulse rates by this standard. Also, the distribution of pulse rates for 
these 19 students is skewed to the right (toward the larger values). 

Stemplots give us a quick picture of a distribution that includes the individual observations 
in the graph. It is fairly easy to make a stemplot by hand for small sets of quantitative data. 


HOW TO MAKE A STEMPLOT 


Make stems. Separate each observation into a stem, consisting of all but the final digit, 
and a leaf, the final digit. Write the stems in a vertical column with the smallest at the top. 
Draw a vertical line at the right of this column. Do not skip any stems, even if there is no 


data value for a particular stem. 
Add leaves. Write each leaf in the row to the right of its stem. 
Order leaves. Arrange the leaves in increasing order out from the stem. 


Add a key. Provide a key that identifies the variable and explains what the stems and 
leaves represent. 


EXAMPLE | Wear your helmets! (>) 
Making and interpreting stemplots 


Pete Saloutos/AGE Fotostock 


PROBLEM: Many athletes (and their parents) worry about the risk of concussions when 
playing sports. A football coach plans to obtain specially made helmets for his players that 
are designed to reduce the chance of getting a concussion. Here are the measurements of 
head circumference (in inches) for the 30 players on the team: 


230° 22,2 21-7 2240) 22.3 22.6 22.7 2lo 22.7 25:6 20:8 230) 242° 23.5 20:8 
24.00 22.79 22-6) 25.018 22.0 8231) 21.9 2210) 20 428 5 220 23.9 23 4216 23.3 


a. Make a stemplot of these data. 


b. Describe the shape of the distribution. Are there any obvious outliers? 


SOLUTION: 
a. 20 | 88 
41 | 05679 


22 | OA34SS566F FF 
43 | 001345599 
24 | 02 

a5| 6 


| of 23.5 inches. 


| Key: 2315 is a 


player with 
ahead 
circumference 


To make the stemplot: 


e Make stems. The smallest head circumference is 20.8 inches and the largest is 
25.6 inches. We use the first two digits as the stem and the final digit as the leaf. 
So we need stems from 20 to 25. 


e Add leaves. 
e Order leaves. 


e Add a key. 


b. The distribution of head circumferences for the 30 players on the team is roughly 


symmetric, with a single peak on the 22-inch stem. There are no obvious outliers. 
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We can get a better picture of the head circumference data by splitting stems. In Figure 
1.8(a), leaf values from 0 to 9 are placed on the same stem. Figure 1.8(b) shows another 
stemplot of the same data. This time, values with leaves from 0 to 4 are placed on one stem, 
while those with leaves from 5 to 9 are placed on another stem. Now we can see the shape of 
the distribution even more clearly—including the possible outlier at 25.6 inches. 


20|88 20/88 
21|05679 21/0 
22102345566777 Key: 23|5 is a player with a head 2115679 
23)}001345599 circumference of 23.5 inches. 2210234 
24/02 , 22|5566777 
2516 23)}00134 

23)5599 

24)02 

24 

25 

25|6 


(a) 


(b) 


FIGURE 1.8 Two stemplots showing the head circumference data. The graph in (b) 
improves on the graph in (a) by splitting stems. 


Here are a few tips to consider before making a stemplot: 


e There is no magic number of stems to use. Too few or too many stems will make it difficult 
to see the distribution’s shape. Five stems is a good minimum. 


e If you split stems, be sure that each stem is assigned an equal number of possible leaf digits. 


e When the data have too many digits, you can get more flexibility by rounding or truncating 
the data. See Exercises 61 and 62 for an illustration of rounding data before making a 
stemplot. 


You can use a back-to-back stemplot with common stems to compare the distribution of a 
quantitative variable in two groups. The leaves are placed in order on each side of the common 
stem. For example, Figure 1.9 shows a back-to-back stemplot of the 19 middle school students’ 
resting pulse rates and their pulse rates after 5 minutes of running. 

Resting After exercise 


9888 6 
8664110 7 
8862 8| 6788 
60 9} 02245899 
4110) 044 Key: 8|2 is a student 
11} 8 whose pulse rate is 
0112] 44 82 beats per minute. 


141 6 


FIGURE 1.9 Back-to-back stemplot of 19 middle school students’ resting pulse rates and 
their pulse rates after 5 minutes of running. 


CHECK YOUR UNDERSTANDING 


i. 
v 


1. Write a few sentences comparing the distributions of resting and after-exercise pulse rates 
in Figure 1.9. 


Multiple Choice: Select the best answer for Questions 2-4. 


Here is a stemplot of the percent of residents aged 65 and older in the 50 states and the 
District of Columbia: 


6 8 Key: 8/8 represents a state 
if in which 8.8% of residents 
é 8 are 65 and older. 
9179 

10 | 08 

11 15566 


12 | 012223444457888999 
13 | 01233333444899 


14 | 02666 
15 | 23 
1618 


2. The low outlier is Alaska. What percent of Alaska residents are 65 or older? 


a. 
b. 
C. 
d. 


e. 


0.68 
6.8 
8.8 
16.8 
68 


3. Ignoring the outlier, the shape of the distribution is 


a. 
b. 


skewed to the right. 
skewed to the left. 


c. skewed to the middle. 


d. 


e. 


double-peaked. 


roughly symmetric. 


4. The center of the distribution is close to 


a. 
b. 
C, 
d. 


e. 


11.6%. 
12.0%. 
12.8%. 
13.3%. 
6.8% to 16.8%. 


Histograms 


You can use a dotplot or stemplot to display quantitative data. Both graphs show every 
individual data value. For large data sets, this can make it difficult to see the overall pattern in 
the graph. We often get a clearer picture of the distribution by grouping together nearby values. 
Doing so allows us to make a new type of graph: a histogram. 


DEFINITION Histogram 


A histogram shows each interval of values as a bar. The heights of the bars 
show the frequencies or relative frequencies of values in each interval. 


Figure 1.10 shows a dotplot and a histogram of the durations (in minutes) of 220 eruptions 
of the Old Faithful geyser. Notice how the histogram groups together nearby values. 
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FIGURE 1.10 (a) Dotplot and (b) histogram of the duration (in minutes) of 220 eruptions of 
the Old Faithful geyser. 


It is fairly easy to make a histogram by hand. Here’s how you do it. 


HOW TO MAKE A HISTOGRAM 


Choose equal-width intervals that span the data. Five intervals is a good minimum. 


Make a table that shows the frequency (count) or relative frequency (percent or 
proportion) of individuals in each interval. Put values that fall on an interval boundary in 
the interval containing larger values. 


Draw and label the axes. Draw horizontal and vertical axes. Put the name of the 
quantitative variable under the horizontal axis. To the left of the vertical axis, indicate 
whether the graph shows the frequency (count) or relative frequency (percent or 
proportion) of individuals in each interval. 


Scale the axes. Place equally spaced tick marks at the smallest value in each interval 
along the horizontal axis. On the vertical axis, start at 0 and place equally spaced tick 
marks until you exceed the largest frequency or relative frequency in any interval. 


Draw bars above the intervals. Make the bars equal in width and leave no gaps between 
them. Be sure that the height of each bar corresponds to the frequency or relative 
frequency of individuals in that interval. An interval with no data values will appear as a 
bar of height 0 on the graph. 


It is possible to choose intervals of unequal widths when making a histogram. Such graphs 
are beyond the scope of this book. 


EXAMPLE | How much tax? (>) 
Making and interpreting histograms 


PROBLEM: Sales tax rates vary widely across the United States. Four states charge no state 
or local sales tax: Delaware, Montana, New Hampshire, and Oregon. The table shows data 
on the average total tax rate for each of the remaining 46 states and the District of 
Columbia.22 


State Tax rate (%) 
Alabama 9.0 
Alaska 1.8 
Arizona 8.3 
Arkansas See 
California 8.5 
Colorado 7.5 
Connecticut 6.4 
Florida 6.7 
Georgia 7.0 
Hawaii 4.4 
Idaho 6.0 
Illinois 8.6 
Indiana 7.0 
lowa 6.8 
Kansas 8.6 
Kentucky 6.0 
Louisiana 9.0 
Maine 5:5 
Maryland 6.0 
Massachusetts 6.3 
Michigan 6.0 
Minnesota eo 
Mississippi feu' 
Missouri 7.9 
Nebraska 6.9 
Nevada 8.0 
New Jersey 7.0 
New Mexico Tins 


New York 8.5 


North Carolina 6.9 


North Dakota 6.8 
Ohio Taal 
Oklahoma 8.8 
Pennsylvania 6.3 
Rhode Island 7.0 
South Carolina We 
South Dakota 5.8 
Tennessee 9.5 
Texas 8.2 
Utah 6.7 
Vermont 6.2 
Virginia 5.6 
Washington 8.9 
West Virginia 6.2 
Wisconsin 5.4 
Wyoming 5.4 
District of Columbia 5.8 


a. Make a frequency histogram to display the data. 


b. What percent of values in the distribution are less than 6.0? Interpret this result in context. 


SOLUTION: 


Number of states 


Tax rate (%) 
Interval Frequency 
1060 2.0 1 
2.0 to < 3.0 Oo 
3.0 to < 4.0 Oo 
4.0 to < 5.0 fl 
5.0 to < 6.0 6 


6.0 to < 7.0 15 


7.0 to < 8.0 11 
8.0 to < 9.0 9 


9.0 to < 10.0 4 


To make the histogram: 


e Choose equal-width intervals that span the data. The data vary from 1.8 
percent to 9.5 percent. So we choose intervals of width 1.0, starting at 1.0%. 


Make a table. Record the number of states in each interval to make a frequency 
histogram. 


Draw and label the axes. Don’t forget units (percent) for the variable (tax rate). 
Scale the axes. 


Draw bars. 


b. 8/47 = (0170 = 17.0%, 17% of the states (including the District of Columbia) have tax 
rates less than 6%. 
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Figure 1.11 shows two different histograms of the state sales tax data. Graph (a) uses the 
intervals of width 1% from the preceding example. The distribution has a single peak in the 6.0 
to <7.0 interval. Graph (b) uses intervals half as wide: 1.0 to <1.5, 1.5 to <2.0, and so on. Now 
we see a distribution with more than one distinct peak. ww The choice of intervals in a 


histogram can affect the appearance of a distribution. Histograms with more intervals show 
more detail but may have a less clear overall pattern. 
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FIGURE 1.11 (a) Frequency histogram of the sales tax rate in the states that have local or 
state sales taxes and the District of Columbia with intervals of width 1.0%, from the 
preceding example. (b) Frequency histogram of the data with intervals of width 0.5%. 


You can use a graphing calculator, statistical software, or an applet to make a histogram. 
The technology’s default choice of intervals is a good starting point, but you should adjust the 
intervals to fit with common sense. 


2. Technology Corner | MAKING HISTOGRAMS (> 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


1. Enter the data from the sales tax example in your Statistics/List Editor. 


e Press|STAT| and choose Edit... 


e Type the values into list L1. 


NORMAL FLOAT AUTO REAL RADIAN MP 


<= NOCVIVIQO@G eo 


9 
1 
8 
9 
8 
? 
6 
6 
? 
4 
6 


Lac(wy=9 


2. Set up a histogram in the Statistics Plots menu. 


° Press (STAT PLOT). 
e Press or |1| to go into Plot1. 


e Adjust the settings as shown. 


NORMAL FLOAT AUTO REAL RADIAN MP 1 


iacaes Plot2 Plot3 


3. Use ZoomStat to let the calculator choose intervals and make a histogram. 


e Press|ZOOM| and choose ZoomStat. 
e Press| TRACE| to examine the intervals. 


NORMAL FLOAT AUTO REAL RADIAN MP fy 


Ploti:La 


min=1.8 
mMax<2.9 


4. Adjust the intervals to match those in Figure 1.11(a), and then graph the histogram. 


e Press and enter the values shown for Xmin, Xmax, Xscl, Ymin, Ymax, and 
Yscl. 


¢ Press |GRAPH|. 
e Press| TRACE| to examine the intervals. 


NORMAL FLOAT AUTO REAL RADIAN MP 


ox=0.04166666666666 
TraceStep=@0. 083333333333... 


5. See if you can match the histogram in Figure 1.11(b). 


NORMAL FLOAT AUTO REAL RADIAN MP 1 
Ploti:La 


AP® EXAM TIP 


If you’re asked to make a graph on a free-response question, be sure to label 
and scale your axes. Unless your calculator shows labels and scaling, don’t just 
transfer a calculator screen shot to your paper. 


CHECK YOUR UNDERSTANDING 


q 


y Many people believe that the distribution of IQ scores follows a “bell curve,” like the 


one shown. But is this really how such scores are distributed? The IQ scores of 60 fifth-grade 


students chosen at random from one school are shown here.24 


145 139 126 22 125 130 96 110 118 118 
101 142 134 124 112 109 134 113 81 113 
123 94 100 136 109 131 117 110 127 124 
106 124 115 133 116 102 127 117 109 3/ 
117 90 103 114 139 101 122 105 97 89 
102 108 110 128 114 1 114 102 82 101 


1. Construct a histogram that displays the distribution of IQ scores effectively. 


2. Describe what you see. Is the distribution bell-shaped? 


Using Histograms Wisely 


We offer several cautions based on common mistakes students make when using histograms. 


, og Don’t confuse histograms and bar graphs. Although histograms resemble bar graphs, 


their details and uses are different. A histogram displays the distribution of a quantitative 
variable. Its horizontal axis identifies intervals of values that the variable takes. A bar graph 
displays the distribution of a categorical variable. Its horizontal axis identifies the categories. 
Be sure to draw bar graphs with blank space between the bars to separate the categories. 
Draw histograms with no space between bars for adjacent intervals. For comparison, here is 


one of each type of graph from earlier examples: 


Histogram Bar graph 
16 40 
14 
12 30 
> 
5 10 5 
z 8 = 20 
a 
i 6 
4 10 
2 
0 
1 2 3 Cell phone Facebook — In person Text 
= Messaging 
Tax rate (%) Preferred communication method 


s @ Use percents or proportions instead of counts on the vertical axis when comparing 


distributions with different numbers of observations. Mary was interested in comparing 
the reading levels of a biology journal and an airline magazine. She counted the number of 
letters in the first 400 words of an article in the journal and of the first 100 words of an 
article in the airline magazine. Mary then used statistical software to produce the histograms 
shown in Figure 1.12(a). This figure is misleading—it compares frequencies, but the two 
samples were of very different sizes (400 and 100). Using the same data, Mary’s teacher 
produced the histograms in Figure 1.12(b). By using relative frequencies, this figure makes 
the comparison of word lengths in the two samples much easier. 

Journal Magazine Journal Magazine 


90 30 


Frequency 
Percent 
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(a) Word length Word length (b) Word length Word length 


FIGURE 1.12 Two sets of histograms comparing word lengths in articles from a biology 
journal and from an airline magazine. In graph (a), the vertical scale uses frequencies. 


Graph (b) fixes the problem of different sample sizes by using percents (relative 
frequencies) on the vertical scale. 


3, o@ Just because a graph looks nice doesn’t make it a meaningful display of data. The 


15 students in a small statistics class recorded the number of letters in their first names. One 
student entered the data into an Excel spreadsheet and then used Excel’s “chart maker” to 
produce the graph shown on the left. What kind of graph is this? It’s a bar graph that 


compares the raw data values. But first-name length is a quantitative variable, so a bar graph 
is not an appropriate way to display its distribution. The histogram on the right is a much 
better choice because the graph makes it easier to identify the shape, center, and variability 
of the distribution of name length. 


First name length 
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CHECK YOUR UNDERSTANDING 


A 
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1. Write a few sentences comparing the distributions of word length shown in Figure 1.12(b). 


Questions 2 and 3 refer to the following setting. About 3 million first-year students enroll in 


colleges and universities each year. What do they plan to study? The graph displays data on 


the percent of first-year students who plan to major in several disciplines.22 


20 


Percent of students who plan to major 


Field of study 


2. Is this a bar graph or a histogram? Explain. 


3. Would it be correct to describe this distribution as right-skewed? Why or why not? 


Section 1.2 Summary 


e You can use a dotplot, stemplot, or histogram to show the distribution of a quantitative 
variable. A dotplot displays individual values on a number line. Stemplots separate each 
observation into a stem and a one-digit leaf. Histograms plot the frequencies (counts) or 
relative frequencies (proportions or percents) of values in equal-width intervals. 


e Some distributions have simple shapes, such as symmetric, skewed to the left, or skewed to 
the right. The number of peaks is another aspect of overall shape. So are distinct clusters 
and gaps. 


e When examining any graph of quantitative data, look for an overall pattern and for clear 
departures from that pattern. Shape, center, and variability describe the overall pattern of 
the distribution of a quantitative variable. Outliers are observations that lie outside the 
overall pattern of a distribution. 


e When comparing distributions of quantitative data, be sure to compare shape, center, 
variability, and possible outliers. 


e Remember: histograms are for quantitative data; bar graphs are for categorical data. Be sure 
to use relative frequencies when comparing data sets of different sizes. 


1.2 Technology Corner | > 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


2. Making histograms 


Section 1.2 Exercises 


45. pg 31 |») Feeling sleepy? Students in a high school statistics class responded to a 
survey designed by their teacher. One of the survey questions was “How much sleep did 
you get last night?” Here are the data (in hours): 


S) 6 8 7 8 8 6 6.5 7 7 9.0 4 3 4 
5 6 a1. 6 3 7 6 10.0 7 8 4.5 9 7 7 


a. Make a dotplot to display the data. 


b. Experts recommend that high school students sleep at least 9 hours per night. What 
proportion of students in this class got the recommended amount of sleep? 


46. Easy reading? Here are data on the lengths of the first 25 words on a randomly selected 
page from Toni Morrison’s Song of Solomon: 


a. Make a dotplot of these data. 


b. Long words can make a book hard to read. What percentage of words in the sample 
have 8 or more letters? 


47. U.S. women’s soccer—2016 Earlier, we examined data on the number of goals scored by 
the 2016 U.S. women’s soccer team in 20 games played. The following dotplot displays 
the goal differential for those same games, computed as U.S. goals scored minus opponent 
goals scored. 


Difference (U.S. — Opponent) in goals scored 
a. Explain what the dot above 3 represents. 


b. What does the graph tell us about how well the team did in 2016? Be specific. 


48. Fuel efficiency The dotplot shows the difference (Highway — City) in EPA mileage 
ratings, in miles per gallon (mpg) for each of 24 model year 2018 cars. 
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Difference (Highway — City) in EPA mileage rating (mpg) 
a. Explain what the dot above —3 represents. 


b. What does the graph tell us about fuel economy in the city versus on the highway for 
these car models? Be specific. 


49. pg 33 4) Getting older How old is the oldest person you know? Prudential Insurance 


Company asked 400 people to place a blue sticker on a huge wall next to the age of the 
oldest person they have ever known. An image of the graph is shown here. Describe the 
shape of the distribution. 


50. Pair-a-dice The dotplot shows the results of rolling a pair of fair, six-sided dice and 
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59. 


56. 


57. 


finding the sum of the up-faces 100 times. Describe the shape of the distribution. 
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Sum of the up-faces on a pair of dice 
Feeling sleepy? Refer to Exercise 45. Describe the shape of the distribution. 
Easy reading? Refer to Exercise 46. Describe the shape of the distribution. 


pg 35 |») U.S. women’s soccer—2016 Refer to Exercise 47. Describe the distribution. 


Fuel efficiency Refer to Exercise 48. Describe the distribution. 


pg 36 Making money The parallel dotplots show the total family income of 
Nie] 


randomly chosen individuals from Indiana (38 individuals) and New Jersey (44 
individuals). Compare the distributions of total family incomes in these two samples. 


seen 


Indiana 


New Jersey - 7 , 
0 25 50 75 100 125 150 175 
Total family income ($1000s) 


Healthy streams Nitrates are organic compounds that are a main ingredient in fertilizers. 
When those fertilizers run off into streams, the nitrates can have a toxic effect on fish. An 
ecologist studying nitrate pollution in two streams measures nitrate concentrations at 42 

places on Stony Brook and 42 places on Mill Brook. The parallel dotplots display the data. 
Compare the distributions of nitrate concentration in these two streams. 


Stony Brook : 
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Nitrate concentration (mg/L) 


Enhancing creativity Do external rewards—things like money, praise, fame, and grades 
—promote creativity? Researcher Teresa Amabile recruited 47 experienced creative 
writers who were college students and divided them at random into two groups. The 
students in one group were given a list of statements about external reasons (E) for 
writing, such as public recognition, making money, or pleasing their parents. Students in 


the other group were given a list of statements about internal reasons (I) for writing, such 
as expressing yourself and enjoying wordplay. Both groups were then instructed to write a 
poem about laughter. Each student’s poem was rated separately by 12 different poets 


using a creativity scale.2® These ratings were averaged to obtain an overall creativity score 
for each poem. Parallel dotplots of the two groups’ creativity scores are shown here. 
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a. Is the variability in creativity scores similar or different for the two groups? Justify your 


answer. 


b. Do the data suggest that external rewards promote creativity? Justify your answer. 


58. Healthy cereal? Researchers collected data on 76 brands of cereal at a local 


supermarket.22 For each brand, the sugar content (grams per serving) and the shelf in the 
store on which the cereal was located (1 = bottom, 2 = middle, 3 = top) were recorded. A 


dotplot of the data is shown here. 


Shelf 
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Sugar content (grams per serving) 


a. Is the variability in sugar content of the cereals on the three shelves similar or different? 


Justify your answer. 


b. Critics claim that supermarkets tend to put sugary cereals where kids can see them. Do 
the data from this study support this claim? Justify your answer. (Note that Shelf 2 is at 
about eye level for kids in most supermarkets.) 


59. pg 38 |p) Snickers® are fun! Here are the weights (in grams) of 17 Snickers Fun Size 


bars from a single bag: 


17.1 17.4 16.6 


17.4 


17.7 17.1 17.3 17.7 17.8 


19:2 16.0 15.9 16.5 16.8 16.5 17.1 16.7 


a. Make a stemplot of these data. 
b. What interesting feature does the graph reveal? 


c. The advertised weight of a Snickers Fun Size bar is 17 grams. What proportion of 
candy bars in this sample weigh less than advertised? 


60. Eat your beans! Beans and other legumes are a great source of protein. The following 


data give the protein content of 30 different varieties of beans, in grams per 100 grams of 


cooked beans.22 


7.5 8.2 8.9 9.3 7.1 8.3 8.7 9.5 8.2 9.1 
9.0 9.0 O.7 g2 8.9 8.1 9.0 7.8 8.0 7.8 
7.0 15 13.5 8.3 6.8 10.6 8.3 7.6 7.7 S.A 


a. Make a stemplot of these data. 
b. What interesting feature does the graph reveal? 


c. What proportion of these bean varieties contain more than 9 grams of protein per 100 
grams of cooked beans? 


61. South Carolina counties Here is a stemplot of the areas of the 46 counties in South 
Carolina. Note that the data have been rounded to the nearest 10 square miles (mi?). 


3} 9999 

4} 0116689 

5 | 01115566778 

6| 47899 

7|01245579 | 

8} 0011 | Key: 6|4 represents 
9/13 a county with an area 
10/8 of 635 to 644.99 
11 | 233 | square miles, 
12) 2 


a. What is the area of the largest South Carolina county? 


b. Describe the distribution of area for the 46 South Carolina counties. 


62. Shopping spree The stemplot displays data on the amount spent by 50 shoppers at a 
grocery store. Note that the values have been rounded to the nearest dollar. 
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a. What was the smallest amount spent by any of the shoppers? 


b. Describe the distribution of amount spent by these 50 shoppers. 


63. Where do the young live? Here is a stemplot of the percent of residents aged 25 to 34 in 


each of the 50 states: 


11 | 44 
11 | 66778 
12 | 0134 
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13 | 0000001111444 


13 | 7788999 
14 | 0044 

14 | 567 

AD! | TA 

15 

16] 0 


a. Why did we split stems? 
b. Give an appropriate key for this stemplot. 


c. Describe the shape of the distribution. Are there any outliers? 


64. Watch that caffeine! The U.S. Food and Drug Adminis tration (USFDA) limits the 


65. 


amount of caffeine in a 12-ounce can of carbonated beverage to 72 milligrams. That 
translates to a maximum of 48 milligrams of caffeine per 8-ounce serving. Data on the 
caffeine content of popular soft drinks (in milligrams per 8-ounce serving) are displayed in 
the stemplot. 
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a. Why did we split stems? 
b. Give an appropriate key for this graph. 


c. Describe the shape of the distribution. Are there any outliers? 


Acorns and oak trees Of the many species of oak trees in the United States, 28 grow on 
the Atlantic Coast and 11 grow in California. The back-to-back stemplot displays data on 
the average volume of acorns (in cubic centimeters) for these 39 oak species.22 Write a 
few sentences comparing the distributions of acorn size for the oak trees in these two 
regions. 
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Key: 2/6 = An oak species 


whose acorn volume 


66. Who studies more? Researchers asked the students in a large first-year college class how 
many minutes they studied on a typical weeknight. The back-to-back stemplot displays the 
responses from random samples of 30 women and 30 men from the class, rounded to the 
nearest 10 minutes. Write a few sentences comparing the male and female distributions of 


study time. 
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Key: 2|3 = 230 minutes 


67. pg 42 |») Carbon dioxide emissions Burning fuels in power plants and motor vehicles 


emits carbon dioxide (CO>), which contributes to global warming. The table displays CO» 


emissions per person from countries with populations of at least 20 million.=~ 


30 


a. Make a histogram of the data using intervals of width 2, starting at 0. 


b. Describe the shape of the distribution. Which countries appear to be outliers? 


Country 
Algeria 
Argentina 
Australia 
Bangladesh 
Brazil 
Canada 
China 
Colombia 


CO2 
33 
4.5 

16.9 
0.4 
2.2 

14.7 
6.2 
1.6 


Congo 0.5 


Egypt 2.6 
Ethiopia 0.1 
France 5.6 
Germany 9.1 
Ghana 0.4 
India dg 
Indonesia 1.8 
lran Tek 
Iraq 3.7 
Italy 6.7 
Japan 9.2 
Kenya 0.3 
Korea, North 11.5 
Korea, South 2.9 
Malaysia ta 
Mexico 3.8 
Morocco 1.6 
Myanmar 0.2 
Nepal O.T 
Nigeria 0.5 
Pakistan 0.9 
Peru 2.0 
Philippines 0.9 
Poland 8.3 
Romania 3.9 
Russia 12.2 
Saudi Arabia 17.0 
South Africa 9.0 
Spain 5.8 
Sudan 0:3 
Tanzania 0.2 
Thailand 4.4 
Turkey 4.1 
Ukraine 6.6 
United Kingdom 7.9 
United States 17.6 
Uzbekistan 3.7 
Venezuela 6.9 
Vietnam 1.7 


68. Traveling to work How long do people travel each day to get to work? The following 
table gives the average travel times to work (in minutes) for workers in each state and the 


District of Columbia who are at least 16 years old and don’t work at home. 


AL 

AK 
AZ 

AR 
CA 
CO 
CT 
DE 


23.6 
i ey, 
25.0 
207 
26.8 
23.9 
24.1 
23.6 
25:4 
Zio 
25:5 
20.1 
20:9 
22.3 
18.2 
18.5 
22.4 
25.1 
22.3 
30.6 
26.6 
23.4 
22.0 
24.0 
22.9 
17.6 
17.7 
24.2 
24.6 
29.1 
20.9 
30.9 
23.4 
15.0 
22.1 
20.0 
21.8 
25.0 
22.3 
229 
15:9 


ral 


TN 23.5 


TX 24.6 
UT 20.8 
VT 21.2 
VA 26.9 
WA 25.2 
WV 25.6 
WI 20.8 
WY (ae 
DC 29.2 


a. Make a histogram to display the travel time data using intervals of width 2 minutes, 
starting at 14 minutes. 


b. Describe the shape of the distribution. What is the most common interval of travel 
times? 


69. DRP test scores There are many ways to measure the reading ability of children. One 
frequently used test is the Degree of Reading Power (DRP). In a research study on third- 
grade students, the DRP was administered to 44 students.22 Their scores were as follows. 


40 26 39 14 42 18 25 43 46 27 19 
47 19 26 30 34 15 44 40 38 aL 46 
52 25 35 35 33 29 34 41 49 28 52 
47 35 48 Ze 33 41 DL 27 14 34 45 


Make a histogram to display the data. Write a few sentences describing the distribution of 
DRP scores. 


70. Country music The lengths, in minutes, of the 50 most popular mp3 downloads of songs 
by country artist Dierks Bentley are given here. 


4.2 4.0 3.9 3.8 3.7 4.7 
3.4 4.0 4.4 50 4.6 3.7 
4.6 4.4 4.1 3.0 3.2 4.7 
3.5 aa | 4.3 3./ 4.8 4.4 
4.2 4.7 6.2 4.0 7.0 3.9 
3.4 3.4 2.9 3.3 4.0 4.2 
a2 3.4 af 3.5 3.4 uf 
3.9 oa 3.8 3.4 auf 3.6 


4.5 3 


Make a histogram to display the data. Write a few sentences describing the distribution of 
song lengths. 


71. Returns on common stocks The return on a stock is the change in its market price plus 


any dividend payments made. Return is usually expressed as a percent of the beginning 


price. The figure shows a histogram of the distribution of monthly returns for the U.S. 


stock market over a 273-month period.22 
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a. Describe the overall shape of the distribution of monthly returns. 
b. What is the approximate center of this distribution? 


c. Explain why you cannot find the exact value for the minimum return. Between what 
two values does it lie? 


d. A return less than 0 means that stocks lost value in that month. About what percent of 
all months had returns less than 0? 


72. Healthy cereal? Researchers collected data on calories per serving for 77 brands of 
breakfast cereal. The histogram displays the data.24 
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a. Describe the overall shape of the distribution of calories. 
b. What is the approximate center of this distribution? 


c. Explain why you cannot find the exact value for the maximum number of calories per 


serving for these cereal brands. Between what two values does it lie? 


d. About what percent of the cereal brands have 130 or more calories per serving? 


73. Paying for championships Does paying high salaries lead to more victories in 
professional sports? The New York Yankees have long been known for having Major 
League Baseball’s highest team payroll. And over the years, the team has won many 
championships. This strategy didn’t pay off in 2008, when the Philadelphia Phillies won 
the World Series. Maybe the Yankees didn’t spend enough money that year. The figure 
shows histograms of the salary distributions for the two teams during the 2008 season. 
Why can’t you use these graphs to effectively compare the team payrolls? 
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74, Paying for championships Refer to Exercise 73. Here is a better graph of the 2008 salary 
distributions for the Yankees and the Phillies. Write a few sentences comparing these two 
distributions. 

Yankees Phillies 


Frequen 
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75. Value of a diploma Do students who graduate from high school earn more money than 
students who do not? To find out, we took a random sample of 371 U.S. residents aged 18 
and older. The educational level and total personal income of each person were recorded. 
The data for the 57 non-graduates (No) and the 314 graduates (Yes) are displayed in the 
relative frequency histograms. 


High school graduate? 


i i 
0 20 40 60 80 100 120 140 160 
Total personal income ($1000s) 


a. Would it be appropriate to use frequency histograms instead of relative frequency 
histograms in this setting? Explain why or why not. 


b. Compare the distributions of total personal income for the two groups. 


76. Strong paper towels In commercials for Bounty paper towels, the manufacturer claims 
that they are the “quicker picker-upper,” but are they also the stronger picker-upper? Two 
of Mr. Tabor’s statistics students, Wesley and Maverick, decided to find out. They 
selected a random sample of 30 Bounty paper towels and a random sample of 30 generic 
paper towels and measured their strength when wet. To do this, they uniformly soaked 
each paper towel with 4 ounces of water, held two opposite edges of the paper towel, and 
counted how many quarters each paper towel could hold until ripping, alternating brands. 
The data are displayed in the relative frequency histograms. Compare the distributions. 
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a. Would it be appropriate to use frequency histograms instead of relative frequency 
histograms in this setting? Explain why or why not. 


b. Compare the distributions of number of quarters until breaking for the two paper towel 
brands. 


77. Birth months Imagine asking a random sample of 60 students from your school about 
their birth months. Draw a plausible (believable) graph of the distribution of birth months. 
Should you use a bar graph or a histogram to display the data? 


78. Die rolls Imagine rolling a fair, six-sided die 60 times. Draw a plausible graph of the 
distribution of die rolls. Should you use a bar graph or a histogram to display the data? 


79. AP® exam scores The table gives the distribution of grades earned by students taking the 
AP® Calculus AB and AP® Statistics exams in 2016.22 


Grade 
5 4 3 2 1 Total 
Calculus AB 76,486 308,215 
Statistics 29,627 | 44,884 | 51,367 32,120 | 48,565 206,563 


a. Make an appropriate graphical display to compare the grade distributions for AP® 
Calculus AB and AP® Statistics. 


b. Write a few sentences comparing the two distributions of exam grades. 
Multiple Choice Select the best answer for Exercises 80-85. 


80. Here are the amounts of money (cents) in coins carried by 10 students in a statistics class: 
50, 35, 0, 46, 86, 0, 5, 47, 23, 65. To make a stemplot of these data, you would use stems 


a. 0, 2, 3, 4, 6, 8. 
b. 0, 1, 2, 3, 4, 5, 6, 7, 8. 


ce 0; 3,5;.6,.7. 


d. 00, 10, 20, 30, 40, 50, 60, 70, 80, 90. 


e. None of these. 


81. The histogram shows the heights of 300 randomly selected high school students. Which of 
the following is the best description of the shape of the distribution of heights? 


82. 


83. 


d. 


e. 


70 


60 
50 
40 


Roughly symmetric and single-peaked 
Roughly symmetric and double-peaked 
Roughly symmetric and multi-peaked 
Skewed to the left 


Skewed to the right 


You look at real estate ads for houses in Naples, Florida. There are many houses ranging 
from $200,000 to $500,000 in price. The few houses on the water, however, are priced up 
to $15 million. The distribution of house prices will be 


a. 
b. 
C. 


d. 


eC. 


skewed to the left. 
roughly symmetric. 
skewed to the right. 
single-peaked. 

too high. 


The histogram shows the distribution of the percents of women aged 15 and over who 
have never married in each of the 50 states and the District of Columbia. Which of the 
following statements about the histogram is correct? 


84. 


85. 


d. 


e. 


Number of states 


0 1 l | 
20 24 28 32 36 40 4d 48 52 
Percent of women over age 15 who never married 


The center (median) of the distribution is about 36%. 


There are more states with percentages above 32 than there are states with percentages 
less than 24. 


It would be better if the values from 34 to 50 were deleted on the horizontal axis so 
there wouldn’t be a large gap. 


There was one state with a value of exactly 33%. 


About half of the states had percentages between 24% and 28%. 


When comparing two distributions, it would be best to use relative frequency histograms 
rather than frequency histograms when 


a. 
b. 
c 


d, 


e. 


the distributions have different shapes. 

the distributions have different amounts of variability. 
the distributions have different centers. 

the distributions have different numbers of observations. 


at least one of the distributions has outliers. 


Which of the following is the best reason for choosing a stemplot rather than a histogram 
to display the distribution of a quantitative variable? 


a. 


b 


C. 


d 


e. 


Stemplots allow you to split stems; histograms don’t. 

. Stemplots allow you to see the values of individual observations. 
Stemplots are better for displaying very large sets of data. 

. Stemplots never require rounding of values. 


Stemplots make it easier to determine the shape of a distribution. 


86. 


Recycle and Review 


Risks of playing soccer (1.1) A study in Sweden looked at former elite soccer players, 
people who had played soccer but not at the elite level, and people of the same age who 
did not play soccer. Here is a two-way table that classifies these individuals by whether or 
not they had arthritis of the hip or knee by their mid-fifties:2® 


Soccer level 


Elite Non-elite Did not play 
Whether person Yes 24 
developed No 61 206 548 
arthritis 


a. What percent of the people in this study were elite soccer players? What percent of the 
people in this study developed arthritis? 


b. What percent of the elite soccer players developed arthritis? What percent of those who 


got arthritis were elite soccer players? 


c. Researchers suspected that the more serious soccer players were more likely to develop 
arthritis later in life. Do the data confirm this suspicion? Calculate appropriate 
percentages to support your answer. 


SECTION 1.3 Describing Quantitative Data with Numbers 


LEARNING TARGETS By the end of the section, you should be able to: 


e Calculate measures of center (mean, median) for a distribution of quantitative 
data. 


Calculate and interpret measures of variability (range, standard deviation, /QR) 
for a distribution of quantitative data. 


Explain how outliers and skewness affect measures of center and variability. 
Identify outliers using the 1.5 x JOR rule. 


Make and interpret boxplots of quantitative data. 


Use boxplots and numerical summaries to compare distributions of quantitative 
data. 


How much offense did the 2016 U.S. women’s soccer team generate? The dotplot (reproduced 
from Section 1.2) shows the number of goals the team scored in 20 games played. 
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Goals scored 


The distribution is right-skewed and single-peaked. The games in which the team scored 9 and 
10 goals appear to be outliers. How can we describe the center and variability of this 
distribution? 


Measuring Center: The Mean 


The most common measure of center is the mean. 


DEFINITION The mean X X 


The mean X_ X (pronounced “x-bar”) of a distribution of quantitative data is the 
average of all the individual data values. To find the mean, add all the values and 
divide by the total number of observations. 


If the n observations are x1, Xo, ..., X,, the mean is given by the formula 


X =sum of data valuesnumber of data values=X1+X2+---+Xnn=>y Xin 


x —_sum of data values xy +X9+---+Xn 7 > ae 
~ number of data values _ n on 


The & (capital Greek letter sigma) in the formula is short for “add them all up.” The 
subscripts on the observations x; are just a way of keeping the n data values distinct. They do 


not necessarily indicate order or any other special facts about the data. 


EXAMPLE | How many goals? (>) 
Calculating the mean 


Kyodo News/Getty Images 


PROBLEM: Here are the data on the number of goals scored in 20 games played by the 
2016 U.S. women’s soccer team: 


a. Calculate the mean number of goals scored per game by the team. Show your work. 


b. The earlier description of these data (page 35) suggests that the games in which the team 
scored 9 and 10 goals are possible outliers. Calculate the mean number of goals scored per 
game by the team in the other 18 games that season. What do you notice? 


SOLUTION: 


a. 
M=§ 4541410454 2+141424+3+3+2414+4424+1+241+49 + 320=6320=3.15 goals 


b. The mean for the other 18 games ts 


KM =§4+ 541454 24+141424+34+3+2414442414+2 41 + 318=4418-2.44 goals 
a 6+565+1+54+24+14+1424+3434+24+14+44241424+143 
7 18 


4 
= == 2.44 goals 


These two games increased the team’s mean number 0 oals scored er qame b 0.71 
J ag) | 


goals. 
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The notation x © refers to the mean of a sample. Most of the time, the data we encounter 
can be thought of as a sample from some larger population. When we need to refer to a 


population mean, we’|l use the symbol p (Greek letter mu, pronounced “mew”). If you have the 


entire population of data available, then you calculate p in just the way you’d expect: add the 
values of all the observations, and divide by the number of observations. 


The preceding example illustrates an important weakness of the mean as a measure of 
center: rT) the mean is sensitive to extreme values in a distribution. These may be outliers, 


but a skewed distribution that has no outliers will also pull the mean toward its long tail. We 
say that the mean is not a resistant measure of center. 


DEFINITION Resistant 


A statistical measure is resistant if it isn’t sensitive to extreme values. 


The mean of a distribution also has a physical interpretation, as the following activity 
shows. 


ae . 
Y Mean as a “balance point” 


Ann Heath 


In this activity, you’ ll investigate an important property of the mean. 


1. Stack 5 pennies on top of the 6-inch mark on a 12-inch ruler. Place a pencil under the ruler 
to make a “seesaw” on a desk or table. Move the pencil until the ruler balances. What is 
the relationship between the location of the pencil and the mean of the five data values 6, 
6, 6, 6, and 6? 


2. Move one penny off the stack to the 8-inch mark on your ruler. Now move one other 
penny so that the ruler balances again without moving the pencil. Where did you put the 
other penny? What is the mean of the five data values represented by the pennies now? 


3. Move one more penny off the stack to the 2-inch mark on your ruler. Now move both 
remaining pennies from the 6-inch mark so that the ruler still balances with the pencil in 
the same location. Is the mean of the data values still 6? 


4. Discuss with your classmates: Why is the mean called the “balance point” of a 
distribution? 


The activity gives a physical interpretation of the mean as the balance point of a 
distribution. For the data on goals scored in each of 20 games played by the 2016 U.S. 
women’s soccer team, the dotplot balances at x _=3.15¢ = 3.15 goals. 
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Measuring Center: The Median 


We could also report the value in the “middle” of a distribution as its center. That’s the idea of 
the median. 


DEFINITION Median 


The median is the midpoint of a distribution, the number such that about half the 
observations are smaller and about half are larger. 


To find the median, arrange the data values from smallest to largest. 


e If the number rn of data values is odd, the median is the middle value in the 
ordered list. 


e If the number n of data values is even, the median is the average of the two 
middle values in the ordered list. 


The median is easy to find by hand for small sets of data. For instance, here are the data 


from Section 1.2 on the highway fuel economy ratings for a sample of 25 model year 2018 
Toyota 4Runners tested by the EPA: 


224. 22.4 22.3. 23.3. 22.3 22,5 22.5 22.4 22.1 21,5 22.0 22.2 22.7 
228 224 22.6 22.9 22.5 22.1 22.4 222 22.9 22.6 21,9 22.4 


Start by sorting the data values from smallest to largest: 


21,5 21,9 22,0: 224. 224 222° 22.2 22.5 22.9. 22.9 22,4 224. 224 
224 224 22.4 22.5 22.5 22.6 22.6 22.7 22.6 22.9 22.9 23.3 


There are n = 25 data values (an odd number), so the median is the middle (13th) value in the 
ordered list, the bold 22.4. 


EXAMPLE | How many goals? (>) 
Finding the median 


Icon Sports Wire/Getty Images 


PROBLEM: Here are the data on the number of goals scored in 20 games played by the 
2016 U.S. women’s soccer team: 


Find the median. 


SOLUTION: 


111111222Q|2)33345559 10 


2+2 _ 9 
The median is 24+22=2. 2 


| To find the median, sort the data values from smallest to largest. Because there are n | 


= 20 data values (an even number), the median is the average of the middle two 
values in the ordered list. 
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Comparing the Mean and the Median 


Which measure—the mean or the median—should we report as the center of a distribution? 
That depends on both the shape of the distribution and whether there are any outliers. 


e Shape: Figure 1.13 shows the mean and median for dotplots with three different shapes. 
Notice how these two measures of center compare in each case. The mean is pulled in the 
direction of the long tail in a skewed distribution. 


Skewed to the left 
Mean < Median 


Median = 19 
> . ts 3 ¢ E 
T T T T T A Sa 
13 14 15 16 17 18 19 20 
Mean = 17.97 
(a) Quiz score 
Roughly symmetric 
Mean = Median 
Median = 22.4 
, att 24864 , 
j 
21.5 22.0 22.5 23.0 23.5 
Mean = 22.40 


(b) Highway fuel economy (mpg) 


Skewed to the right 
Mean > Median 


Median = 2 


tra a 


S 412 42 4&35 6 FJ & SS 30 
Mean = 3.15 


(c) Goals scored 
FIGURE 1.13 Dotplots that show the relationship between the mean and median in distributions 
with different shapes: (a) Scores of 30 statistics students on a 20-point quiz, (b) highway fuel 
economy ratings for a sample of 25 model year 2018 Toyota 4Runners, and (c) number of goals 
scored in 20 games played by the 2016 U.S. women’s soccer team. 


e Outliers: We noted earlier that the mean is sensitive to extreme values. If we remove the 


two possible outliers (9 and 10) in Figure 1.13(c), the mean number of goals scored per game 
decreases from 3.15 to 2.44. The median number of goals scored is 2 whether we include 
these two games or not. The median is a resistant measure of center, but the mean is not. 


You can compare how the mean and median behave by using the Mean and Median applet at the 
book’s website, highschool.bfwpub.comitps6e. 


EFFECT OF SKEWNESS AND OUTLIERS ON MEASURES OF CENTER 


e Ifa distribution of quantitative data is roughly symmetric and has no outliers, the mean 
and median will be similar. 


e If the distribution is strongly skewed, the mean will be pulled in the direction of the 
skewness but the median won’t. For a right-skewed distribution, we expect the mean to be 
greater than the median. For a left-skewed distribution, we expect the mean to be less than 
the median. 


e The median is resistant to outliers but the mean isn’t. 


The mean and median measure center in different ways, and both are useful. In Major 
League Baseball (MLB), the distribution of player salaries is strongly skewed to the right. Most 
players earn close to the minimum salary (which was $507,500 in 2016), while a few earn more 
than $20 million. The median salary for MLB players in 2016 was about $1.5 million—but the 
mean salary was about $4.4 million. Clayton Kershaw, Miguel Cabrera, John Lester, and 
several other highly paid superstars pulled the mean up but that did not affect the median. The 
median gives us a good idea of what a “typical” MLB salary is. If we want to know the total 
salary paid to MLB players in 2016, however, we would multiply the mean salary by the total 
number of players: ($4.4 million)(862) * $3.8 billion! 


CHECK YOUR UNDERSTANDING 


i. 


y Some students purchased pumpkins for a carving contest. Before the contest began, they 
weighed the pumpkins. The weights in pounds are shown here, along with a histogram of the 
data. 


3.6 4.0 9.6 14.0 110 124 13.0 2.0 6.0 6.6 15.0 3.4 
12:77 6.0 2.8 9.6 4.0 6.1 a4 11-9 3.4 31.0 33.0 


woh UA +) © 


Frequency 


ore | 


0 5 10 15 20 25 30 35 
Pumpkin weight (Ib) 


1. Calculate the mean weight of the pumpkins. 
2. Find the median weight of the pumpkins. 


3. Would you use the mean or the median to summarize the typical weight of a pumpkin in 
this contest? Explain. 


Measuring Variability: The Range 


Being able to describe the shape and center of a distribution is a great start. However, two 
distributions can have the same shape and center, but still look quite different. 

Figure 1.14 shows comparative dotplots of the length (in millimeters) of separate random 
samples of PVC pipe from two suppliers, A and B.=“ Both distributions are roughly symmetric 
and single-peaked, with centers at about 600 mm, but the variability of these two distributions 
is quite different. The sample of pipes from Supplier A has much more consistent lengths (less 
variability) than the sample from Supplier B. 
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FIGURE 1.14 Comparative dotplots of the length of PVC pipes in separate random samples 
from Supplier A and Supplier B. 


There are several ways to measure the variability of a distribution. The simplest is the 


range. 


DEFINITION Range 


The range of a distribution is the distance between the minimum value and the 


maximum value. That is, 


Range = Maximum — MinimumRange = Maximum — Minimum 


Here are the data on the number of goals scored in 20 games played by the 2016 U.S. 
women’s soccer team, along with a dotplot: 


5S 10 5272223 321427219 3 
T _—_ a ae: T T —r 
0 l 2 3 4 5 6 7 8 9 10 


Goals scored 


The range of this distribution is 10 — 1 = 9 goals. Note that og the range of a data set is a 
single number. In everyday language, people sometimes say things like, “The data values 
range from 1 to 10.” A correct statement is “The number of goals scored in 20 games played by 
the 2016 U.S. women’s soccer team varies from 1 to 10, a range of 9 goals.” 


The range is not a resistant measure of variability. It depends on only the maximum and 
minimum values, which may be outliers. Look again at the data on goals scored by the 2016 
U.S. women’s soccer team. Without the possible outliers at 9 and 10 goals, the range of the 
distribution would decrease to 5 — 1 = 4 goals. 

The following graph illustrates another problem with the range as a measure of variability. 
The parallel dotplots show the lengths (in millimeters) of a sample of 11 nails produced by 
each of two machines.22 Both distributions are centered at 70 mm and have a range of 72 — 68 
= 4mm. But the lengths of the nails made by Machine B clearly vary more from the center of 
70 mm than the nails made by Machine A. 


68 69 70 71 72 
Length (mm) 


Measuring Variability: The Standard Deviation 


If we summarize the center of a distribution with the mean, then we should use the standard 
deviation to describe the variation of data values around the mean. 


DEFINITION Standard deviation 


The standard deviation measures the typical distance of the values ina 
distribution from the mean. 


How do we calculate the standard deviation s, of a quantitative data set with n values? Here 


are the steps. 
HOW TO CALCULATE THE STANDARD DEVIATION s, 


e Find the mean of the distribution. 
e Calculate the deviation of each value from the mean: deviation = value — mean. 
e Square each of the deviations. 


e Add all the squared deviations, divide by n — 1, and take the square root. 


If the values in a data set are Xj, X9,..., X, the standard deviation is given by the formula 


sx=(x1-x )2 + (x2-x )2 +--+ + (xn-x )2n-1=)y (xi-x )2n-1 


)? tos + (an—#)* - 


rr — 


| (a1 -2)* + (2-# 
n—1 


AP® EXAM TIP 


The formula sheet provided with the AP® Statistics exam gives the sample standard 
deviation in the equivalent form 


~ >. (i = z)’. 


"42 / 
sx=1n-1)¥ (xi-x )2. i 


The notation s, refers to the standard deviation of a sample. When we need to refer to the 


standard deviation of a population, we’ll use the symbol o (Greek lowercase sigma). The 
population standard deviation is calculated by dividing the sum of squared deviations by n 
instead of n — 1 before taking the square root. 


EXAMPLE |How many friends? (>) 
Calculating and interpreting standard deviation 


LaraBelova/Getty Images 


PROBLEM: Eleven high school students were asked how many “close” friends they have. 
Here are their responses, along with a dotplot: 


122233 33 4 4 6 


J 
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Number of close friends 


Calculate the standard deviation. Interpret this value. 


SOLUTION: 


To calculate the standard deviation: 
e Find the mean of the distribution. 
e Calculate the deviation of each value from the mean: deviation = value — mean 


e Square each of the deviations. 
e Add all the squared deviations, divide by n — 1, and take the square root to return to 
the original units. 


Rosle 2 eee 238+ 34s 4 4 oll3 
142424243434343444446 _ 


4 7 3 

xizi xi-x Zi — & (xi-x 2 —z) 
1 Ls =e (-2)2 =4 

2 2232-1 (-1)2=1 

2 2233 =1 (-1)2=1 

2 ced (-1)2=1 

3 3-3=0 02=0 

3 3-3=0 02=0 

3 3-5 —0 02=0 

3 oo =O 02=0 


8, = ,4/ -——— = 1.34 close friends 
salt 112. che he rE 


Interpretation: The number of close friends these students have typically varies by about 
1.34 close friends from the mean of 3 close friends. 
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The value obtained before taking the square root in the standard deviation calculation is 

known as the variance. In the preceding example, the sample variance is 
—\2 
(2;—Z) 18 

_ a a2 -__"___"__ 1.80 
sx2=) (xi-x )2n-1=1811-1=1.80 n—1 11-1 
Unfortunately, the units are “squared close friends.” Because variance is measured in squared 
units, it is not a very helpful way to describe the variability of a distribution. 


Think About It 


WHY IS THE STANDARD DEVIATION CALCULATED IN SUCH A COMPLEX 
WAY? Add the deviations from the mean in the preceding example. You should get a sum 
of 0. Why? Because the mean is the balance point of the distribution. We square the 
deviations to avoid the positive and negative deviations balancing each other out and adding 
to 0. It might seem strange to “average” the squared deviations by dividing by n - 1. We’ll 
explain the reason for doing this in Chapter 7. It’s easier to understand why we take the 
square root: to return to the original units (close friends). 


More important than the details of calculating s, are the properties of the standard deviation as 

a measure of variability: 

e s, is always greater than or equal to 0. s, = 0 only when there is no variability, that is, 
when all values in a distribution are the same. 

e Larger values of s, indicate greater variation from the mean of a distribution. The 


comparative dotplot shows the lengths of PVC pipe in random samples from two different 


suppliers. Supplier A’s pipe lengths have a standard deviation of 0.681 mm, while Supplier 
B’s pipe lengths have a standard deviation of 2.02 mm. The lengths of pipes from Supplier B 
are typically farther from the mean than the lengths of pipes from Supplier A. 
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e s, is not a resistant measure of variability. The use of squared deviations makes s, even 


more sensitive than x # to extreme values in a distribution. For example, the standard 
deviation of the number of goals scored in 20 games played by the 2016 U.S women’s soccer 
team is 2.58 goals. If we omit the possible outliers of 9 and 10 goals, the standard deviation 
drops to 1.46 goals. 
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e s, measures variation about the mean. It should be used only when the mean is chosen as 


the measure of center. 


In the close friends example, 11 high school students had an average of x =3% = 3 close 
friends with a standard deviation of s, = 1.34. What if a 12th high school student was added to 


the sample who had 3 close friends? The mean number of close friends in the sample would 
still be x =3. = 3. How would s, be affected? Because the standard deviation measures the 


typical distance of the values in a distribution from the mean, s, would decrease because this 


12th value is at a distance of 0 from the mean. In fact, the new standard deviation would be 


| S- (x5 = 2 | 18 


= 1.28 friends 


Sx / 
sx=) (xi-x )2n-1=1812-1=1.28 friends n—1 V 12-1 


Measuring Variability: The Interquartile Range (/QR) 


We can avoid the impact of extreme values on our measure of variability by focusing on the 
middle of the distribution. Start by ordering the data values from smallest to largest. Then find 
the quartiles, the values that divide the distribution into four groups of roughly equal size. The 
first quartile Q, lies one-quarter of the way up the list. The second quartile is the median, 


which is halfway up the list. The third quartile Q3 lies three-quarters of the way up the list. 


The first and third quartiles mark out the middle half of the distribution. 
Second quartile 


First quartile Q, (median) Third quartile QO, 


“e 
hels « 


[ a T T 


18 20 22 24 26 28 30 32 34 36 38 40 
Amount collected ($) 


For example, here are the amounts collected each hour by a charity at a local store: $19, 
$26, $25, $37, $31, $28, $22, $22, $29, $34, $39, and $31. The dotplot displays the data. 
Because there are 12 data values, the quartiles divide the distribution into 4 groups of 3 values. 


DEFINITION Quartiles, First quartile Q;, Third quartile Q3 


The quartiles of a distribution divide the ordered data set into four groups having 
roughly the same number of values. To find the quartiles, arrange the data values 
from smallest to largest and find the median. 


The first quartile Q, is the median of the data values that are to the left of the 
median in the ordered list. 


The third quartile Q3 is the median of the data values that are to the right of the 
median in the ordered list. 


The interquartile range ([QR) measures the variability in the middle half of the 
distribution. 


DEFINITION Interquartile range (!QR) 


The interquartile range (/QR) is the distance between the first and third quartiles 
of a distribution. In symbols: 


IQR=Q3-Q1/QR = Q3 — Qi 


Notice that the IQR is simply the range of the “middle half” of the distribution. 


EXAMPLE | Boys and their shoes? (>) 
Finding the [QR 


Peter Cade/Getty Images 


PROBLEM: How many pairs of shoes does a typical teenage boy own? To find out, two 
AP® Statistics students surveyed a random sample of 20 male students from their large high 
school and recorded the number of pairs of shoes that each boy owned. Here are the data, 
along with a dotplot: 
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Number of pairs of shoes 


Find the interquartile range. 


SOLUTION: 


455567777@6(|1010 10 10 11 12 14 22 35 38 
Median = 9 


Sort the data values from smallest to largest and find the median. 


i lige ia ages Soe cag MO RORE™ sees b= aig cp op 1 
i455 56|2)7 7 7 81140 10 10 10 41 |12)14 22 35 38 


Find the first quartile Q; and the third quartile Q3. 


fQR=11.5-65=5 pairs of shoes 
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The quartiles and the interquartile range are resistant because they are not affected by a few 
extreme values. For the shoe data, Q3 would still be 11.5 and the IQR would still be 5 if the 
maximum were 58 rather than 38. 

Be sure to leave out the median when you locate the quartiles. In the preceding example, 
the median was not one of the data values. For the earlier close friends data set, we ignore the 
circled median of 3 when finding Q, and Q3. 


CHECK YOUR UNDERSTANDING 


. 


y Here are data on the highway fuel economy ratings for a sample of 25 model year 2018 
Toyota 4Runners tested by the EPA, along with a dotplot: 


224 2 AL 22 239 pee. Meo 22 Ce a DD eee 22 e222 
22O 224A 22.65 222090 225 2 2a 2 2 2S 24 


21.5 22.0 22.5 23.0 | 23.5 
Highway fuel economy (mpg) 
1. Find the range of the distribution. 


2. The mean and standard deviation of the distribution are 22.404 mpg and 0.363 mpg, 
respectively. Interpret the standard deviation. 


3. Find the interquartile range of the distribution. 


4. Which measure of variability would you choose to describe the distribution? Explain. 


Numerical Summaries with Technology 


Graphing calculators and computer software will calculate numerical summaries for you. Using 
technology to perform calculations will allow you to focus on choosing the right methods and 
interpreting your results. 


| 3. Technology Corner | COMPUTING NUMERICAL SUMMARIES | 


N<] 
TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s find numerical summaries for the boys’ shoes data from the example on page 64. 
We’ll start by showing you how to compute summary statistics on the TI-83/84 and then 
look at output from computer software. 


I. One-variable statistics on the TI-83/84 
1. Enter the data in list L1. 
2. Find the summary statistics for the shoe data. 
e Press (CALC); choose 1-VarStats. Os 2.55 or later: In the dialog box, press 


(L1) and to specify L1 as the List. Leave FreqList blank Arrow 
down to Calculate and press [ENTER]. Older Os: Press (L1) and [ENTER 


NORMAL FLOAT AUTO REAL RADIAN MP 1 


x=11.65 

=x=233 
=x?7=4401 

Sx=9. 421559822 
ox=9. 183000599 
n=20 

minx=4 
401=6.5 


e Press |] to see the rest of the one-variable statistics. 


NORMAL FLOAT AUTO REAL RADIAN MP 


TSx=9. 421559822 
ox=9.183000S99 
n=20 
minx=4 
Q1=6.5 
Med=9 
Q3=11.5 
maxx=38 


I. Output from statistical software We used Minitab statistical software to calculate 
descriptive statistics for the boys’ shoes data. Minitab allows you to choose which 
numerical summaries are included in the output. 


Descriptive Statistics: Shoes 
Variable N Mean StDev Minimum Q1 Median Q3 Maximum 


Shoes 20° 11.657 9.42 4.00 6:20 9.00 11775 38-700 


Note: The TI-83/84 gives the first and third quartiles of the boys’ shoes distribution as Q; = 
6.5 and Q3 = 11.5. Minitab reports that Q; = 6.25 and Q3 = 11.75. What happened? Minitab 


and some other software use slightly different rules for locating quartiles. Results from the 
various rules are usually close to each other. Be aware of possible differences when 
calculating quartiles as they may affect more than just the JQR. 


Identifying Outliers 


Besides serving as a measure of variability, the interquartile range (IQR) is used as a “ruler” for 
identifying outliers. 


HOW TO IDENTIFY OUTLIERS: THE 1.5 x /QR RULE 


Call an observation an outlier if it falls more than 1.5 x IQR above the third quartile or 


below the first quartile. That is, 
Low outliers <Q; — 1.5 IQR High outliers > Q3 + 1.5 x IQR 


Here are sorted data on the highway fuel economy ratings for a sample of 25 model year 2018 
Toyota 4Runners tested by the EPA, along with a dotplot: 
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Does the 1.5 x IQR rule identify any outliers in this distribution? If you did the preceding 
Check Your Understanding, you should have found that Q, = 22.2 mpg, Q3 = 22.6 mpg, and 
IQR = 0.4 mpg. For these data, 


High outliers > Q3+1.5xIQR=22.6+1.5x0.4 = 23.2 
High outliers > Q3 + 1.5 x IQR = 22.6 4+ 1.5 x 0.4 = 23.2 


and 


Low outliers < Q1-1.5xIQR=22.2-1.5x0.4 = 21.6 
Low outliers < Q, — 1.5 x JQR = 22.2—1.5 x 0.4 = 21.6 


The cars with estimated highway fuel economy ratings of 21.5 and 23.3 are identified as 
outliers. 


AP® EXAM TIP 


You may be asked to determine whether a quantitative data set has any outliers. Be 
prepared to state and use the rule for identifying outliers. 


EXAMPLE |How many goals? (>) 
Identifying outliers 


Icon Sports Wire/Getty Images 


PROBLEM: Here are sorted data on the number of goals scored in 20 games played by the 
2016 U.S women’s soccer team, along with a dotplot: 
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Identify any outliers in the distribution. Show your work. 


SOLUTION: 


111114 D2 2 2@ D353 35 55910! 
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Q,=1 Median=2 Q,=4.5 

DIOBS Oly (Oli, = 2s 25 

Low outliers < Q,-1.5 x IQR = 1-1.5 x 3.5 = -4.25 

High outliers > Qy + 1.5 X IQR = 4.5 + 1.5 X 3.5 = 9.75 


There are no data values less than —4.25, but the game in which the team scored 10 goals 
is an outlier. 


The game in which the team scored 9 goals is not identified as an outlier by the 1.5 x 


JOR rule. 
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It is important to identify outliers in a distribution for several reasons: 


. They might be inaccurate data values. Maybe someone recorded a value as 10.1 instead of 
101. Perhaps a measuring device broke down. Or maybe someone gave a silly response, like 
the student in a class survey who claimed to study 30,000 minutes per night! Try to correct 
errors like these if possible. If you can’t, give summary statistics with and without the 
outlier. 


’. They can indicate a remarkable occurrence. For example, in a graph of net worth, Bill 
Gates is likely to be an outlier. 


3. They can heavily influence the values of some summary statistics, like the mean, range, 
and standard deviation. 


Making and Interpreting Boxplots 


You can use a dotplot, stemplot, or histogram to display the distribution of a quantitative 
variable. Another graphical option for quantitative data is a boxplot. A boxplot summarizes a 
distribution by displaying the location of 5 important values within the distribution, known as 


its five-number summary. 


DEFINITION Five-number summary, Boxplot 


The five-number summary of a distribution of quantitative data consists of the 
minimum, the first quartile Q,, the median, the third quartile Q3, and the 


maximum. 


A boxplot is a visual representation of the five-number summary. 


A boxplot is sometimes called a box-and-whisker plot. 


Figure 1.15 illustrates the process of making a boxplot. The dotplot in Figure 1.15(a) shows 
the data on EPA estimated highway fuel economy ratings for a sample of 25 model year 2018 
Toyota 4Runners. We have marked the first quartile, the median, and the third quartile with 
vertical blue lines. The process of testing for outliers with the 1.5 x IQR rule is shown in red. 
Because the values of 21.5 mpg and 23.3 mpg are outliers, we mark these separately. To get the 
finished boxplot in Figure 1.15(b), we make a box spanning from Q, to Q3 and then draw 


“whiskers” to the smallest and largest data values that are not outliers 
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FIGURE 1.15 A visual illustration of how to make a boxplot for the Toyota 4Runner highway gas 
mileage data. (a) Dotplot of the data with the five-number summary and 1.5 x /QR marked. (b) 
Boxplot of the data with outliers identified (*). 


As you can see, it is fairly easy to make a boxplot by hand for small sets of data. Here’s a 
summary of the steps. 


HOW TO MAKE A BOXPLOT 


Find the five-number summary for the distribution. 
Identify outliers using the 1.5 x IQR rule. 


Draw and label the axis. Draw a horizontal axis and put the name of the quantitative 
variable underneath, including units if applicable. 


Scale the axis. Look at the minimum and maximum values in the data set. Start the 


horizontal axis at a convenient number equal to or below the minimum and place tick 
marks at equal intervals until you equal or exceed the maximum. 


Draw a box that spans from the first quartile (Q,) to the third quartile (Q3). 


Mark the median with a vertical line segment that’s the same height as the box. 


Draw whiskers—lines that extend from the ends of the box to the smallest and largest 
data values that are not outliers. Mark any outliers with a special symbol such as an 
asterisk (*). 


We see from the boxplot in Figure 1.15 that the distribution of highway gas mileage ratings 


for this sample of model year 2018 Toyota 4Runners is roughly symmetric with one high 
outlier and one low outlier. 


EXAMPLE | Picking pumpkins (>) 
Making and interpreting boxplots 


cscredon/Getty Images 


PROBLEM: Some students purchased pumpkins for a carving contest. Before the contest 
began, they weighed the pumpkins. The weights in pounds are shown here. 


3.6 4.0 9.6 1440 110 124 130 20 £46. 66 15.0 3.4 
1297 50 2.8 9.6 4.0 6.1 04 119 54 31.0 33.0 


a. Make a boxplot of the data. 


b. Explain why the median and JQR would be a better choice for summarizing the center and 
variability of the distribution of pumpkin weights than the mean and standard deviation. 


SOLUTION: 


d. Min Q, _—_ Posipeclel 


(20 28 34 36 40 GO) 54 54 60 60 61) 


"96 96 110 119124 427130 14.0 150 31.0330) 
Qs “Max 


BOOS = (Ol) (OU) = Ny = AO Sy 

Low outliers < Q, - 1.5 x IQR = 4.0 - 1.5 x 8.7 = -9.05 
High outliers > Q, + 1.5 X IQR = 12.7 + 1.5 X 8.7 = 25.75 
The pumpkins that weighed 31.0 and 33.0 pounds are outliers. 


To make the boxplot: 


e Find the five-number summary. 


e Identify outliers. 


e Draw and label the axis. 
e Scale the axis. 
e Draw a box. 


e Mark the median. 


e Draw whiskers to the smallest and largest data values that are not outliers. Mark 
outliers with an asterisk. 


0 5 1 1 20 2 30 3g 
Pumpkin weight (1b) 


b. The distribution of ‘pumpkin weights is skewed to the right with two high outliers. 
Because the mean and standard deviation are sensitive to outliers, it would be better to 
use the median and IQR, which are resistant. 


We know the distribution is skewed to the right because the left half of the distribution 


varies from 2.0 to 6.6 pounds, while the right half of the distribution (excluding 
outliers) varies from 6.6 to 15.0 pounds. 


FOR PRACTICE, TRY EXERCISE 111 


Boxplots provide a quick summary of the center and variability of a distribution. The 
median is displayed as a line in the central box, the interquartile range is the length of the box, 
and the range is the length of the entire plot, including outliers. Note that some statistical 
software orients boxplots vertically. At left is a vertical boxplot of the pumpkin weight data 
from the preceding example. You can see that the graph is skewed toward the larger values. 


Pumpkin weight (Ib) 


0 


Boxplots do not display each individual value in a distribution. And o@ boxplots don’t 
show gaps, clusters, or peaks. For instance, the dotplot below left displays the duration, in 
minutes, of 220 eruptions of the Old Faithful geyser. The distribution of eruption durations is 
clearly double-peaked (bimodal). But a boxplot of the data hides this important information 
about the shape of the distribution. 
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CHECK YOUR UNDERSTANDING 


Ryan and Brent were curious about the amount of french fries they would get in a large 
order from their favorite fast-food restaurant, Burger King. They went to several different 
Burger King locations over a series of days and ordered a total of 14 large fries. The weight 
of each order (in grams) is as follows: 


165 163 160 159 166 152 166 168 173 «#171 «+168 167 170 = 170 


1. Make a boxplot to display the data. 


2. According to a nutrition website, Burger King’s large fries weigh 160 grams, on average. 
Ryan and Brent suspect that their local Burger King restaurants may be skimping on fries. 
Does the boxplot in Question 1 support their suspicion? Explain why or why not. 


Comparing Distributions with Boxplots 


Boxplots are especially effective for comparing the distribution of a quantitative variable in two 
or more groups, as seen in the following example. 


EXAMPLE |Which company makes better tablets? (>) 
Comparing distributions with boxplots 


Peter Cade/Getty Images 


PROBLEM: In a recent year, Consumer Reports rated many tablet computers for 
performance and quality. Based on several variables, the magazine gave each tablet an 
overall rating, where higher scores indicate better ratings. The overall ratings of the tablets 
produced by Apple and Samsung are given here, along with parallel boxplots and numerical 
summaries of the data.22 


Apple 87 87 87 87 86 86 86 86 84 84 
84 84 8 83 83 83 81 79 76 73 
Samsung 88 87 87 86 86 86 86 86 84 84 83 83 
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Manufacturer 


60 65 70 75 80 85 90 
Overall rating 


aA Sy Min Qi Median Q3 Max IQR 
Apple 83.45 3.762 73. «83 84 86 87 3 
Samsung 79.87 6.74 62 75 83 86 88 11 


Compare the distributions of overall rating for Apple and Samsung. 


SOLUTION: 


Remember to compare shape, outliers, center, and variability! 


S hape: Both distributions of overall ratings are skewed to the left. 


Outliers: There are two low outliers in the Apple tablet distribution: overall ratings of 
73 and 76. The Samsung tablet distribution has no outliers. 


Center: The Apple tablets had a slightly higher median overall rating (84) than the 
Samsung tablets (8 2») ‘More importantly, about 75% of the Apple tablets had 
overall ratings that were greater than or equal to the median for the 


Samsung tablets. 


Because of the strong skewness and outliers, use the median and /QR instead of the 


mean and standard deviation when comparing center and variability. 


Variability: There is much more variation in overall rating among the Samsung tablets 
than the Apple tablets. The IQR for Samsung tablets (11) is almost four 
times larger than the IQR for Apple tablets (3). 


FOR PRACTICE, TRY EXERCISE 115 


AP® EXAM TIP 


Use statistical terms carefully and correctly on the AP® Statistics exam. Don’t say 
“mean” if you really mean “median.” Range is a single number; so are Q,, Q3, and 
IQR. Avoid poor use of language, like “the outlier skews the mean” or “the median is 
in the middle of the /QR.” Skewed is a shape and the /QR is a single number, nota 
region. If you misuse a term, expect to lose some credit. 


Here’s an activity that gives you a chance to put into practice what you have learned in this 
section. 


ACTIVITY Team challenge: Did Mr. Starnes stack his class? 


In this activity, you will work in a team of three or four students to resolve a dispute. 


Mr. Starnes teaches AP® Statistics, but he also does the class scheduling for the high school. 
There are two AP® Statistics classes—one taught by Mr. Starnes and one taught by Ms. 
McGrail. The two teachers give the same first test to their classes and grade the test together. 
Mr. Starnes’s students earned an average score that was 8 points higher than the average for 
Ms. McGrail’s class. Ms. McGrail wonders whether Mr. Starnes might have “adjusted” the 
class rosters from the computer scheduling program. In other words, she thinks he might 
have “stacked” his class. He denies this, of course. 


To help resolve the dispute, the teachers collect data on the cumulative grade point 
averages and SAT Math scores of their students. Mr. Starnes provides the GPA data from his 
computer. The students report their SAT Math scores. The following table shows the data for 
each student in the two classes. 


Did Mr. Starnes stack his class? Give appropriate graphical and numerical evidence to 
support your conclusion. Be prepared to defend your answer. 


Student Teacher GPA SAT-M 
1 Starnes 2.900 670 
2 Starnes 2.860 520 
S Starnes 2.600 570 
4 Starnes 3.600 710 
iS Starnes 3.200 600 
6 Starnes 2.700 590 
i Starnes 3.100 640 
8 Starnes 3.085 570 
9 Starnes 3.750 710 

10 Starnes 3.400 630 

11 Starnes 3.338 630 


12 Starnes 3.560 670 


is Starnes 3.800 650 
14 Starnes 3.200 660 
15 Starnes 3.100 510 
16 McGrail 2.900 620 
17 McGrail 3.300 590 
18 McGrail 3.980 650 
19 McGrail 2.900 600 
20 McGrail 3.200 620 
21 McGrail 3.500 680 
22 McGrail 2.800 500 
23 McGrail 2.900 502.5 
24 McGrail 3.950 640 
25 McGrail 3.100 630 
26 McGrail 2.850 580 
27 McGrail 2.900 590 
28 McGrail 3.245 600 
29 McGrail 3.000 600 
30 McGrail 3.000 620 
31 McGrail 2.800 580 
32 McGrail 2.900 600 
oo McGrail 3.200 600 


You can use technology to make boxplots, as the following Technology Corner illustrates. 


4. Technology Corner | MAKING BOXPLOTS (@& 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The TI-83/84 can plot up to three boxplots in the same viewing window. Let’s use the 
calculator to make parallel boxplots of the overall rating data for Apple and Samsung 
tablets. 


1. Enter the ratings for Apple tablets in list L1 and for Samsung in list L2. 


2. Set up two statistics plots: Plot1 to show a boxplot of the Apple data in list L1 and Plot2 
to show a boxplot of the Samsung data in list L2. The setup for Plot1 is shown. When you 
define Plot2, be sure to change L1 to L2. 


Note: The calculator offers two types of boxplots: one that shows outliers and one that 
doesn’t. We’ll always use the type that identifies outliers. 


3. Use the calculator’s Zoom feature to display the parallel boxplots. Then Trace to view the 
five-number summary. 


e Press |ZOOM| and select ZoomStat. 
e Press |TRACE|. 
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Section 1.3 Summary 


e A numerical summary of a distribution should include measures of center and variability. 


e The mean x = and the median describe the center of a distribution in different ways. The 
_ ei 
= t=>—_———. 

mean is the average of the observations: x => xin. n The median is the midpoint 
of the distribution, the number such that about half the observations are smaller and half are 
larger. 

e The simplest measure of variability for a distribution of quantitative data is the range, which 
is the distance from the maximum value to the minimum value. 


e When you use the mean to describe the center of a distribution, use the standard deviation 
to describe the distribution’s variability. The standard deviation s, gives the typical distance 
of the values in a distribution from the mean. In symbols, sx=y (xi-x )2n-1. 

> (2-2)? 
$y = \| ———. 


n—1 The standard deviation s, is 0 when there is no variability and gets 


larger as variability from the mean increases. 


e When you use the median to describe the center of a distribution, use the interquartile 
range to describe the distribution’s variability. The first quartile Q, has about one-fourth of 
the observations below it, and the third quartile Q3 has about three-fourths of the 


observations below it. The interquartile range (IQR) measures variability in the middle half 
of the distribution and is found using IQR = Q3 — Q). 


e The median is a resistant measure of center because it is relatively unaffected by extreme 
observations. The mean is not resistant. Among measures of variability, the IQR is resistant, 
but the standard deviation and range are not. 


e According to the 1.5 x IQR rule, an observation is an outlier if it is less than Q; — 1.5 x IQR 
or greater than Q3 + 1.5 x IQR. 


e Boxplots are based on the five-number summary of a distribution, consisting of the 
minimum, Q, the median, Q3, and the maximum. The box shows the variability in the 
middle half of the distribution. The median is marked within the box. Lines extend from the 
box to the smallest and the largest observations that are not outliers. Outliers are plotted with 
special symbols. Boxplots are especially useful for comparing distributions. 


1.3 Technology Corners 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


3. Computing numerical summaries 
4. Making boxplots 


Section 1.3 Exercises 
87. PQ 55 |) Quiz grades Joey’s first 14 quiz grades in a marking period were as follows: 


86 84 91 15 78 80 74 
87 76 96 82 90 98 93 


a. Calculate the mean. Show your work. 


b. Suppose Joey has an unexcused absence for the 15th quiz, and he receives a score of 0. 
Recalculate the mean. What property of the mean does this illustrate? 


88. Pulse rates Here are data on the resting pulse rates (in beats per minute) of 19 middle 
school students: 


71 104 76 88 78 71 68 86 70 90 
74 76 69 68 88 96 68 82 120 


a. Calculate the mean. Show your work. 


b. The student with a 120 pulse rate has a medical issue. Find the mean pulse rate for the 
other 18 students. What property of the mean does this illustrate? 


89. pg 57 | pm) Quiz grades Refer to Exercise 87. 


a. Find the median of Joey’s first 14 quiz grades. 


b. Find the median of Joey’s quiz grades after his unexcused absence. Explain why the 0 
quiz grade does not have much effect on the median. 


90. Pulse rates Refer to Exercise 88. 
a. Find the median pulse rate for all 19 students. 


b. Find the median pulse rate excluding the student with the medical issue. Explain why 
this student’s 120 pulse rate does not have much effect on the median. 


91. Electing the president To become president of the United States, a candidate does not 
have to receive a majority of the popular vote. The candidate does have to win a majority 
of the 538 Electoral College votes. Here is a stemplot of the number of electoral votes in 
2016 for each of the 50 states and the District of Columbia: 
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a. Find the median. 


b. Without doing any calculations, explain how the mean and median compare. 


92. Birthrates in Africa One of the important factors in determining population growth rates 
is the birthrate per 1000 individuals in a population. The dotplot shows the birthrates per 
1000 individuals (rounded to the nearest whole number) for 54 African nations. 
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Birthrate (per 1000 population) 


a. Find the median. 


b. Without doing any calculations, explain how the mean and median compare. 


93. House prices The mean and median selling prices of existing single-family homes sold in 


September 2016 were $276,200 and $234,200.42 Which of these numbers is the mean and 
which is the median? Explain your reasoning. 


94. Mean salary? Last year a small accounting firm paid each of its five clerks $32,000, two 
junior accountants $60,000 each, and the firm’s owner $280,000. 


a. What is the mean salary paid at this firm? How many of the employees earn less than 
the mean? What is the median salary? 


b. Write a sentence to describe how an unethical recruiter could use statistics to mislead 
prospective employees. 


95. Do adolescent girls eat fruit? We all know that fruit is good for us. Here is a histogram 
of the number of servings of fruit per day claimed by 74 seventeen-year-old girls ina 
study in Pennsylvania:44 


15 


Number of subjects 


0 1 2 E a 5 6 7 8 
Servings of fruit per day 


a. Find the median number of servings of fruit per day from the histogram. Explain your 
method clearly. 


b. Calculate the mean of the distribution. Show your work. 


96. Shakespeare The histogram shows the distribution of lengths of words used in 
Shakespeare’s plays.42 


2 4 6 8 10 
Word length 


a. Find the median word length in Shakespeare’s plays from the histogram. Explain your 
method clearly. 


b. Calculate the mean of the distribution. Show your work. 


97. Quiz grades Refer to Exercise 87. 


a. Find the range of Joey’s first 14 quiz grades and the range of Joey’s quiz grades after 
his unexcused absence. 


b. Explain what part (a) suggests about using the range as a measure of variability for a 
distribution of quantitative data. 


98. Pulse rates Refer to Exercise 88. 


a. Find the range of the pulse rates for all 19 students and the range of the pulse rates 
excluding the student with the medical issue. 


b. Explain what part (a) suggests about using the range as a measure of variability for a 
distribution of quantitative data. 


99. pg 61 2) Foot lengths Here are the foot lengths (in centimeters) for a random sample 
of seven 14-year-olds from the United Kingdom: 
25 22 20 25 24 24 28 


Calculate the standard deviation. Interpret this value. 


100. Well rested? A random sample of 6 students in a first-period statistics class was asked 
how much sleep (to the nearest hour) they got last night. Their responses were 6, 7, 7, 8, 
10, and 10. Calculate the standard deviation. Interpret this value. 


101. File sizes How much storage space does your music use? Here is a dotplot of the file sizes 
(to the nearest tenth of a megabyte) for 18 randomly selected files on Nathaniel’s mp3 
player: 


1 2 3 4 3 6 7 8 
File size (mb) 


a. The distribution of file size has a mean of x _=3.2% = 3.2 megabytes and a standard 
deviation of s, = 1.9 megabytes. Interpret the standard deviation. 


b. Suppose the music file that takes up 7.5 megabytes of storage space is replaced with 
another version of the file that only takes up 4 megabytes. How would this affect the 
mean and the standard deviation? Justify your answer. 


102. Healthy fast food? Here is a dotplot of the amount of fat (to the nearest gram) in 12 
different hamburgers served at a fast-food restaurant: 
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a. The distribution of fat content has a mean of x =22.83@ = 22.83 grams and a standard 
deviation of s, = 9.06 grams. Interpret the standard deviation. 


b. Suppose the restaurant replaces the burger that has 22 grams of fat with a new burger 
that has 35 grams of fat. How would this affect the mean and the standard deviation? 
Justify your answer. 


103. Comparing SD Which of the following distributions has a smaller standard deviation? 
Justify your answer. 
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104. Comparing SD The parallel dotplots show the lengths (in millimeters) of a sample of 11 
nails produced by each of two machines. Which distribution has the larger standard 
deviation? Justify your answer. 
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105. pg 64 |») File sizes Refer to Exercise 101. Find the interquartile range of the file size 
distribution shown in the dotplot. 


106. Healthy fast food? Refer to Exercise 102. Find the interquartile range of the fat content 
distribution shown in the dotplot. 


107. pg 67 |») File sizes Refer to Exercises 101 and 105. Identify any outliers in the 
distribution. Show your work. 


108. Healthy fast food? Refer to Exercises 102 and 106. Identify any outliers in the 
distribution. Show your work. 


109. Shopping spree The figure displays computer output for data on the amount spent by 50 
grocery shoppers. 


x £ Sy Min Q1 Med Q3 Max 
Amount spent 34.70 21.70 S.L1 19.27 27.86 45.40 93.34 


a. What would you guess is the shape of the distribution based only on the computer 
output? Explain. 


b. Interpret the value of the standard deviation. 


c. Are there any outliers? Justify your answer. 


110. C-sections A study in Switzerland examined the number of cesarean sections (surgical 
deliveries of babies) performed in a year by samples of male and female doctors. Here 
are summary Statistics for the two distributions: 


x £ S. Min Q, Med Q3 + Max 
Male doctors 41.333 20.607 20 27 34 50 86 
Female doctors 19.1 10.126 5 10 18.5 29 33 


a. Based on the computer output, which distribution would you guess has a more 
symmetrical shape? Explain your answer. 


b. Explain how the IQRs of these two distributions can be so similar even though the 
standard deviations are quite different. 


c. Does either distribution have any outliers? Justify your answer. 


111. pg 69 |») Don’t call me According to a study by Nielsen Mobile, “Teenagers ages 13 


to 17 are by far the most prolific texters, sending 1742 messages a month.” Mr. 
Williams, a high school statistics teacher, was skeptical about the claims in the article. 
So he collected data from his first-period statistics class on the number of text messages 
they had sent in the past 24 hours. Here are the data: 


0 7 1 29 285 8 5 1 2 98 9 O 26 
8 118 72 O 92 52 14 3 3 44 5 42 


a. Make a boxplot of these data. 
b. Use the boxplot you created in part (a) to explain how these data seem to contradict 
the claim in the article. 
112. Acing the first test Here are the scores of Mrs. Liao’s students on their first statistics test: 
93 «493 875 91 945 72 96 95 935 935 73. 


82 45 88 80 86 855 875 81 78 86 89 
92 91 98 85 82.5 88 945 43 


a. Make a boxplot of these data. 
b. Use the boxplot you created in part (a) to describe how the students did on Mrs. 
Liao’s first test. 


113. Electing the president Refer to Exercise 91. Here are a boxplot and some numerical 
summaries of the electoral vote data: 


Electoral votes 


Variable N Mean SD Min Q4 Median Q3 Max 
Electoral votes SL 10.55 9.69 3 4 8 12 55 


a. Explain why the median and IQR would be a better choice for summarizing the center 
and variability of the distribution of electoral votes than the mean and standard 
deviation. 


b. Identify an aspect of the distribution that the stemplot in Exercise 91 reveals that the 
boxplot does not. 


114. Birthrates in Africa Refer to Exercise 92. Here are a boxplot and some numerical 


summaries of the birthrate data: 


10 20 30 40 50 
Birthrate (per 1000 individuals) 


Variable N Mean SD Min Q1 Median Q3 Max 
Birthrate 54 = 34.91 8.57 14.00 29.00 37.50 41.00 53.00 


a. Explain why the median and IQR would be a better choice for summarizing the center 
and variability of the distribution of birthrates in African countries than the mean and 
standard deviation. 


b. Identify an aspect of the distribution that the dotplot in Exercise 92 reveals that the 
boxplot does not. 


115. pg 71 |») Energetic refrigerators Consumer Reports magazine rated different types of 


refrigerators, including those with bottom freezers, those with top freezers, and those 
with side freezers. One of the variables they measured was annual energy cost (in 
dollars). The following boxplots show the energy cost distributions for each of these 
types. Compare the energy cost distributions for the three types of refrigerators. 
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116. Income in New England The following boxplots show the total income of 40 randomly 
chosen households each from Connecticut, Maine, and Massachusetts, based on U.S. 
Census data from the American Community Survey. Compare the distributions of annual 
incomes in the three states. 


Maine HW * 
Massachusetts H [| ;_—— * 
Connecticut —{T RH 


0 50 100 150 200 250 300 350 400 450 500 
Annual household income ($1000s) 


117. Who texts more? For their final project, a group of AP® Statistics students wanted to 
compare the texting habits of males and females. They asked a random sample of 
students from their school to record the number of text messages sent and received over 
a two-day period. Here are their data: 


127 44 28 83 0 6 78 6 
Males 
5 213 73 20 214 28 11 
112 203 102 54 379 305 179 24 
Females 
127 65 Al 27 298 6 130 0 


a. Make parallel boxplots of the data. 
b. Use your calculator to compute numerical summaries for both samples. 


c. Do these data suggest that males and females at the school differ in their texting 
habits? Use the results from parts (a) and (b) to support your answer. 


118. SSHA scores Here are the scores on the Survey of Study Habits and Attitudes (SSHA) for 
a random sample of 18 first-year college women: 


154 109 137 115 152 140 154 178 101 
103 126 126 137 165 165 129 200 148 


Here are the SSHA scores for a random sample of 20 first-year college men: 


108 140 114 91 180 115 126 
92 169 146 109 132 15 88 
113 151 70 115 187 104 


Note that high scores indicate good study habits and attitudes toward learning. 
a. Make parallel boxplots of the data. 
b. Use your calculator to compute numerical summaries for both samples. 


c. Do these data support the belief that men and women differ in their study habits and 
attitudes toward learning? Use your results from parts (a) and (b) to support your 
answer. 


119. Income and education level Each March, the Bureau of Labor Statistics compiles an 
Annual Demographic Supplement to its monthly Current Population Survey.“ Data on 
about 71,067 individuals between the ages of 25 and 64 who were employed full-time 
were collected in one of these surveys. The parallel boxplots compare the distributions of 
income for people with five levels of education. This figure is a variation of the boxplot 
idea: because large data sets often contain very extreme observations, we omitted the 
individuals in each category with the top 5% and bottom 5% of incomes. Also, the 
whiskers are drawn all the way to the maximum and minimum values of the remaining 


data for each distribution. 
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Use the graph to help answer the following questions. 
a. What shape do the distributions of income have? 


b. Explain how you know that there are outliers in the group that earned an advanced 
degree. 


c. How does the typical income change as the highest education level reached increases? 
Why does this make sense? 


d. Describe how the variability in income changes as the highest education level reached 
increases. 


120. Sleepless nights Researchers recorded data on the amount of sleep reported each night 
during a week by a random sample of 20 high school students. Here are parallel boxplots 
comparing the distribution of time slept on all 7 nights of the study:44 
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Use the graph to help answer the following questions. 
a. Which distributions have a clear left-skewed shape? 


b. Which outlier stands out the most, and why? 


c. How does the typical amount of sleep that the students got compare on these seven 
nights? 


d. On which night was there the most variation in how long the students slept? Justify 
your answer. 


121. SD contest This is a standard deviation contest. You must choose four numbers from the 
whole numbers 0 to 10, with repeats allowed. 


a. Choose four numbers that have the smallest possible standard deviation. 
b. Choose four numbers that have the largest possible standard deviation. 


c. Is more than one choice possible in either part (a) or (b)? Explain. 


122. What do they measure? For each of the following summary statistics, decide (i) whether 
it could be used to measure center or variability and (ii) whether it is resistant. 


Q1+93 
a. Q1+Q32. 2 


Max—Min 
b. Max — Min2 2 


Multiple Choice Select the best answer for Exercises 123-126. 
123. If a distribution is skewed to the right with no outliers, which expression is correct? 
a. mean < median 
b. mean * median 
c. mean = median 
d. mean > median 


e. We can’t tell without examining the data. 


124. The scores on a statistics test had a mean of 81 and a standard deviation of 9. One student 
was absent on the test day, and his score wasn’t included in the calculation. If his score 
of 84 was added to the distribution of scores, what would happen to the mean and 
standard deviation? 


a. Mean will increase, and standard deviation will increase. 
b. Mean will increase, and standard deviation will decrease. 
c. Mean will increase, and standard deviation will stay the same. 
d. Mean will decrease, and standard deviation will increase. 


e. Mean will decrease, and standard deviation will decrease. 


125. The stemplot shows the number of home runs hit by each of the 30 Major League 
Baseball teams in a single season. Home run totals above what value should be 
considered outliers? 
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a. 173 
b. 210 
c. 222 
d. 229 


e& 257 


126. Which of the following boxplots best matches the distribution shown in the histogram? 
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Recycle and Review 


127. How tall are you? (1.2) We used Census At School’s “Random Data Selector” to choose 
a sample of 50 Canadian students who completed a survey in a recent year. Here are the 
students’ heights (in centimeters): 


166.5 170.0 1780 163.0 1505 169.0 173.0 169.0 171.0 166.0 
190.0 183.0 1780 161.0 171.00 1700 1910 1685 1785 173.0 
175.0 1605 1660 1640 163.00 1740 1600 1740 182.0 167.0 
166.0 170.0 1700 181.00 171.5 1600 1780 157.0 165.0 187.0 
168.0 1575 1455 1560 182.0 1685 177.0 162.5 1605 185.5 


Make an appropriate graph to display these data. Describe the shape, center, and 
variability of the distribution. Are there any outliers? 


128. Success in college (1.1) The Freshman Survey asked first-year college students about 
their “habits of mind”—specific behaviors that college faculty have identified as being 
important for student success. One question asked students, “How often in the past year 
did you revise your papers to improve your writing?” Another asked, “How often in the 
past year did you seek feedback on your academic work?” The figure is a bar graph 
comparing the percent of males and females who answered “frequently” to these two 


questions. 45 


i Male {J Female 


% “Frequently” 


Revise papers to Seek feedback 
improve writing on work 


What does the graph reveal about the habits of mind of male and female college 
freshmen? 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Using data from the 2010 census, a random sample of 348 U.S. residents aged 18 and 
older was selected. Among the variables recorded were gender (male or female), housing 
status (rent or own), and marital status (married or not married). 


The two-way table below summarizes the relationship between gender and housing 
status. 


Male Female Total 
Own 132 122 254 
Rent 50 44 94 
Total 182 166 348 


a. What percent of males in the sample own their home? 
b. Make a graph to compare the distribution of housing status for males and females. 


c. Using your graph from part (b), describe the relationship between gender and 
housing status. 


d. The two-way table below summarizes the relationship between marital status and 
housing status. 


Married Not Married Total 
Own 172 82 254 
Rent 40 54 94 
Total 212 136 348 


For the members of the sample, is the relationship between marital status and 


housing status stronger or weaker than the relationship between gender and housing 


status that you described in part (c)? Justify your choice using the data provided in 
the two-way tables. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.comitps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own score. 


Chapter 1 Review 


Introduction: Data Analysis: Making Sense of Data 


In this brief section, you learned several fundamental concepts that will be important 
throughout the course: the idea of a distribution and the distinction between quantitative and 
categorical variables. You also learned a strategy for exploring data: 


e Begin by examining each variable by itself. Then move on to study relationships between 
variables. 


e Start with a graph or graphs. Then add numerical summaries. 


Section 1.1: Analyzing Categorical Data 


In this section, you learned how to display the distribution of a single categorical variable with 
bar graphs and pie charts and what to look for when describing these displays. Remember to 
properly label your graphs! Poor labeling is an easy way to lose points on the AP® Statistics 
exam. You should also be able to recognize misleading graphs and be careful to avoid making 
misleading graphs yourself. 


Next, you learned how to investigate the relationship between two categorical variables. 
Using a two-way table, you learned how to calculate and display marginal and joint relative 
frequencies. Conditional relative frequencies and side-by-side or segmented bar graphs allow 
you to look for an association between the variables. If there is no association between the two 
variables, comparative bar graphs of the distribution of one variable for each value of the other 
variable will be identical. If differences in the corresponding conditional relative frequencies 
exist, there is an association between the variables. That is, knowing the value of one variable 
helps you predict the value of the other variable. 


Section 1.2: Displaying Quantitative Data with Graphs 


In this section, you learned how to create three different types of graphs for a quantitative 
variable: dotplots, stemplots, and histograms. Each of the graphs has distinct benefits, but all of 
them are good tools for examining the distribution of a quantitative variable. Dotplots and 
stemplots are handy for small sets of data. Histograms are the best choice when there are a 
large number of observations. On the AP® exam, you will be expected to create each of these 
types of graphs, label them properly, and comment on their characteristics. 


When you are describing the distribution of a quantitative variable, you should look at its 
graph for the overall pattern (shape, center, variability) and striking departures from that pattern 
(outliers). Use the acronym SOCV (shape, outliers, center, variability) to help remember these 
four characteristics. When comparing distributions, you should include explicit comparison 
words for center and variability such as “is greater than” or “is approximately the same as.” 
When asked to compare distributions, a very common mistake on the AP® exam is describing 


the characteristics of each distribution separately without making these explicit comparisons. 


Section 1.3: Describing Quantitative Data with Numbers 

To measure the center of a distribution of quantitative data, you learned how to calculate the 
mean and the median of a distribution. You also learned that the median is a resistant measure 
of center, but the mean isn’t resistant because it can be greatly affected by skewness or outliers. 


To measure the variability of a distribution of quantitative data, you learned how to 
calculate the range, standard deviation, and interquartile range. The standard deviation is the 
most commonly used measure of variability and approximates the typical distance of a value in 
the data set from the mean. The standard deviation is not resistant—it is heavily affected by 
extreme values. The interquartile range (IQR) is a resistant measure of variability because it 
ignores the upper 25% and lower 25% of the distribution, but the range isn’t resistant because it 
uses only the minimum and maximum value. 


To identify outliers in a distribution of quantitative data, you learned the 1.5 x JQR rule. 
You also learned that boxplots are a great way to visually summarize a distribution of 
quantitative data. Boxplots are helpful for comparing the center (median) and variability 
(range, IQR) for multiple distributions. Boxplots aren’t as useful for identifying the shape of a 
distribution because they do not display peaks, clusters, gaps, and other interesting features. 


What Did You Learn? 
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Example on Relevant Chapter Review 
Learning Target Section Page(s) Exercise(s) 
Identify the individuals and Intro 2 Rit 
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Chapter 1 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R1.1 \») Who buys cars? A car dealer keeps records on car buyers for future marketing 


purposes. The table gives information on the last 4 buyers. 


Buyer’s 

distance 
Buyer’s from dealer 
name Zip code Gender (mi) Car model Model year Price 
P. Smith 27514 M 13 Fiesta 2018 $26,375 
K. Ewing 27510 M 10 Mustang 2015 $39,500 
L. Shipman 27516 F 2 Fusion 2016 $38,400 
S. Reice 27243 F 4 F-150 2016 $56,000 


a. Identify the individuals in this data set. 


b. What variables were measured? Classify each as categorical or quantitative. 


R1.2 _») I want candy! Mr. Starnes bought some candy for his AP® Statistics class to eat on 


R1.3 


Halloween. He offered the students an assortment of Snickers®, Milky Way®, 
Butterfinger®, Twix®, and 3 Musketeers® candies. Each student was allowed to choose 
one option. Here are the data on the type of candy selected. Make a relative frequency 
bar graph to display the data. Describe what you see. 


Twix Snickers Butterfinger 
Butterfinger Snickers Snickers 

3 Musketeers Snickers Snickers 
Butterfinger Twix Twix 

Twix Twix Twix 
Snickers Snickers Twix 
Snickers Milky Way Twix 

Twix Twix Butterfinger 
Milky Way Butterfinger 3 Musketeers 
Milky Way Butterfinger Butterfinger 


») Id die without my phone! In a survey of over 2000 U.S. teenagers by Harris 


Interactive, 47% said that “their social life would end or be worsened without their cell 
phone.”42 One survey question asked the teens how important it is for their phone to 
have certain features. The following figure displays data on the percent who indicated 


that a particular feature is vital. 
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a. Explain how the graph gives a misleading impression. 
b. Would it be appropriate to make a pie chart to display these data? Why or why not? 
R1.4 |») Facebook and age Is there a relationship between Facebook use and age among 


college students? The following two-way table displays data for the 219 students who 
responded to the survey.“ 


Age 
Younger (18-22) | Middle (23-27) | Older (28 and up) 
21 
Facebook user? 
46 


. What percent of the students who responded were Facebook users? 


oOo go 


. What percent of the students in the sample were aged 28 or older? 


ie) 


. What percent of the students who responded were older Facebook users? 


jae 


. What percent of the Facebook users in the sample were younger students? 
R1.5 _») Facebook and age Refer to the preceding exercise. 

a. Find the distribution of Facebook use for each of the three age groups. Make a 
segmented bar graph to compare these distributions. 


b. Describe what the graph in part (a) reveals about the association between age and 
Facebook use. 
R1.6 |») Density of the earth In 1798, the English scientist Henry Cavendish measured the 


density of the earth several times by careful work with a torsion balance. The variable 
recorded was the density of the earth as a multiple of the density of water. Here are 
Cavendish’s 29 measurements:42 


5.50 5.61 4.88 5.07 5.26 5.55 5.36 5.29 5.58 5.65 
B00 5.53 5.62 5.29 9.44 5.34 5.19 5.10 5.27 5.39 
5.42 5.47 5.63 5.34 5.46 5.30 age) 5.68 5.85 


a. Make a stemplot of the data. 
b. Describe the distribution of density measurements. 


c. The currently accepted value for the density of earth is 5.51 times the density of 
water. How does this value compare to the mean of the distribution of density 
measurements? 


R1.7 \») Guinea pig survival times Here are the survival times (in days) of 72 guinea pigs 


after they were injected with infectious bacteria in a medical experiment.42 Survival 
times, whether of machines under stress or cancer patients after treatment, usually have 
distributions that are skewed to the right. 


43 45 53 56 56 o7 58 66 67 13 14 79 
80 80 81 81 81 82 83 83 84 88 89 91 
91 92 92 97 99 99 100 100 101 102 102 102 
103 104 107 108 109 113 114 118 a7 123 126 128 
137 138 139 144 145 147 156 162 174 178 179 184 
191 198 211 214 243 249 Pas) 380 403 on BE 522 598 


a. Make a histogram of the data. Does it show the expected right skew? 
b. Now make a boxplot of the data. 


c. Compare the histogram from part (a) with the boxplot from part (b). Identify an 
aspect of the distribution that one graph reveals but the other does not. 


R1.8 |») Household incomes Rich and poor households differ in ways that go beyond 


income. Here are histograms that compare the distributions of household size (number of 
people) for low-income and high-income households.22 Low-income households had 
annual incomes less than $15,000, and high-income households had annual incomes of at 
least $100,000. 


60 


Percent 


ives We & & F 


Household size, low income 


Percent 


I fF Sse SS 8 F 


Household size, high income 


a. About what percent of each group of households consisted of four or more people? 


b. Describe the similarities and differences in these two distributions of household size. 


Exercises R1.9 and R1.10 refer to the following setting. Do you like to eat tuna? Many people 
do. Unfortunately, some of the tuna that people eat may contain high levels of mercury. 
Exposure to mercury can be especially hazardous for pregnant women and small children. How 
much mercury is safe to consume? The Food and Drug Administration will take action (like 
removing the product from store shelves) if the mercury concentration in a 6-ounce can of tuna 
is 1.00 ppm (parts per million) or higher. 


What is the typical mercury concentration in cans of tuna sold in stores? A study conducted 
by Defenders of Wildlife set out to answer this question. Defenders collected a sample of 164 
cans of tuna from stores across the United States. They sent the selected cans to a laboratory 
that is often used by the Environmental Protection Agency for mercury testing.24 


R1.9 \») Mercury in tuna Here are a dotplot and numerical summaries of the data on 


mercury concentration in the sampled cans (in parts per million, ppm): 
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Mercury concentration (ppm) 
N Mean 
164 0.285 
Q 1 Med 
0.071 0.180 


a. Interpret the standard deviation. 


b. Determine whether there are any outliers. 


T T 
0.50 0.75 1.00 1.25 1.50 


0.380 


c. Explain why the mean is so much larger than the median of the distribution. 


R1.10 |») Mercury in tuna Is there a difference in the mercury concentration of light tuna 


and albacore tuna? Use the parallel boxplots and the computer output to write a few 
sentences comparing the two distributions. 
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Mercury concentration (ppm) 


N Mean 
20 0.401 
144 0.269 
Q1 Med 
0.293 0.400 
0.059 0.160 


SD 
0.152 
0.312 

Q3 
0.460 
0.347 


Chapter 1 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T1.1 You record the age, marital status, and earned income of a sample of 1463 women. The 
number and type of variables you have recorded are 


a. 3 quantitative, 0 categorical. 
b. 4 quantitative, 0 categorical. 
c. 3 quantitative, 1 categorical. 
d. 2 quantitative, 1 categorical. 
e. 1 quantitative, 2 categorical. 


T1.2 The students in Mr. Tyson’s high school statistics class were recently asked if they 
would prefer a pasta party, a pizza party, or a donut party. The following bar graph 
displays the data. 


Frequency 
> ee 
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Donut Pasta Pizza 


Preferred party 

This graph is misleading because 

a. it should be a histogram, not a bar graph. 

b. there should not be gaps between the bars. 

c. the bars should be arranged in decreasing order by height. 
d. the vertical axis scale should start at 0. 


e. preferred party should be on the vertical axis and number of students should be on 
the horizontal axis. 


T1.3 Forty students took a statistics test worth 50 points. The dotplot displays the data. The 
third quartile is 


Test score 


45. 
a2, 
c. 44. 
23 
e. 43. 


7 


A. 


Questions T1.4—T1.6 refer to the following setting. Realtors collect data in order to serve their 
clients more effectively. In a recent week, data on the age of all homes sold in a particular area 
were collected and displayed in this histogram. 

10 
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05 15 25 35 45 55 65 75 
Age of home (yr) 
T1.4 Which of the following could be the median age? 
a. 19 years 
b. 34 years 
c. 24 years 
d. 39 years 
e. 29 years 
T1.5 Which of the following is most likely true? 
a. Mean > median, range < JQR 
b. mean < median, range < IQR 
c. mean > median, range > IQR 
d. mean < median, range > IQR 
e. mean = median, range > IQR 


T1.6 The standard deviation of the distribution of house age is about 16 years. Interpret this 
value. 


a. The age of all houses in the sample is within 16 years of the mean. 
b. The gap between the youngest and oldest house is 16 years. 


c. The age of all the houses in the sample is 16 years from the mean. 


d. The gap between the first quartile and the third quartile is 16 years. 


e. The age of the houses in the sample typically varies by about 16 years from the 
mean age. 


T1.7 The mean salary of all female workers is $35,000. The mean salary of all male workers 
is $41,000. What must be true about the mean salary of all workers? 


a. It must be $38,000. 

b. It must be larger than the median salary. 

c. It could be any number between $35,000 and $41,000. 
d. It must be larger than $38,000. 

e. It cannot be larger than $40,000. 


Questions T1.8 and 11.9 refer to the following setting. A survey was designed to study how 
business operations vary by size. Companies were classified as small, medium, or large. 
Questionnaires were sent to 200 randomly selected businesses of each size. Because not all 
questionnaires are returned, researchers decided to investigate the relationship between the 
response rate and the size of the business. The data are given in the following two-way table. 


Business size 


Small Medium Large 
Yes 40 
Response? 
No 160 


T1.8 What percent of all small companies receiving questionnaires responded? 
a. 12.5% 
b. 50.8% 
c. 20.8% 
d. 62.5% 
e. 33.3% 

T1.9 Which of the following conclusions seems to be supported by the data? 
a. There are more small companies than large companies in the survey. 


b. Small companies appear to have a higher response rate than medium or big 
companies. 


c. Exactly the same number of companies responded as didn’t respond. 
d. Overall, more than half of companies responded to the survey. 


e. If we combined the medium and large companies, then their response rate would be 
equal to that of the small companies. 


T1.10 An experiment was conducted to investigate the effect of a new weed killer to prevent 
weed growth in onion crops. Two chemicals were used: the standard weed killer (S) and 
the new chemical (N). Both chemicals were tested at high and low concentrations on 50 
test plots. The percent of weeds that grew in each plot was recorded. Here are some 
boxplots of the results. 


S—low conc. ee ee 
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T T T T T 
0 10 20 30 40 50 


Percent of weeds that grew 
Which of the following is not a correct statement about the results of this experiment? 


a. At both high and low concentrations, the new chemical results in better weed control 
than the standard weed killer. 


b. For both chemicals, a smaller percentage of weeds typically grew at higher 
concentrations than at lower concentrations. 


c. The results for the standard weed killer are less variable than those for the new 
chemical. 


d. High and low concentrations of either chemical have approximately the same effects 
on weed growth. 


e. Some of the results for the low concentration of weed killer show a smaller 
percentage of weeds growing than some of the results for the high concentration. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T1.11 You are interested in how many contacts older adults have in their smartphones. Here 
are data on the number of contacts for a random sample of 30 elderly adults with 
smartphones in a large city: 


7 20 24 25 25 28 28 30 22 35 
42 43 44 45 46 47 48 48 50 91 


72 15 77 78 79 83 87 88 135 151 


a. Construct a histogram of these data. 
b. Are there any outliers? Justify your answer. 


c. Would it be better to use the mean and standard deviation or the median and IQR to 
describe the center and variability of this distribution? Why? 


71.12 A study among the Pima Indians of Arizona investigated the relationship between a 
mother’s diabetic status and the number of birth defects in her children. The results 
appear in the two-way table. 


Diabetic status 
Nondiabetic | Prediabetic | Diabetic 
None 38 
One or more 31 i3 9 


Number of birth defects 


a. What proportion of the women in this study had a child with one or more birth 
defects? 


b. What percent of the women in this study were diabetic or prediabetic, and had a 
child with one or more birth defects? 


c. Make a segmented bar graph to display the distribution of number of birth defects 
for the women with each of the three diabetic statuses. 


d. Describe the nature of the association between mother’s diabetic status and number 
of birth defects for the women in this study. 


T1.13 The back-to-back stemplot shows the lifetimes of several Brand X and Brand Y 
batteries. 
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a. What is the longest that any battery lasted? 
b. Give a reason someone might prefer a Brand X battery. 
c. Give a reason someone might prefer a Brand Y battery. 


T1.14 Catherine and Ana suspect that athletes (i.e., students who have been on at least one 
varsity team) typically have a faster reaction time than other students. To test this 
theory, they gave an online reflex test to 33 varsity athletes at their school and 29 other 
students. Here are parallel boxplots and numerical summaries of the data on reaction 
times (in milliseconds) for the two groups of students. Write a few sentences comparing 
the distribution of reaction time for the two types of students. 


=| 
s 
Ee 
7 ii: 
> 
150 200 250 300 350 400 450 500 
Reaction time (msec) 
Student n Mean StDev Min Q1 Med Q3 Max 
Other 29 297.3 65.9 197.0 255.0 292.0 325.0 478.0 


Athlete 33 270.1 Bi 189.6 236.0 261.0 300.0 398.0 


Chapter 1 Project American Community Survey 


Each month, the U.S. Census Bureau selects a random sample of about 300,000 U.S. 
households to participate in the American Community Survey (ACS). The chosen households 
are notified by mail and invited to complete the survey online. The Census Bureau follows up 
on any uncompleted surveys by phone or in person. Data from the ACS are used to determine 
how the federal government allocates over $400 billion in funding for local communities. 

The file acs survey ch1 project.xls, which can be accessed from the book’s website at 
highschool.bfwpub.com/tps6e, contains data for 3000 randomly selected households in one 
month’s ACS survey. Download the file to a computer for further analysis using the application 
specified by your teacher. 

Each row in the spreadsheet describes a household. A serial number that identifies the 
household is in the first column. The other columns contain values of several variables. See the 
code sheet on the book’s website for details on how each variable is recorded. Note that all the 
categorical variables have been coded to have numerical values in the spreadsheet. 


Use the files provided to answer the following questions. 
. How many variables are recorded? Classify each one as categorical or quantitative. 


’. Examine the distribution of location (division or region) for the households in the sample. 
Make a bar graph to display the data. Then calculate numerical summaries (counts, percents, 
or proportions). Describe what you see. 


3. Explore the relationship between two categorical variables of interest to you. Summarize the 
data in a two-way table. Then calculate appropriate conditional relative frequencies and 
make a side-by-side or segmented bar graph. Write a few sentences comparing the 
distributions. 


|. Analyze the distribution of household income (HINCP) using appropriate graphs and 
numerical summaries. 


». Compare the distribution of a quantitative variable that interests you in two or more groups. 
For instance, you might compare the distribution of number of people in a family (NPF) by 
region. Make appropriate graphs and calculate numerical summaries. Then write a few 
sentences comparing the distributions. 
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Chapter 2 Review Exercises 


Chapter 2 AP® Statistics Practice Test 


INTRODUCTION 


Suppose Emily earns 43 out of 50 points on a statistics test. Should she be satisfied or 
disappointed with her performance? That depends on how her score compares with the scores 
of the other students who took the test. If 43 is the highest score, Emily might be very pleased. 
Maybe her teacher will “scale” the grades so that Emily’s 43 becomes an “A.” But if Emily’s 
43 falls below the class average, she may not be so happy. 


Section 2.1 focuses on describing the location of an individual within a distribution. We 
begin by discussing a familiar measure of location: percentiles. Next, we introduce a new type 
of graph that is useful for displaying percentiles. Then we consider another way to describe an 
individual’s location that is based on the mean and standard deviation. In the process, we 
examine the effects of transforming data on the shape, center, and variability of a distribution. 


Sometimes it is helpful to use graphical models called density curves to describe the 
location of individuals within a distribution, rather than relying on actual data values. Such 
models are especially helpful when data fall in a bell-shaped pattern called a Normal 
distribution. Section 2.2 examines the properties of Normal distributions and shows you how to 
perform useful calculations with them. 


ACTIVITY Where do | stand? 


In this activity, you and your classmates will explore ways to describe where you stand 
(literally!) within a distribution. 


Henrik Sorensen/Getty Images 


1. Your teacher will mark out a number line on the floor with a scale running from about 58 
to 78 inches. 


2. Make a human dotplot. Each member of the class should stand at the appropriate location 
along the number line scale based on height (to the nearest inch). 


3. Your teacher will make a copy of the dotplot on the board for your reference. Describe the 


class’s distribution of heights. 


4. What percent of the students in the class have heights less than yours? This percentile is 
one way to measure your location in the distribution of heights. 


5. Calculate the mean and standard deviation of the class’s distribution of height from the 
dotplot. Confirm these values with your classmates. 


6. Does your height fall above or below the mean? How far above or below the mean is it? 
How many standard deviations above or below the mean is it? This standardized score 
(also called a z-score) is another way to measure your location in the class’s distribution of 
heights. 


7. Class discussion: What would happen to the class’s distribution of height if you converted 
each data value from inches to centimeters? (There are 2.54 centimeters in 1 inch.) How 
would this change of units affect the shape, center, variability, and the measures of 
location (percentile and z-score) that you calculated? 


Want to know more about where you stand—in terms of height, weight, or even body 
mass index? Do a web search for “Clinical Growth Charts” at the National Center for Health 


Statistics site, www.cdc.gov/nchs. 


SECTION 2.1 Describing Location in a Distribution 


LEARNING TARGETS By the end of the section, you should be able to: 
Find and interpret the percentile of an individual value in a distribution of data. 


Estimate percentiles and individual values using a cumulative relative frequency 
graph. 


Find and interpret the standardized score (z-score) of an individual value ina 
distribution of data. 


Describe the effect of adding, subtracting, multiplying by, or dividing by a 
constant on the shape, center, and variability of a distribution of data. 


There are 25 students in Mr. Tabor’s statistics class. He gives them a first test worth 50 points. 
Here are the students’ scores: 


35 18 37 38 42 41 2 37 36 32 12 43 31 
29, 32 48 44 45 38 40 45 38 38 40 22 


The score marked in red is Emily’s 43. How did she perform on this test relative to her 
classmates? 


Figure 2.1 displays a dotplot of the class’s test scores, with Emily’s score marked in red. 
The distribution is skewed to the left with some possible low outliers. From the dotplot, we can 
see that Emily’s score is above the mean (balance point) of the distribution. We can also see 
that Emily did better on the test than most other students in the class. 


Test score 


FIGURE 2.1 Dotplot of scores (out of 50 points) on Mr. Tabor’s first statistics test. Emily’s score 
of 43 is marked in red. 


Measuring Location: Percentiles 
One way to describe Emily’s location in the distribution of test scores is to calculate her 


percentile. 


DEFINITION Percentile 


An individual's percentile is the percent of values in a distribution that are less 
than the individual’s data value. 


Using the dotplot, we see that Emily’s 43 is the fifth highest score in the class. Because 20 
of the 25 observations (80%) are below her score, Emily is at the 80th percentile in the class’s 
test score distribution. 


@ Be careful with your language when describing percentiles. Percentiles are specific 


locations in a distribution, so an observation isn’t “in” the 80th percentile. Rather, it is “at” the 
80th percentile. 


EXAMPLE |Mr. Tabor’s first test (>) 
Finding and interpreting percentiles 


antoniodiaz/Shutterstock.com 


PROBLEM: Refer to the dotplot of 25 scores on Mr. Tabor’s first statistics test. 
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a. Find the percentile for Jacob, who scored 18 on the test. 


b. Maria’s test score is at the 48th percentile of the distribution. Interpret this value in 
context. What score did Maria earn? 


SOLUTION: 


a. iW 5 = 0.04, So Jacob scored at the 4th percentile on this test. 


Only 1 of the 25 scores in the class is less than Jacob’s 18. 


b. 48% of students in the class earned a lower test score than Maria. Because (0.48) 


(2 5)=12(0-48) (25) = 12 Maria scored higher than 12 of the 25 students. Maria earned 
a 38 on the test. 


Three other students in the class scored a 38 on the test. These students’ scores 


are also at the 48th percentile because 12 of the 25 students in the class earned 
lower scores. 


FOR PRACTICE, TRY EXERCISE 1 


Note: Some people define the pth percentile of a distribution as the value with p percent of 
observations less than or equal to it. Using this alternative definition of percentile, it is possible 
for an individual to fall at the 100th percentile. If we used this definition, Jacob’s score of 18 
would fall at the 8th percentile (2 of 25 scores were less than or equal to 18). Calculating 
percentiles is not an exact science, especially with small data sets! 


The median of a distribution is roughly the 50th percentile. For instance, 38 is the median 
score on Mr. Tabor’s first test. As you saw in part (b) of the example, Maria’s score of 38 put 
her at the 48th percentile of the distribution. The first quartile Q; is roughly the 25th percentile 


of a distribution because it separates the lowest 25% of values from the upper 75%. Likewise, 
the third quartile Q3 is roughly the 75th percentile. 
o@ A high percentile is not always a good thing. For example, a man whose cholesterol 


level is at the 90th percentile for his age group may need treatment for his high cholesterol! 


Cumulative Relative Frequency Graphs 


There are some interesting graphs that can be made with percentiles. One of the most common 
graphs starts with a frequency table for a quantitative variable. For instance, the frequency table 
on the next page summarizes the ages of the first 45 U.S. presidents when they took office. 


Age Frequency 
40-44 2 
45-49 a 
50-54 13 
55-59 12 
60-64 7 
65-69 3 
70-74 a 


Let’s expand this table to include columns for relative frequency, cumulative frequency, 
and cumulative relative frequency. 


e To get the values in the relative frequency column, divide the count in each interval by 45, 
the total number of presidents. Multiply by 100 to convert to a percent. 


e To fill in the cumulative frequency column, add the counts in the frequency column for the 
current interval and all intervals with smaller values of the variable. 


e For the cumulative relative frequency column, divide the entries in the cumulative frequency 
column by 45, the total number of presidents. Multiply by 100 to convert to a percent. 


Here is the original frequency table augmented with the relative frequency, cumulative 
frequency, and cumulative relative frequency columns: 


Cumulative Cumulative relative 
Age Frequency Relative frequency frequency frequency 
40-44 2 2/45 = 0.044 = 4.4% 2 2/45 = 0.044 = 4.4% 
45-49 7 7/45 = 0.156 = 15.6% 9 9/45 = 0.200 = 20.0% 
50-54 13 13/45 = 0.289 = 28.9% 22 22/45 = 0.489 = 48.9% 
55—59 12 12/45 = 0.267 = 26.7% 34 34/45 = 0.756 = 75.6% 
60-64 ri 7/45 = 0.156 = 15.6% 41 41/45 = 0.911 = 91.1% 
65-69 3 3/45 = 0.067 = 6.7% 44 44/45 = 0.978 = 97.8% 
70-74 1 1/45 = 0.022 = 2.2% 45 45/45 = 1.000 = 100% 


Now we can make a cumulative relative frequency graph. 


DEFINITION Cumulative relative frequency graph 


A cumulative relative frequency graph plots a point corresponding to the 
cumulative relative frequency in each interval at the smallest value of the next 
interval, starting with a point at a height of 0% at the smallest value of the 
firstinterval. Consecutive points are then connected with a line segment to form 
the graph. 


Some people refer to cumulative relative frequency graphs as ogives (pronounced “o-jives”). 


Figure 2.2 shows the completed cumulative relative frequency graph for the presidential 
age at inauguration data. Notice the following details: 


¢ The leftmost point is plotted at a height of 0% at age=4028¢ = 40, the smallest value in the 
first interval. This point tells us that none of the first 45 U.S. presidents took office before 
age 40. 


¢ The next point to the right is plotted at a height of 4.4% at age=45.28¢ = 45. This point tells 
us that 4.4% of these presidents (i.e., two of them) were inaugurated before they were 45 
years old. 


e The graph grows very gradually at first because few presidents were inaugurated when they 
were in their 40s. Then the graph gets very steep beginning at age 50 because most U.S. 
presidents were in their 50s when they were inaugurated. The rapid growth in the graph 


slows at age 60. 


e The rightmost point on the graph is plotted above age 75 and has cumulative relative 
frequency 100%. That’s because 100% of these U.S. presidents took office before age 75. 
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Age at inauguration (years) 
FIGURE 2.2 Cumulative relative frequency graph of the ages of U.S. presidents when they took 
office. 


A cumulative relative frequency graph can be used to describe the position of an individual 
within a distribution or to locate a specified percentile of the distribution. 


EXAMPLE | Ages of U.S. presidents (>) 
Interpreting a cumulative relative frequency graph 


Jae C. Hong/AP Photo 


PROBLEM: Use the graph in Figure 2.2 to help you answer each question. 
a. Was Barack Obama, who was first inaugurated at age 47, unusually young? 


b. Estimate and interpret the 65th percentile of the distribution. 


SOLUTION: 


Cumulative relative frequency (%) 


40 45 47 50 5S 60 65 70 #5 
Age at inauguration (years) 


Barack Obama’s inauguration age places him at about the 11th percentile. About 11% 
of the first 45 US. presidents first took office at a younger age than Obama did. So 
Obama was fairly young, but not unusually young, when he took office. 


To find President Obama’s location in the distribution, draw a vertical line up from 
his age (47) on the horizontal axis until it meets the graph. Then draw a horizontal 
line from this point to the vertical axis. 


Cumulative relative frequency (%) 


40 45 SO SSsp 60 65 FO FF 
Age at inauguration (years) 


The 6 5th percentile is about 58 years old. About 65% of the first 45 US. presidents 
were younger than 58 when they took office. 


The 65th percentile of the distribution is the age with cumulative relative frequency 
65%. To find this value, draw a horizontal line across from the vertical axis at a 


height of 65% until it meets the graph. Then draw a vertical line from this point 
down to the horizontal axis. 
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CHECK YOUR UNDERSTANDING 


R 
y 


1. Multiple choice: Select the best answer. Mark receives a score report detailing his 


performance on a statewide test. On the math section, Mark earned a raw score of 39, 
which placed him at the 68th percentile. This means that 


a. Mark did better than about 39% of the students who took the test. 
b. Mark did worse than about 39% of the students who took the test. 
c. Mark did better than about 68% of the students who took the test. 
d. Mark did worse than about 68% of the students who took the test. 
e. Mark got more than half of the questions correct on this test. 


2. Mrs. Munson is concerned about how her daughter’s height and weight compare with 
those of other girls of the same age. She uses an online calculator to determine that her 
daughter is at the 87th percentile for weight and the 67th percentile for height. Explain to 
Mrs. Munson what these values mean. 


Questions 3 and 4 relate to the following setting. The graph displays the cumulative 
relative frequency of the lengths of phone calls made from the mathematics department 
office at Gabalot High last month. 
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3. About what percent of calls lasted less than 30 minutes? 30 minutes or more? 


4. Estimate Q), Q3, and the IQR of the distribution of phone call length. 


Measuring Location: z-Scores 


A percentile is one way to describe the location of an individual in a distribution of quantitative 
data. Another way is to give the standardized score (z-score) for the observed value. 


DEFINITION Standardized score (z-score) 


The standardized score (z-score) for an individual value in a distribution tells us 
how many standard deviations from the mean the value falls, and in what 
direction. To find the standardized score (z-score), compute 

value — mean 


vA = ro. Va ean Soe 
z=value — meanstandard deviation standard deviation 


Values larger than the mean have positive z-scores. Values smaller than the mean have 
negative z-scores. 

Let’s return to the data from Mr. Tabor’s first statistics test. Figure 2.3 shows a dotplot of 
the data, along with numerical summaries. 


se — ° o J ee ee eseeeeses — 
SS ee 
10 15 20 25 30 35 40 45 50 
Test score 


n x Ss, Min Q; Med Q; Max 
25 35.44 8.77 12 315 38 415 48 


FIGURE 2.3 Dotplot and summary statistics of the scores on Mr. Tabor’s first statistics test. 
Emily's score of 43 is marked in red on the dotplot. 


The relationship between the mean and the median is about what you’d expect in this left-skewed 


distribution. 


Where does Emily’s 43 (marked in red on the dotplot) fall in the distribution? Her standardized 
score (z-score) is 


z=value — meanstandard deviation=43 — 35.448.77=0.86 
value — mean 43 — 35.44 ~ 0.86 


7 
a 


standard deviation _ 8.77 


That is, Emily’s test score is 0.86 standard deviations above the mean score of the class. 


EXAMPLE |Mr. Tabor’s first test, again (>) 
Finding and interpreting z-scores 


ESB Professional/Shutterstock.com 


PROBLEM: Use Figure 2.3 to answer each of the following questions. 
a. Find and interpret the z-score for Jacob, who earned an 18 on the test. 


b. Tamika had a standardized score of 0.292. Find Tamika’s test score. 


SOLUTION: 


, 18-35-44 99 
a. 2=18 -35.448.77=-1.99 8.77 


Jacob’s test score is 1.99 standard deviations below the class mean of 35.44. 


Be sure to interpret the magnitude (number of standard deviations) and direction 


(greater than or less than the mean) of a z-score in context. 


value — 35.44 
8.77 
0.292(8.77) + 35.44 = value 


0.292 = 


6. 0.292=value-35.448.770.292(8.77)+35.44=value38 = value 38 = value 


Tamika’s test score was 38. 


Be sure to show your work when finding the value that corresponds to a given z- 


score. 


FOR PRACTICE, TRY EXERCISE 13 


We often standardize observed values to express them on a common scale. For example, we 
might compare the heights of two children of different ages or genders by calculating their z- 
scores. 


e At age 9, Jordan is 55 inches tall. Her height puts her at a z-score of 1. That is, Jordan is 1 
standard deviation above the mean height of 9-year-old girls. 


e Zayne’s height at age 11 is 58 inches. His corresponding z-score is 0.5. In other words, 
Zayne is 1/2 standard deviation above the mean height of 11-year-old boys. 

Even though Zayne’s height is larger, Jordan is taller relative to girls her age than Zayne is 

relative to boys his age. The standardized heights tell us where each child stands (pun 

intended!) in the distribution for his or her age group. 


CHECK YOUR UNDERSTANDING 


i. 
y 


1. Mrs. Navard’s statistics class has just completed the first three steps of the “Where do I 


stand?” activity (page 90). The figure shows a dotplot of the distribution of height for the 
class, along with summary statistics from computer output. 
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Variable mn X s; Min Q; Med Q; Max 
Height 25 67 4.29 60 63 66 £70 75 


Lynette, a student in the class, is 62 inches tall. Find and interpret her z-score. 


2. Brent is amember of the school’s basketball team and is 74 inches tall. The mean height 
of the players on the team is 76 inches. Brent’s height translates to a z-score of —0.85 in 
the team’s distribution of height. What is the standard deviation of the team members’ 
heights? 


Transforming Data 


To find the standardized score (z-score) for an individual observation, we transform this data 
value by subtracting the mean and dividing the difference by the standard deviation. 
Transforming converts the observation from the original units of measurement (e.g., inches) to 
a standardized scale. 

There are other reasons to transform data. We may want to change the units of 
measurement for a data set from kilograms to pounds (1 kg¥2.2 Ib! kg © 2.2 1b), or from 

. . Ppo= SF 2 32)| 

Fahrenheit to Celsius [ °C=59 (°F — 32) ]. ? 
is calibrated wrong, so we have to add a constant to each data value to get accurate 
measurements. What effect do these kinds of transformations—adding or subtracting; 
multiplying or dividing—have on the shape, center, and variability of a distribution? 


Or perhaps a measuring device 


EFFECT OF ADDING OR SUBTRACTING A CONSTANT Recall that Mr. Tabor gave 
his class of 25 statistics students a first test worth 50 points. Here is a dotplot of the students’ 
scores along with some numerical summaries. 
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25 35.44 877 12 = ~# 315 38 41.5 48 


Suppose Mr. Tabor was nice and added 5 points to each student’s test score. How would 
this affect the distribution of scores? Figure 2.4 shows graphs and numerical summaries for the 
original test scores and adjusted scores. 


Original : 2 2 Lf 2 es eee eesese ° 
s o ; oo s 
Adjusted 3———.—*_, —*, —*_3-#__* 2 _4 222 422929 9 
10 15 20 25 30 35 40 45 50 55 
Test score 
n x Sy Min Q, Med Q3; Max JQR_ Range 


Original 25 35.44 877 12 315 38 41.5 48 10 36 
Adjusted 25 4044 877 17 365 43 46.5 53 10 36 


FIGURE 2.4 Dotplots and summary statistics for the original scores and adjusted scores (with 5 
points added) on Mr. Tabor’s statistics test. 


From both the graph and summary statistics, we can see that the measures of center (mean 
and median) and location (min, Q;, Q3, and max) increased by 5 points. The shape of the 


distribution did not change. Neither did the variability of the distribution—the range, the 
standard deviation, and the interquartile range (QR) all stayed the same. 


As this example shows, adding the same positive number to each value in a data set shifts 
the distribution to the right by that number. Subtracting a positive constant from each data 
value would shift the distribution to the left by that constant. 


THE EFFECT OF ADDING OR SUBTRACTING A CONSTANT 


Adding the same positive number a to (subtracting a from) each observation: 


e Adds a to (subtracts a from) measures of center and location (mean, five-number 


summary, percentiles) 
e Does not change measures of variability (range, IQR, standard deviation) 


e Does not change the shape of the distribution 


EXAMPLE | How wide is this room? (>) 
Effect of adding/subtracting a constant 


Regissercom/Shutterstock.com 


PROBLEM: Soon after the metric system was introduced in Australia, a group of students 
was asked to guess the width of their classroom to the nearest meter. A dotplot of the data 
and some numerical summaries are shown 

The actual width of the room was 13 meters. We can examine the distribution of students’ 
errors by defining a new variable as follows: error=guess—13.°!Tor = guess — 13. Note that a 


negative value for error indicates that a student’s guess for the width of the room was too 
small. 


0 10 20 30 40 
Guess (m) 


n x  §s, Min Q, Med Q; Max JOR Range 
Guess(m) 44 16.02 714 8 11 15 17 40+ 6 32 


a. What shape does the distribution of error have? 
b. Find the mean and the median of the distribution of error. 


c. Find the standard deviation and interquartile range (IQR) of the distribution of error. 


SOLUTION: 


a. The same shape as the original distribution of. guesses: skewed to the right with two 
distinct peaks. 


Mean:16.02—13=3.02 meters; Median: 15-13=2 meters,.Mean : 16.02 — 13 = 3.02 meters; Median : 15 — 13 = 2 meters. 


c. The same as for the original distribution of guesses: 


Standard deviation=7.14 meters, 1 QR=6 meters 
Standard deviation = 7.14 meters, IQR = 6 meters 


It is not a surprise that the mean is greater than the median in this right-skewed 


distribution. 
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Figure 2.5 confirms the results of the example. 


Guess (m) 


-10 0 10 20 30 40 


n x Ss; Min Q,; Med Q; Max JOR Range 
Guess(m) 44 1602 7.14 8 11 15 17+ 40 6 32 
Error(m) 44 3.02 7.14 -5 2 2 4 27 6 32 


FIGURE 2.5 Dotplots and summary statistics for the Australian students’ guesses of classroom 
width and the errors in their guesses, in meters. 


What about outliers? You can check that the four highest guesses—which are 27, 35, 38, 40 
meters—are outliers by the 1.5xIQR1-5 x JQR rule. The same individuals will still be outliers 
in the distribution of error, but each of their values will be decreased by 13 meters: 14, 22, 25, 
and 27 meters. 


EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT Suppose that Mr. Tabor 
wants to convert his students’ adjusted test scores to percents. Because the test was out of 50 
points, he should multiply each score by 2 to be counted out of 100 points instead. Here are 
graphs and numerical summaries for the adjusted scores and the doubled scores: 


Adjusted 2 2 ee © ee cece seoscee 6 


Doubled | =), —$ 2 pe 
10 20 30 40 50 60 70 80 90 100 110 
Test score 


n s; Min Q,; Med @Q; Max JOR Range 
Adjusted 25 40.44 877 17 365 43 465 53 10 36 
Doubled 25 80.88 17.54 34 73 86 9 106 2 #=72 


From the graphs and summary statistics, we can see that the measures of center, location, 
and variability have all doubled, just like the individual observations. But the shape of the two 
distributions is the same. 


EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT 


Multiplying (or dividing) each observation by the same positive number b: 


e Multiplies (divides) measures of center and location (mean, five-number summary, 
percentiles) by b 


e Multiplies (divides) measures of variability (range, IQR, standard deviation) by b 


e Does not change the shape of the distribution 


It is not common to multiply (or divide) each observation in a data set by a negative number b. 
Doing so would multiply (or divide) the measures of variability by the absolute value of b. We 
can’t have a negative amount of variability! Multiplying or dividing by a negative number would 
also affect the shape of the distribution, as all values would be reflected over the y axis. 


EXAMPLE | How far off were our guesses? (>) 
Effect of multiplying/dividing by a constant 


N center/Shutterstock.com 


PROBLEM: Refer to the preceding example. The graph and numerical summaries describe 
the distribution of the Australian students’ guessing errors (in meters) when they tried to 


estimate the width of their classroom. 


5 0 5 10 1 2 25 30 
Error (m) 


n x Sy Min Q; Med Q; Max JQR_ Range 
44 302 7.14 -5 2 2 4 27 6 32 


Because the students are having some difficulty with the metric system, it may not be helpful 
to tell them that their guesses tended to be about 2 meters too high. Let’s convert the error 
data to feet before we report back to them. 


There are roughly 3.28 feet in a meter. For the student whose error was —5 meters, that 
translates to 


3.28 feet _ 


—5 meters x — —16.4 feet 


—5 metersx3.28 feet1 meter=—16.4 feet 1 meter 


To change the units of measurement from meters to feet, we multiply each of the error values 
by 3.28. 


a. What shape does the resulting distribution of error have? 
b. Find the median of the distribution of error in feet. 


c. (c) Find the interquartile range (IQR) of the distribution of error in feet. 
SOLUTION: 


a. The same shape as the original distribution of. guesses: skewed to the right with two 
distinct peaks. 


b. Median=2x3.28=6.56 feetMedian = 2 x 3.28 = 6.56 feet 
C. IQR=6x3.28=19.68 feet1QR = 6 x 3.28 = 19.68 feet 


FOR PRACTICE, TRY EXERCISE 25 


Figure 2.6 confirms the results of the example. 


Error (m) 


Error (ft) 


nex S; Min OQ; Med Q, Max JQR Range 
Error(m) 44 3.02 714 5 —2 2 4 27 6 32 
Error (ft) 44 9.91 23.42 -164 -656 656 13.12 8856 19.68 104.96 


FIGURE 2.6 Dotplots and summary statistics for the errors in Australian students’ guesses of 
classroom width, in meters and in feet. 


PUTTING IT ALL TOGETHER: ADDING/SUBTRACTING AND 
MULTIPLYING/DIVIDING What happens if we transform a data set by both adding or 
subtracting a constant and multiplying or dividing by a constant? We just use the facts about 
transforming data that we’ve already established. 


EXAMPLE |Too cool at the cabin? (>) 
Analyzing the effects of transformations 


gulfix/Getty Images 


PROBLEM: During the winter months, the temperatures at the Starnes’s Colorado cabin 
can stay well below freezing (32°F, or 0°C) for weeks at a time. To prevent the pipes from 
freezing, Mrs. Starnes sets the thermostat at 50°F. She also buys a digital thermometer that 
records the indoor temperature each night at midnight. Unfortunately, the thermometer is 
programmed to measure the temperature in degrees Celsius. Following are a dotplot and 
numerical summaries of the midnight temperature readings for a 30-day period. 


Use the fact that °F=(9/5)°C+32 F = (9/5) “C + 32 to help you answer the following 
questions. 


a. Find the mean temperature in degrees Fahrenheit. Does the thermostat setting seem 
accurate? 


b. Calculate the standard deviation of the temperature readings in degrees Fahrenheit. 
Interpret this value in context. 


Temperature (°C) 


n Mean SD Min Q; Med Q; Max 
30 843 2.27 3.00 7.00 850 10.00 14.00 


SOLUTION: 


a. Mean=(9/s)(8.43)+32=47.17 °F Mean = (9/5) (8.43) + 32 = 47.17 °F The thermostat 
doesn’t seem to be very accurate. It is set at 50°F, but the mean temperature over the 


30-day period is about 47°F. 


Multiplying each observation by 9/5 multiplies the standard deviation by 9/5. 


However, adding 32 to each observation doesn’t affect the variability. 


6. SD=(9/s)(2.27)=4.09°F SD = (9/5) (2.27) = 4.09°F. rhe temperature readings typically 
vary from the mean by about 4°F. That’s a lot of variation! 


FOR PRACTICE, TRY EXERCISE 29 


Many other types of transformations can be very useful in analyzing data. We have only 
studied what happens when you transform data by adding, subtracting, multiplying, or dividing 
by a constant. 


CONNECTING TRANSFORMATIONS AND z-SCORES What happens if we standardize 
all the values in a distribution of quantitative data? Here is a dotplot of the original test scores 
for the 25 students in Mr. Tabor’s statistics class, along with some numerical summaries: 
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We calculate the z-score for each student using 


score — 35.44 


z=score - 35.448.77 8.77 


In other words, we subtract 35.44 from each student’s test score and then divide by 8.77. What 
effect would these transformations have on the shape, center, and variability of the distribution? 


Here is a dotplot of the class’s z-scores. Let’s describe the distribution. 
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e Shape: The shape of the distribution of z-scores is the same as the shape of the original 
distribution of test scores—skewed to the left. Neither subtracting a constant nor dividing by 
a constant would change the shape of the graph. 


e Center: The mean of the distribution of z-scores is 0. Subtracting 35.44 from each test score 
would reduce the mean from 35.44 to 0. Dividing each of these new data values by 8.77 
would divide the new mean of 0 by 8.77, which still yields a mean of 0. 


e Variability: The standard deviation of the distribution of z-scores is 1. Subtracting 35.44 
from each test score does not affect the standard deviation. However, dividing all of the 
resulting values by 8.77 would divide the original standard deviation of 8.77 by 8.77, 
yielding 1. 


CHECK YOUR UNDERSTANDING 


i. 


,, Knoebels Amusement Park in Elysburg, Pennsylvania, has earned acclaim for being an 
affordable, family-friendly entertainment venue. Knoebels does not charge for general 
admission or parking, but it does charge customers for each ride they take. How much do the 
rides cost at Knoebels? The figure shows a dotplot of the cost for each of 22 rides in a recent 
year, along with summary statistics. 


i a a | a | | | | 8 
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Cost ($) 


n Mean SD Min Q; Median Q; Max 
22 1.705 0.447 125 15 LS 1.75 3 


1. Suppose you convert the cost of the rides from dollars to cents ($1=100 cents). 


($1 = 100 cents) . Describe the shape, mean, and standard deviation of the distribution of 


ride cost in cents. 


2. Knoebels’ managers decide to increase the cost of each ride by 25 cents. How would the 
shape, center, and variability of this distribution compare with the distribution of cost in 
Question 1? 


3. Now suppose you convert the increased costs from Question 2 to z-scores. What would be 
the shape, mean, and standard deviation of this distribution? Explain your answers. 


Section 2.1 Summary 


e Two ways of describing an individual’s location in a distribution are percentiles and 
standardized scores (z-scores). An individual’s percentile is the percent of the distribution 
that is less than the individual’s data value. 


e To standardize any data value, subtract the mean of the distribution and then divide the 
difference by the standard deviation. The resulting z-score 


value — mean 
z=value — meanstandard deviation standard deviation 
measures how many standard deviations the data value lies above or below the mean of the 


distribution. We can also use percentiles and z-scores to compare the relative location of 
individuals in different distributions. 


e A cumulative relative frequency graph allows us to examine location in a distribution. The 
completed graph allows you to estimate the percentile for an individual value, and vice 
versa. 


e It is necessary to transform data when changing units of measurement. 


: When you add a positive constant a to (subtract a from) all the values in a data set, 
measures of center and location—mean, five-number summary, percentiles—increase 
(decrease) by a. Measures of variability—range, IQR, SD—do not change. 


« When you multiply (divide) all the values in a data set by a positive constant b, measures of 
center, location, and variability are multiplied (divided) by b. 


=: Neither of these transformations changes the shape of the distribution. 


Section 2.1 Exercises 


1. pg 92 |») Shoes How many pairs of shoes does a typical teenage boy own? To find out, 


two AP® Statistics students surveyed a random sample of 20 male students from their large 
high school. Then they recorded the number of pairs of shoes that each boy owned. Here is 
a dotplot of the data: 


Number of pairs of shoes 


a. Find the percentile for Jackson, who reported owning 22 pairs of shoes. 


b. Raul’s reported number of pairs of shoes is at the 45th percentile of the distribution. 
Interpret this value. How many pairs of shoes does Raul own? 


2. Old folks Here is a stemplot of the percents of residents aged 65 and older in the 50 states: 


7; 0 Key: 1s|2 means 15.2% of 
8 | 8 this state’s residents are 65 
918 or older. 

10 | 019 


11 | 16777 
12 | 01122456778999 
13 | 0001223344455689 


14 | 023568 
15 | 24 
16|9 


a. Find the percentile for Colorado, where 10.1% of the residents are aged 65 and older. 


b. Rhode Island is at the 80th percentile of the distribution. Interpret this value. What 
percent of Rhode Island’s residents are aged 65 and older? 


3. Wear your helmet! Many athletes (and their parents) worry about the risk of concussions 


4. 


when playing sports. A football coach plans to obtain specially made helmets for his 
players that are designed to reduce the chance of getting a concussion. Here are the 
measurements of head circumference (in inches) for the players on the team: 


23.0 22.2 21.7 22.0 22.3 22.6 22.7 21.5 
22.7 25.6 20.8 23.0 24.2 23.5 20.8 24.0 
2241 22.6 23.9 22.5 23.1 21.9 21.0 22.4 
23.5 22.5 23.9 23.4 21.6 23.3 


a. Antawn, the team’s starting quarterback, has a head circumference of 22.4 inches. What 
is Antawn’s percentile? 


b. Find the head circumference of the player at the 90th percentile of the distribution. 
Don’t call me According to a study by Nielsen Mobile, “Teenagers ages 13 to 17 are by far 
the most prolific texters, sending 1742 messages a month.” Mr. Williams, a high school 
Statistics teacher, was skeptical about the claims in the article. So he collected data from his 
first-period statistics class on the number of text messages they had sent over the past 24 
hours. Here are the data: 


a. Sunny was the student who sent 42 text messages. What is Sunny’s percentile? 


b. Find the number of text messages sent by Joelle, who is at the 16th percentile of the 
distribution. 

5. Setting speed limits According to the Los Angeles Times, speed limits on California 
highways are set at the 85th percentile of vehicle speeds on those stretches of road. Explain 
to someone who knows little statistics what that means. 

6. Blood pressure Larry came home very excited after a visit to his doctor. He announced 
proudly to his wife, “My doctor says my blood pressure is at the 90th percentile among 
men like me. That means I’m better off than about 90% of similar men.” How should his 
wife, who is a statistician, respond to Larry’s statement? 

7. Growth charts We used an online growth chart to find percentiles for the height and 
weight of a 16-year-old girl who is 66 inches tall and weighs 118 pounds. According to the 
chart, this girl is at the 48th percentile for weight and the 78th percentile for height. Explain 
what these values mean in plain English. 

8. Run fast Peter is a star runner on the track team. In the league championship meet, Peter 
records a time that would fall at the 80th percentile of all his race times that season. But his 
performance places him at the 50th percentile in the league championship meet. Explain 
how this is possible. (Remember that shorter times are better in this case!) 

9. pg 94 |») Run fast! As part of a student project, high school students were asked to 


sprint 50 yards and their times (in seconds) were recorded. A cumulative relative frequency 
graph of the sprint times is shown here. 
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Sprint time (sec) 


a. One student ran the 50 yards in 8 seconds. Is a sprint time of 8 seconds unusually slow? 


b. Estimate and interpret the 20th percentile of the distribution. 


10. Household incomes The cumulative relative frequency graph describes the distribution of 


median household incomes in the 50 states in a recent year.? 


35 40 45 50 55 60 65 70 75 
Median household income ($1000s) 


Cumulative relative frequency (%) 


a. The median household income in North Dakota that year was $55,766. Is North Dakota 
an unusually wealthy state? 


b. Estimate and interpret the 90th percentile of the distribution. 
11. Foreign-born residents The cumulative relative frequency graph shows the distribution of 
the percent of foreign-born residents in the 50 states. 


Cumulative relative frequency (%) 


0 5 10 15 20 25 
Percent foreign-born residents 


a. Estimate the interquartile range (IQR) of this distribution. Show your method. 
b. What is the percentile for Arizona, which had 15.1% foreign-born residents that year? 
c. Explain why the graph is fairly flat between 20% and 27.5%. 


d. Draw the histogram that corresponds to this graph. 
12. Shopping spree The figure is a cumulative relative frequency graph of the amount spent by 
50 consecutive grocery shoppers at a store. 


Cumukative relative frequency (%) 


0 10 20 30 40 50 60 70 80 9 100 
Amount spent ($) 
a. Estimate the interquartile range (IQR) of this distribution. Show your method. 
b. What is the percentile for the shopper who spent $19.50? 
c. Explain why the graph is steepest between $10 and $30. 
d. Draw the histogram that corresponds to this graph. 
13. pg 96 |») Foreign-born residents Refer to Exercise 11. Here are summary statistics for 


the percent of foreign-born residents in the 50 states: 


n Mean SD Min Q1 Med Q3 Max 
50 8.73 6.12 12 4.1 6.2 13.4 pay el 


a. Find and interpret the z-score for Montana, which had 1.9% foreign-born residents. 


b. New York had a standardized score of 2.10. Find the percent of foreign-born residents in 
New York at that time. 
14. Household incomes Refer to Exercise 10. Here are summary statistics for the state median 
household incomes: 


n Mean SD Min Q1 Med Q3 Max 
50 51,742.44 8210.64 36,641 46,071 50,009 57,020 71,836 


a. Find and interpret the z-score for North Carolina, with a median household income of 
$41,553. 


b. New Jersey had a standardized score of 1.82. Find New Jersey’s median household 
income for that year. 
15. Shoes Refer to Exercise 1. Jackson, who reported owning 22 pairs of shoes, has a 
standardized score of z=1.102 = 1.10, 


a. Interpret this z-score. 


b. The standard deviation of the distribution of number of pairs of shoes owned in this 
sample of 20 boys is 9.42. Use this information along with Jackson’s z-score to find the 
mean of the distribution. 


16. Don’t call me Refer to Exercise 4. Alejandro, who sent 92 texts, has a standardized score 
of z=1.892 = 1.89, 


a. Interpret this z-score. 


b. The standard deviation of the distribution of number of text messages sent over the past 
24 hours by the students in Mr. Williams’s class is 34.15. Use this information along 
with Alejandro’s z-score to find the mean of the distribution. 

17. Measuring bone density Individuals with low bone density (osteoporosis) have a high risk 
of broken bones (fractures). Physicians who are concerned about low bone density in 
patients can refer them for specialized testing. Currently, the most common method for 
testing bone density is dual-energy X-ray absorptiometry (DEXA). The bone density 
results for a patient who undergoes a DEXA test usually are reported in grams per square 
centimeter (g/cm?) and in standardized units. 


Judy, who is 25 years old, has her bone density measured using DEXA. Her results 
indicate bone density in the hip of 948 g/cm? and a standardized score of z=-1.45. 
z = —1.45. The mean bone density in the hip is 956 g/cm? in the reference population of 
25-year-old women like Judy.2 


a. Judy has not taken a statistics class in a few years. Explain to her in simple language 
what the standardized score reveals about her bone density. 


b. Use the information provided to calculate the standard deviation of bone density in the 
reference population. 

18. Comparing bone density Refer to Exercise 17. Judy’s friend Mary also had the bone 
density in her hip measured using DEXA. Mary is 35 years old. Her bone density is also 
reported as 948 g/cm, but her standardized score is z=0.50.2 = 0.50. The mean bone 
density in the hip for the reference population of 35-year-old women is 944 grams/cm2. 


a. Whose bones are healthier for her age: Judy’s or Mary’s? Justify your answer. 


b. Calculate the standard deviation of the bone density in Mary’s reference population. 
How does this compare with your answer to Exercise 17(b)? Are you surprised? 

19. SAT versus ACT Eleanor scores 680 on the SAT Mathematics test. The distribution of 
SAT Math scores is symmetric and single-peaked with mean 500 and standard deviation 
100. Gerald takes the American College Testing (ACT) Mathematics test and scores 29. 
ACT scores also follow a symmetric, single-peaked distribution—but with mean 21 and 
standard deviation 5. Find the standardized scores for both students. Assuming that both 
tests measure the same kind of ability, who has the higher score? 

20. Comparing batting averages Three landmarks of baseball achievement are Ty Cobb’s 
batting average of 0.420 in 1911, Ted Williams’s 0.406 in 1941, and George Brett’s 0.390 
in 1980. These batting averages cannot be compared directly because the distribution of 
major league batting averages has changed over the years. The distributions are quite 
symmetric, except for outliers such as Cobb, Williams, and Brett. While the mean batting 
average has been held roughly constant by rule changes and the balance between hitting 
and pitching, the standard deviation has dropped over time. Here are the facts: 


Decade Mean Standard deviation 


1910s 0.266 0.0371 
1940s 0.267 0.0326 
1970s 0.261 0.0317 


Find the standardized scores for Cobb, Williams, and Brett. Who had the best performance 
for the decade he played?4 

21. pg 99 |») Long jump A member of a track team was practicing the long jump and 
recorded the distances (in centimeters) shown in the dotplot. Some numerical summaries of 
the data are also provided. 


565 570 575 580 585 
Long-jump distance (cm) 


n Mean SD Min Q, Med Q3 Max 
40 577.3 4.713 564 574.5 577 581.5 586 


After chatting with a teammate, the jumper realized that he measured his jumps from the 
back of the board instead of the front. Thus, he had to subtract 20 centimeters from each of 
his jumps to get the correct measurement for each jump. 


a. What shape would the distribution of corrected long jump distance have? 
b. Find the mean and median of the distribution of corrected long-jump distance. 


c. Find the standard deviation and interquartile range (IQR) of the distribution of corrected 


long-jump distance. 
22. Step right up! A dotplot of the distribution of height for Mrs. Navard’s class is shown, 
along with some numerical summaries of the data. 
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n xX Ss Min Q Med Q, Max 
25 67 429 60 63 66 70 75 
Suppose that Mrs. Navard has the entire class stand on a 6-inch-high platform and then 
asks the students to measure the distance from the top of their heads to the ground. 
a. What shape would this distribution of distance have? 
b. Find the mean and median of the distribution of distance. 


c. Find the standard deviation and interquartile range (IQR) of the distribution of distance. 
23. Teacher raises A school system employs teachers at salaries between $38,000 and 


$70,000. The teachers’ union and school board are negotiating the form of next year’s 
increase in the salary schedule. Suppose that every teacher is given a $1000 raise. What 
effect will this raise have on each of the following characteristics of the resulting 
distribution of salary? 

a. Shape 

b. Mean and median 


c. Standard deviation and interquartile range (IQR) 

24. Used cars, cheap! A used-car salesman has 28 cars in his inventory, with prices ranging 
from $11,500 to $25,000. For a Labor Day sale, he reduces the price of each car by $500. 
What effect will this reduction have on each of the following characteristics of the resulting 
distribution of price? 

a. Shape 
b. Mean and median 
c. Standard deviation and interquartile range (IQR) 

25. pg 100 _») Long jump Refer to Exercise 21. Suppose that the corrected long-jump 
distances are converted from centimeters to meters (note that 100 cm=1 m100 cm = 1 m), 
a. What shape would the resulting distribution have? Explain your answer. 

b. Find the mean of the distribution of corrected long-jump distance in meters. 
c. Find the standard deviation of the distribution of corrected long-jump distance in meters. 

26. Step right up! Refer to Exercise 22. Suppose that the distances from the tops of the 
students’ heads to the ground are converted from inches to feet (note that 12 in.=1 ft). 

12 in: = 1 ft 
a. What shape would the resulting distribution have? Explain your answer. 


b. Find the mean of the distribution of distance in feet. 


c. Find the standard deviation of the distribution of distance in feet. 

27. Teacher raises Refer to Exercise 23. Suppose each teacher receives a 5% raise instead of a 
$1000 raise. What effect will this raise have on each of the following characteristics of the 
resulting salary distribution? 


a. Shape 
b. Median 


c. Interquartile range (IQR) 

28. Used cars, cheap! Refer to Exercise 24. Suppose each car’s price is reduced by 10% 
instead of $500. What effect will this discount have on each of the following characteristics 
of the resulting price distribution? 


a. Shape 
b. Median 


29. 


30. 


31. 


OZ. 


c. Interquartile range (IQR) 
pg 101 |») Cool pool? Coach Ferguson uses a thermometer to measure the temperature 


(in degrees Fahrenheit) at 20 different locations in the school swimming pool. An analysis 
of the data yields a mean of 77°F and a standard deviation of 3°F. 


(Recall that °C=59 oF-1609 secu that" C=9 F-% .) 


a. Find the mean temperature reading in degrees Celsius. 


b. Calculate the standard deviation of the temperature readings in degrees Celsius. 
Measure up Clarence measures the diameter of each tennis ball in a bag with a standard 
ruler. Unfortunately, he uses the ruler incorrectly so that each of his measurements is 0.2 
inch too large. Clarence’s data had a mean of 3.2 inches and a standard deviation of 0.1 
inch. (Recall that 1 in.=2.54 cm.Recall that 1 in. = 2.54 cm.) 


a. Find the mean of the corrected measurements in centimeters. 


b. Calculate the standard deviation of the corrected measurements in centimeters. 

Taxi! In 2016, taxicabs in Los Angeles charged an initial fee of $2.85 plus $2.70 per mile. 
In equation form, Fare Fare=2.85+2.7(miles).Fare = 2-85 + 2.7(miles). At the end of a 
month, a businessman collects all his taxicab receipts and calculates some numerical 
summaries. The mean fare he paid was $15.45 with a standard deviation of $10.20. What 
are the mean and standard deviation of the lengths of his cab rides in miles? 

Quiz scores The scores on Ms. Martin’s statistics quiz had a mean of 12 and a standard 
deviation of 3. Ms. Martin wants to transform the scores to have a mean of 75 anda 
standard deviation of 12. What transformations should she apply to each test score? 
Explain your answer. 


Multiple Choice Select the best answer for Exercises 33-38. 


33. Jorge’s score on Exam 1 in his statistics class was at the 64th percentile of the scores for 


all students. His score falls 

a. between the minimum and the first quartile. 
b. between the first quartile and the median. 

c. between the median and the third quartile. 

d. between the third quartile and the maximum. 


e. at the mean score for all students. 


34, When Sam goes to a restaurant, he always tips the server $2 plus 10% of the cost of the 


meal. If Sam’s distribution of meal costs has a mean of $9 and a standard deviation of $3, 
what are the mean and standard deviation of his tip distribution? 


a. $2.90, $0.30 


b. $2.90, $2.30 
c. $9.00, $3.00 
d. $11.00, $2.00 
e. $2.00, $0.90 


35. Scores on the ACT college entrance exam follow a bell-shaped distribution with mean 21 
and standard deviation 5. Wayne’s standardized score on the ACT was —0.6. What was 
Wayne’s actual ACT score? 


a. 3 
b. 13 


36. George’s average bowling score is 180; he bowls in a league where the average for all 
bowlers is 150 and the standard deviation is 20. Bill’s average bowling score is 190; he 
bowls in a league where the average is 160 and the standard deviation is 15. Who ranks 
higher in his own league, George or Bill? 


a. Bill, because his 190 is higher than George’s 180. 
b. Bill, because his standardized score is higher than George’s. 


c. Bill and George have the same rank in their leagues, because both are 30 pins above the 
mean. 


d. George, because his standardized score is higher than Bill’s. 


e. George, because the standard deviation of bowling scores is higher in his league. 


Exercises 37 and 38 refer to the following setting. The number of absences during the fall 
semester was recorded for each student in a large elementary school. The distribution of 
absences is displayed in the following cumulative relative frequency graph. 
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37. What is the interquartile range (IQR) for the distribution of absences? 
a 1 
b. 2 
c 3 
d. 5 
e. 14 


38. If the distribution of absences was displayed in a histogram, what would be the best 
description of the histogram’s shape? 


a. Symmetric 

b. Uniform 

c. Skewed left 
d. Skewed right 


e. Cannot be determined 


Recycle and Review Exercises 39 and 40 refer to the following setting. We used Census At 
School’s Random Data Selector to choose a sample of 50 Canadian students who completed a 
survey in a recent year. 


39. Travel time (1.2) The dotplot displays data on students’ responses to the question “How 
long does it usually take you to travel to school?” Describe the distribution. 


i=) 
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40 60 80 100 
Travel time (min) 


40. Lefties (1.1) Students were asked, “Are you right-handed, left-handed, or ambidextrous?” 


The responses (R=right-handed, L=left-handed, A=ambidextrous 
R = right-handed, L = left-handed, A = ambidextrous) are shown here. 


R R R is R R 
R R R R R A 
R R R R Li A 


DADA DV 
DAD DV 
DAD DN 
DAD DN 
Ara yD 
DAD DV 
DNADD DV 
Arena yDA 


a. Make an appropriate graph to display these data. 


b. Over 10,000 Canadian high school students took the Census At School survey that year. 
What percent of this population would you estimate is left-handed? Justify your answer. 


SECTION 2.2 Density Curves and Normal Distributions 


LEARNING TARGETS By the end of the section, you should be able to: 
Use a density curve to model distributions of quantitative data. 


Identify the relative locations of the mean and median of a distribution from a 
density curve. 


Use the 68—95-99.7 rule to estimate (i) the proportion of values in a specified 
interval, or (ii) the value that corresponds to a given percentile in a Normal 
distribution. 


Find the proportion of values in a specified interval in a Normal distribution using 
Table A or technology. 


Find the value that corresponds to a given percentile in a Normal distribution 
using Table A or technology. 


Determine whether a distribution of data is approximately Normal from graphical 
and numerical evidence. 


In Chapter 1, we developed graphical and numerical tools for describing distributions of 
quantitative data. Our work gave us a clear strategy for exploring data on a single quantitative 
variable. 


e Always plot your data: make a graph—usually a dotplot, stemplot, or histogram. 


e Look for the overall pattern (shape, center, variability) and for striking departures such as 
outliers. 


e Calculate numerical summaries to describe center and variability. 


In this section, we add one more step to this strategy. 


e When there’s a regular overall pattern, use a simplified model called a density curve to 
describe it. 


Density Curves 


age footstock/Alamy Stock Photo 


Selena works at a bookstore in the Denver International Airport. She takes the airport train 
from the main terminal to get to work each day. The airport just opened a new walkway that 
would allow Selena to get from the main terminal to the bookstore in 4 minutes. She wonders if 
it will be faster to walk or take the train to work. 

Figure 2.7(a) shows a dotplot of the amount of time it has taken Selena to get to the 
bookstore by train each day for the last 1000 days she worked. To estimate the percent of days 
on which it would be quicker for her to take the train, we could find the percent of dots 
(marked in red) that represent journey times of less than 4 minutes. Surely, there’s a simpler 
way than counting all those dots! 

Figure 2.7(b) shows the dotplot modeled with a density curve. You might wonder why the 
density curve is drawn at a height of 1/3. That’s so the area under the density curve between 2 
minutes and 5 minutes is equal to 


3x1/3=1.00=100% * 1/3 = 1.00 = 100% 


which represents 100% of the observations in the distribution shown in Figure 2.7(a). 


Density curve 
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FIGURE 2.7 (a) Dotplot of Selena’s travel time over the past several years via train from the 
Denver airport main terminal to the bookstore where she works. The red dots indicate times 
when it took her less than 4 minutes to get to work. (b) Density curve modeling the dotplot in 
part (a). The red shaded area estimates the proportion of times that it took Selena less than 4 
minutes to get to work. 


DEFINITION Density curve 

A density curve is a curve that 

e Is always on or above the horizontal axis 
e Has area exactly 1 underneath it 


The area under the curve and above any interval of values on the horizontal axis 
estimates the proportion of all observations that fall in that interval. 


The density curve in Figure 2.7(b) is called a uniform density curve because it has constant 


height. Recall from Chapter 1 that we can describe the dotplot of journey times in Figure 2.7(a) as 
having roughly a uniform distribution. 


The red shaded area under the density curve in Figure 2.7(b) provides a good 
approximation for the proportion or percent of red dots. Because the shaded region is 
rectangular, 


area=basexheight=2 x 1/3=2/3=0.667=66.7% 
area = base x height = 2 x 1/3 = 2/3 = 0.667 = 66.7% 


So we estimate that it would be quicker for Selena to take the train to work on about 66.7% of 
days. In fact, on 669 of the 1000 days, Selena’s journey from the terminal to the bookstore took 
less than 4 minutes. That’s 669/1000 = 0.669 = 66.9%—-very close to the estimate we got using 
the density curve. 

No set of quantitative data is exactly described by a density curve. The curve is an 
approximation that is easy to use and accurate enough in most cases. The density curve simply 
smooths out the irregularities in the distribution. 


EXAMPLE |That’s so random! (») 
Density curves 


PROBLEM: Suppose you use a calculator or computer random number generator to 
produce a number between 0 and 2 (like 0.84522 or 1.1111119). The random number 
generator will spread its output uniformly across the entire interval from 0 to 2 as we allow it 
to generate a long sequence of random numbers. 


a. Draw a density curve to model this distribution of random numbers. Be sure to include 
scales on both axes. 


b. About what percent of the randomly generated numbers will fall between 0.87 and 1.55? 


c. Estimate the 65th percentile of this distribution of random numbers. 


SOLUTION: 


0 2 
Random number 


The height of the curve needs to be 1/2 so that 


area = base x height 


area=basexheight =2x1/2=1 =2x1/2=1 


0 0.8% 155 2 
Random number 


Area=(1.55-0.87)x1/2=0.34=349ht€8 = (1.55 — 0.87) x 1/2 = 0.34 = 34% 


C. Area =0.65 | 
| 


Random number 


0.65 = (x —0) x 1/2 
0.65 = (1/2)x 


0.65=(x-0)x1/20.65=(1/2)x1.30=x 1-30 = x 
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Describing Density Curves 


Density curves come in many shapes. As with the distribution of a quantitative variable, we 
start by looking for rough symmetry or clear skewness. Then we identify any clear peaks. 


Figure 2.8 shows three density curves with distinct shapes. 
Skewed to the left, Roughly symmetric, Skewed to the right, 


single-peaked double-peaked single-peaked 


FIGURE 2.8 Density curves with different shapes. 


Our measures of center and variability apply to density curves as well as to distributions of 
quantitative data. Recall that the mean is the balance point of a distribution. Figure 2.9 


illustrates this idea for the mean of a density curve. 
| No! ™ Yes! 
The median of a distribution of quantitative data is the point with half the observations on 


FIGURE 2.9 The mean of a density curve is its balance point. 
either side. Similarly, the median of a density curve is the point with half of the area on each 
side. 


DEFINITION Mean of a density curve, Median of a density curve 


The mean of a density curve is the point at which the curve would balance if 
made of solid material. 


The median of a density curve is the equal-areas point, the point that divides the 
area under the curve in half. 


A symmetric density curve balances at its midpoint because the two sides are identical. So 
the mean and median of a symmetric density curve are equal, as in Figure 2.10(a). It isn’t so 
easy to spot the equal-areas point on a skewed density curve. We used technology to locate the 
median in Figure 2.10(b). The mean is greater than the median because the balance point of the 
distribution is pulled toward the long right tail. 


The long right tail pulls 
the mean to the right 
of the median. 


1A, 
(a) Median and mean (b) Median 
FIGURE 2.10 (a) Both the median and mean of a symmetric density curve lie at the point of 
symmetry. (b) In a right-skewed density curve, the mean is pulled away from the median toward 


the long tail. 


EXAMPLE | What does the left skew do? (>) 


Mean versus median 


ABC 
PROBLEM: The density curve that models a distribution of quantitative data is shown here. 
Identify the location of the mean and median by letter. Justify your answers. 


SOLUTION: 
Median=BMedian = B, Mean=A.Mean = A. B is the equal-areas point of the distribution. 
The mean will be less than the median due to the left-skewed shape. 


Even though C is directly under the peak of the curve, more than half of the area is to 


its left, so C cannot be the median. 
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Because a density curve is an idealized description of a distribution of quantitative data, we 
distinguish between the mean and standard deviation of the density curve and the mean x # 
and standard deviation s, computed from the actual observations. The usual notation for the 


mean of a density curve is p! (the lowercase Greek letter mu). We write the standard deviation 


of a density curve as o (the lowercase Greek letter sigma). 


You probably noticed that we used the same notation for the population mean and standard 
deviation, p and o, in Chapter 1 as we do here for the mean and standard deviation of a density 
curve. 


We can roughly locate the mean p of any density curve by eye, as the balance point. No 
easy way exists to estimate the standard deviation for density curves in general. But there is one 
family of density curves for which we can estimate the standard deviation by eye. 


Normal Distributions 


Blend Images/Shutterstock.com 


When we examine a distribution of quantitative data, how does it compare with an idealized 
density curve? Figure 2.11(a) shows a histogram of the scores of all seventh-grade students in 
Gary, Indiana, on the vocabulary part of the Iowa Test of Basic Skills (ITBS). The scores are 
grade-level equivalents, so a score of 6.3 indicates that the student’s performance is typical for 
a student in the third month of grade 6. The histogram is roughly symmetric, and both tails fall 
off smoothly from a single center peak. There are no large gaps or obvious outliers. 


The density curve drawn through the tops of the histogram bars in Figure 2.11(b) is a good 
description of the overall pattern of the ITBS score distribution. We call it a Normal curve. 
The distributions described by Normal curves are called Normal distributions. In this case, the 


ITBS vocabulary scores of Gary, Indiana, seventh-graders are approximately Normally 
distributed. 


2 4 6 8 10 12 
ITBS vaoumen score (b) ITBS vocabulary score 

FIGURE 2.11 (a) Histogram of the lowa Test of Basic Skills (ITBS) vocabulary scores of all 

seventh-grade students in Gary, Indiana. (b) The Normal density curve shows the overall shape 

of the distribution. 


Normal distributions play a large role in statistics, but they are rather special and not at all 
“normal” in the sense of being usual or typical. We capitalize Normal to remind you that these 
density curves are special. 


Look at the two Normal distributions in Figure 2.12. They illustrate several important facts: 


e Shape: All Normal distributions have the same overall shape: symmetric, single-peaked, and 
bell-shaped. 


e Center: The mean p is located at the midpoint of the symmetric density curve and is the 
same as the median. 


e Variability: The standard deviation o measures the variability (width) of a Normal 
distribution. 


a IK 


FIGURE 2.12 Two Normal curves, showing the mean yu and standard deviation o. 


You can estimate o by eye on a Normal curve. Here’s how. Imagine that you are skiing 
down a mountain that has the shape of a Normal distribution. At first, you descend at an 
increasingly steep angle as you go out from the peak. 


/ \ 


Fortunately, before you find yourself going straight down, the slope begins to get flatter rather 
than steeper as you go out and down: 


a ‘a 


The points at which this change of curvature takes place are located at a distance o on 
either side of the mean p. (Advanced math students know these as “inflection points.”) You can 
feel the change as you run a pencil along a Normal curve, which will allow you to estimate the 
standard deviation. 


DEFINITION Normal distribution, Normal curve 


A Normal distribution is described by a symmetric, single-peaked, bell-shaped 
density curve called a Normal curve. Any Normal distribution is completely 
specified by two numbers: its mean uy and standard deviation o. 


The distribution of ITBS vocabulary scores for seventh-grade students in Gary, Indiana, is 
modeled well by a Normal distribution with mean p=6.844 — 6.34 and standard deviation 
o=1.550 = 1.55, The figure shows this distribution with the points 1, 2, and 3 standard 
deviations from the mean labeled on the horizontal axis. 


T T T r. T T T 
2.19 3.74 5.29 684 839 9.94 11.49 
ITBS vocabulary score 


You will be asked to make reasonably accurate sketches of Normal distributions to model 
quantitative data sets like the ITBS vocabulary scores. The best way to learn is to practice. 


EXAMPLE | Stop the car! (») 
Sketching a Normal distribution 


fstop123/Getty Images 


PROBLEM: Many studies on automobile safety suggest that when automobile drivers make 
emergency stops, the stopping distances follow an approximately Normal distribution. 
Suppose that for one model of car traveling at 62 mph under typical conditions on dry 
pavement, the mean stopping distance is p=155 ft! — 155 ft with a standard deviation of 
o=3 fto = 3 ft. Sketch the Normal curve that approximates the distribution of stopping 
distance. Label the mean and the points that are 1, 2, and 3 standard deviations from the 
mean. 


SOLUTION: 


1 } ' } 


146 149 1sn Is5 158 161 164 
Stopping distance (ft) 


The mean (155) is at the midpoint of the bell-shaped density curve. The standard 
deviation (3) is the distance from the center to the change-of-curvature points on 
either side. Label the mean and the points that are 1, 2, and 3 SDs from the mean: 


1 SD: 155 — 1(3) = 152 and 155 + 1(3) = 158 
2 SD: 155 — 2(3) = 149 and 155 + 2(3) = 161 
3 SD: 155 — 3(3) = 146 and 155 + 3(3) = 164 
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Oo Remember that p and o alone do not specify the appearance of most distributions. 
The shape of density curves, in general, does not reveal o. These are special properties of 
Normal distributions. 


Why are Normal distributions important in statistics? Here are three reasons. 


.. Normal distributions are good descriptions for some distributions of real data. Distributions 
that are often close to Normal include: 


e Scores on tests taken by many people (such as SAT exams and IQ tests) 
e Repeated careful measurements of the same quantity (like the diameter of a tennis ball) 
e Characteristics of biological populations (such as lengths of crickets and yields of corn) 


’, Normal distributions are good approximations to the results of many kinds of chance 
outcomes, like the proportion of heads in many tosses of a fair coin. 


3. Many of the inference methods in Chapters 8—12 are based on Normal distributions. 


AGO701542723 
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Normal curves were first applied to data by the great mathematician Carl Friedrich Gauss 
(1777-1855). He used them to describe the small errors made by astronomers and surveyors in 
repeated careful measurements of the same quantity. You will sometimes see Normal 
distributions labeled “Gaussian” in honor of Gauss. His image was even featured on a previous 
German 10 DM bill, along with a sketch of the Normal distribution. 


The 68-95-99.7 Rule 


Earlier, we saw that the distribution of Iowa Test of Basic Skills (ITBS) vocabulary scores for 
seventh-grade students in Gary, Indiana, is approximately Normal with mean p=6.844 — 6.84 
and standard deviation o=1.55¢ = 1.55, How unusual is it for a Gary seventh-grader to get an 
ITBS score less than 3.74? The figure shows the Normal density curve for this distribution with 
the area of interest shaded. Note that the boundary value, 3.74, is exactly 2 standard deviations 
below the mean. 


2.19 3.74 5.29 6.84 8.39 9.94 11.49 
ITBS vocabulary score 


Calculating the shaded area isn’t as easy as multiplying base x heightase < height, but it’s 
not as hard as you might think. The following activity shows you how to do it. 


pePLe> 


(a 
ACTIVITY What’s so special about Normal distributions? ¢€ 


In this activity, you will use the Normal Density Curve applet at 
highschool.bfwpub.com/tps6e to explore an interesting property of Normal distributions. 

Change the mean to 6.84 and the standard deviation to 1.55, and click on “UPDATE.” 
(These are the values for the distribution of ITBS vocabulary scores of seventh-graders in 
Gary, Indiana.) A figure like the one that follows should appear: 


Use the applet to help you answer the following questions. 


5.290 8.390 


064 219 374 529 684 839 9.94 1149 13,04 


1. What percent of the area under the Normal curve lies within 1 standard deviation of the 
mean? That is, about what percent of Gary, Indiana, seventh-graders have ITBS 
vocabulary scores between 5.29 and 8.39? 

(b) What percent of the area under the Normal curve lies within 2 standard deviations of 
the mean? Interpret this result in context. 
(c) What percent of the area under the Normal curve lies within 3 standard deviations of 
the mean? Interpret this result in context. 


2. The distribution of IQ scores in the adult population is approximately Normal with mean p 


= 100 and standard deviation o = 15. Adjust the applet to display this distribution. About 
what percent of adults have IQ scores within 1, 2, and 3 standard deviations of the mean? 


3. Adjust the applet to have a mean of 0 and a standard deviation of 1. Then click 
“UPDATE.” The resulting density curve describes the standard Normal distribution. 
What percent of the area under this Normal density curve lies within 1, 2, and 3 standard 
deviations of the mean? 


4. Summarize by completing this sentence: “For any Normal distribution, the area under the 
Normal curve within 1, 2, and 3 standard deviations of the mean is about Yes %, 
and__—%.” 


When you hear the phrase “standard Normal distribution,” think standardized scores (z- 
scores), which have a mean of 0 and a standard deviation of 1. 


Although there are many Normal distributions, they all have properties in common. In 
particular, all Normal distributions obey the 68—95—99.7 rule. 


DEFINITION The 68-95-99.7 rule 


68% of 
values 


95% of 
values 


99.7% of 
values 


Standard deviations 


In a Normal distribution with mean u and standard deviation o: 

e Approximately 68% of the observations fall within o of the mean wy. 

e Approximately 95% of the observations fall within 20 of the mean wu. 

e Approximately 99.7% of the observations fall within 30 of the mean wv. 


This result is known as the 68-—95-99.7 rule. 


Some people refer to the 68—95-99.7 rule as the empirical rule (empirical means “learned from 
experience or by observation”). 


By remembering these three numbers, you can quickly estimate proportions or percents of 
observations (areas) using Normal distributions and recognize when an observation is unusual. 

Earlier, we asked how unusual it would be for a Gary seventh-grader to get an ITBS score 
less than 3.74. Figure 2.13 gives the answer in graphical form. By the 68—95—99.7 rule, about 
95% of these students have ITBS vocabulary scores between 3.74 and 9.94, which means that 
about 5% of the students have scores less than 3.74 or greater than 9.94. Due to the symmetry 
of the Normal distribution, about 5%/2 = 2.5% 0/2 = 2.5% oF students have scores less than 
3.74. So it is quite unusual for a Gary, Indiana, seventh-grader to get an ITBS vocabulary score 
below 3.74. 


| About 95% of scores 
are within 20 of jp. 


About 2.5% of scores 
are less than 3.74. 


2.19 3.74 529 684 839 994 11.49 
ITBS vocabulary score 
FIGURE 2.13 Using the 68—95—99.7 rule to estimate the percent of Gary, Indiana, seventh- 
graders with ITBS vocabulary scores less than 3.74. 


How well does the 68—95-—99.7 rule describe the distribution of ITBS vocabulary scores for 
Gary, Indiana, seventh-graders? Well, 900 of the 947 scores are between 3.74 and 9.94. That’s 
95.04%, which is very accurate indeed. Of the remaining 47 scores, 20 are below 3.74 and 27 
are above 9.94. The number of values in each tail is not quite equal, as it would be in an exactly 
Normal distribution. Normal distributions often describe real data better in the center of the 
distribution than in the extreme high and low tails. As famous statistician George Box once 
noted, “All models are wrong, but some are useful!” 


EXAMPLE | Stop the car! (») 
Using the 68-95-99.7 rule 


PROBLEM: Many studies on automobile safety suggest that when automobile drivers must 
make emergency stops, the stopping distances follow an approximately Normal distribution. 
Suppose that for one model of car traveling at 62 mph under typical conditions on dry 
pavement, the mean stopping distance is p=155 ft" — 155 ft with a standard deviation of 


o=3 fto = 3 ft. 


a. About what percent of cars of this model would take more than 158 feet to make an 
emergency stop? Show your method clearly. 


b. A car of this model that takes 158 feet to make an emergency stop would be at about what 
percentile of the distribution? Justify your answer. 


SOLUTION: 


a. About 68% 


About uae = 16% 


} | } } 
146 149 «1sR 155 «(158 «(161 «= “164 
Stopping distance (ft) 


About 16% of cars of this model would take more than 158 feet to make an emergency 
Stop. 


Start by sketching a Normal curve and labeling the values 1, 2, and 3 standard 
deviations from the mean. Then shade the area of interest. 


Use the 68—95-99.7 rule and the symmetry of the Normal distribution to find the 
desired area. 


6. About the 84th percentile because about 100%-16%=84%100% — 16% = 84% of cars of 
this model would stop in less than 158 feet. 


FOR PRACTICE, TRY EXERCISE 51 


ow Note that the 68—95—99.7 rule applies only to Normal distributions. Is there a 


similar rule that would apply to any distribution? Sort of. A result known as Chebyshev’s 


inequality says that in any distribution, the proportion of observations falling within k standard 
1 


l als 
deviations of the mean is at least 1-1k2._—s- &* . If k = 2, for example, Chebyshev’s inequality 


1—+ =0.75 
tells us that at least 1-122=0.75 2 of the observations in any distribution are within 


2 standard deviations of the mean. For Normal distributions, we know that this proportion is 
much higher than 0.75. In fact, it’s approximately 0.95. 


Chebyshev’s inequality is an interesting result, but it is not required for the AP® Statistics exam. 


CHECK YOUR UNDERSTANDING 


‘< 


v The distribution of heights of young women aged 18 to 24 is approximately Normal with 
mean p=64.5 inches! = 64.5 inches and standard deviation o=2.5 incheso = 2.5 inches, 


1. Sketch the Normal curve that approximates the distribution of young women’s height. 
Label the mean and the points that are 1, 2, and 3 standard deviations from the mean. 
2. About what percent of young women have heights less than 69.5 inches? Show your work. 


3. Is a young woman with a height of 62 inches unusually short? Justify your answer. 


Finding Areas in a Normal Distribution 


Let’s return to the distribution of ITBS vocabulary scores among all Gary, Indiana, seventh- 
graders. Recall that this distribution is approximately Normal with mean p=6.844 — 6.84 and 
standard deviation o6=1.55¢ = 1.55, What proportion of these seventh-graders have vocabulary 
scores that are below sixth-grade level? Figure 2.14 shows the Normal curve with the area of 
interest shaded. We can’t use the 68—95—99.7 rule to find this area because the boundary value 
of 6 is not exactly 1, 2, or 3 standard deviations from the mean. 


| 
2.19 3.74 5.29 6 6.84 839 9.94 1149 
ITBS vocabulary score 
FIGURE 2.14 Normal curve we would use to estimate the proportion of Gary, Indiana, seventh- 
graders with ITBS vocabulary scores that are less than 6—that is, below sixth-grade level. 


As the 68—95—99.7 rule suggests, all Normal distributions are the same if we measure in 
units of size o from the mean p. Changing to these units requires us to standardize, just as we 
did in Section 2.1: 

value — mean oy 
Co 


4 =—__________ = 
z=value — meanstandard deviation=x- pio — standard deviation 


Recall that subtracting a constant and dividing by a constant don’t change the shape of a 
distribution. If the quantitative variable we standardize has an approximately Normal 
distribution, then so does the new variable z. This new distribution of standardized values can 
be modeled with a Normal curve having mean p=04 — 9 and standard deviation o=1¢ = 1. It is 
called the standard Normal distribution. 


-3 ~2 ~] 0 ] 2 3 
z-score 


DEFINITION Standard Normal distribution 


The standard Normal distribution is the Normal distribution with mean O and 
standard deviation 1. 


Because all Normal distributions are the same when we standardize, we can find areas 
under any Normal curve using the standard Normal distribution. Table A in the back of the 
book gives areas under the standard Normal curve. The table entry for each z-score is the area 
under the curve to the left of z. 


For the ITBS test score data, we want to find the area to the left of 6 under the Normal 
distribution with mean 6.84 and standard deviation 1.55. See Figure 2.15(a). We start by 
standardizing the boundary value x=62 = 6: 


z=value — meanstandard deviation=6 — 6.841.55=—0.54 
value—mean _ 6—6.84 * 


= —0.54 
standard deviation 1.55 


~ 
“~ 


Figure 2.15(b) shows the standard Normal distribution with the area to the left of z=—-0.54 
z = —0.54 shaded. Notice that the shaded areas in the two graphs are the same. 


™ ™ 7 ees 
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(a) ITBS vocabulary score (b) z-score 
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FIGURE 2.15 (a) Normal distribution estimating the proportion of Gary, Indiana, seventh-graders 
who earn ITBS vocabulary scores less than 6th-grade level. (b) The corresponding area in the 
standard Normal distribution. 


To find the area to the left of z=-0.54% = —9-54, locate —0.5 in the left-hand column of 
Table A, then locate the remaining digit 4 as .04 in the top row. The entry to the right of -0.5 
and under .04 is .2946. This is the area we seek. We estimate that about 29.46% of Gary, 
Indiana, seventh-grader scores fall below the sixth-grade level on the ITBS vocabulary test. 
Note that we have made a connection between z-scores and percentiles when the shape of a 
distribution is approximately Normal. 


z .03 04 .05 
-0.6 2643 2611 2578 
-0.5 2981 29460 2912 
-0.4 3336 .3300 3264 


It is also possible to find areas under a Normal curve using technology. 


5. Technology Corner | FINDING AREAS FROM VALUES IN A 
NORMAL DISTRIBUTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The normalcdf command on the TI-83/84 can be used to find areas under a Normal curve. 
The syntax is normalcdf(lower bound, upper bound, mean, standard deviation). Let’s use 
this command to calculate the proportion of ITBS vocabulary scores in Gary, Indiana, that 
are less than 6. Note that we can do the area calculation using the standard Normal 
distribution or the Normal distribution with mean 6.84 and standard deviation 1.55. 


I. Using the standard Normal distribution: What proportion of observations in a standard 
Normal distribution are less than z=—0.542 = —0.54? Recall that the standard Normal 
distribution has mean p=0H — 0 and standard deviation o=1¢ = 1, 


e Press (Distr) and choose normalcdf(. OS 2.55 or later: In the dialog box, 
enter these values: lower:—1000, upper:—0.54, p:0F ° 0. o:1¢ : 1, choose Paste, and then 


press [ENTER| 


Older OS: Complete the command normalcdf(—1000,—0.54,0,1) and press [ENTER|. 


Note: We chose -1000 as the lower bound because it’s many, many standard deviations 


NORMAL FLOAT AUTO REAL RADIAN MP 1 


normalcdf (-1000, -0.54,0,1) 
Q. 2945984889 


less than the mean. 


I. Using the unstandardized Normal distribution: What proportion of observations in a 
Normal distribution with mean p=6.844 — 6.84 and standard deviation o=1.550 = 1.55 
are less than x=62 = 6? 

e Press (Distr) and choose normalcdf(. OS 2.55 or later: In the dialog box, 
enter these values: lower:—1000, upper:6, 1:6.84/:6.84 0:1.55¢ : 1.55, choose Paste, 
and then press [ENTER|. 

Older OS: Complete the command normalcdf(—1000,6,6.84,1.55) and press [ENTER|. 


This answer differs slightly from the one we got using the standard Normal distribution 
because we rounded the standardized score to two decimal places: z=-0.542 = —0.54, 


NORMAL FLOAT AUTO REAL RADIAN MP i 


normalcdf(-1000,.6,.6.84,1. 
@. 2939314473 


The following box summarizes the process of finding areas in a Normal distribution. In 
Step 2, each method of performing calculations has some advantages, so check with your 
teacher to see which option will be used in your class. 


| HOW TO FIND AREAS IN ANY NORMAL DISTRIBUTION | 


Step 1: Draw a Normal distribution with the horizontal axis labeled and scaled using the 
mean and standard deviation, the boundary value(s) clearly identified, and the area of 
interest shaded. 


Step 2: Perform calculations—show your work! Do one of the following: 


(i) Standardize each boundary value and use Table A or technology to find the desired area 
under the standard Normal curve; or 


(ii) Use technology to find the desired area without standardizing. 


Be sure to answer the question that was asked. 


AP® EXAM TIP 


Students often do not get full credit on the AP® Statistics exam because they use 
option (ii) with “calculator-speak” to show their work on Normal calculation questions 
—for example, normalcdf(—1000,6,6.84,1.55). This is not considered clear 
communication. To get full credit, follow the two-step process above, making sure to 
carefully label each of the inputs in the calculator command if you use technology in 
Step 2: normalcdf(lower: -1000, upper: 6, mean: 6.84, SD:1.55). 


EXAMPLE | Stop the car! (>) 
Finding area to the left 


fizkes/Shutterstock.com 


PROBLEM: As noted in the preceding example, studies on automobile safety suggest that 
stopping distances follow an approximately Normal distribution. For one model of car 
traveling at 62 mph, the mean stopping distance is p=155 ft/! = 155 ft with a standard 
deviation of o=3 fto = 3 ft. Danielle is driving one of these cars at 62 mph when she spots a 
wreck 160 feet in front of her and needs to make an emergency stop. About what percent of 
cars of this model when going 62 mph would be able to make an emergency stop in less than 
160 feet? Is Danielle likely to stop safely? 


SOLUTION: 


Stopping distance (ft) 


1. Draw anormal distribution. Be sure to: 
e Scale the horizontal axis. 


e Label the horizontal axis with the variable name, including units of 
measurement. 


e Clearly identify the boundary value(s). 


e Shade the area of interest. 


, — 180-155 _ | 7 
1. Z=— 160 — 1553 = 1.67" 3 — 


Using Table A: Area for PENA E TIONG (OL 0) 5 os 


Using technology: normalcdf(lower: -1000, upper: 1.67, mean: 0, DE 1) = 0.9525 
ii. normalcdf(lower: —1000, upper: 160, mean: 155, SD: 3) = OG a 


About 95% of cars of this model would be able to make an emergency stop within 160 feet. 
So Danielle is likely to be able to stop safely. 


2. Perform calculations—show your work! 
1. Standardize the boundary value and use Table A or technology to find the 


desired probability; or 
. Use technology to find the desired area without standardizing. 


z .06 .07 08 
1.5 9406 9418 9429 
1.6 9515 Se 9535 
1.7 9608 9616 9625 


Be sure to answer the question that was asked. 
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What percent of cars of this model would be able to make an emergency stop in less than 
140 feet? The standardized score for x=1402 = 140 is 


140—15£ 
Z=140 — 1553=-5.00 3 


Table A does not go beyond z=-3.502 = —3-50 and z=3.502 = 3.50 because it is highly 
unusual for a value to be more than 3.5 standard deviations from the mean in a Normal 
distribution. For practical purposes, we can act as if there is approximately zero probability 
outside the range of Table A. So there is almost no chance that a car of this model going 62 
mph would be able to make an emergency stop within 140 feet. 


FINDING AREAS TO THE RIGHT IN A NORMAL DISTRIBUTION What proportion of 
Gary, Indiana, seventh-grade scores on the ITBS vocabulary test are at least 9? Start with a 
picture. Figure 2.16(a) on the next page shows the Normal distribution with mean p}=6.84 
t = 6.84 and standard deviation o=1.55¢ = 1.55 with the area of interest shaded. Next, 
standardize the boundary value: 

9—6.84 


———= 1.39 
1.55 


2=9 - 6.841.55=1.39" 


Figure 2.16(b) shows the standard Normal distribution with the area to the right of z=1.39 
z = 1.39 shaded. Again, notice that the shaded areas in the two graphs are the same. 
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(a) ITBS vocabulary score (b) z-score 

FIGURE 2.16 (a) Normal distribution estimating the proportion of Gary, Indiana, seventh-graders 
who earn ITBS vocabulary scores at the ninth-grade level or higher. (b) The corresponding area 
in the standard Normal distribution. 


To find the area to the right of z=1.392 = 1.39, locate 1.3 in the left-hand column of Table 
A, then locate the remaining digit 9 as .09 in the top row. The entry to the right of 1.3 and 
under .09 is .9177. However, this is the area to the left of z=1.392 = 1.39. We can use the fact 
that the total area in the standard Normal distribution is 1 to find that the area to the right of 
Z=1.392 = 1.39 is 1-0.9177=0.08231 — 9.9177 = 0.0823. We estimate that about 8.23% of 
Gary, Indiana, seventh-graders earn scores at the ninth-grade level or above on the ITBS 
vocabulary test. 


Z .07 .08 .09 


Leg .8980 .8997 .9015 
1.4 9292 .9306 .9319 


@ A common student mistake is to look up a z-score in Table A and report the entry 


corresponding to that z-score, regardless of whether the problem asks for the area to the 
left or to the right of that z-score. This mistake can usually be prevented by drawing a 
Normal distribution and shading the area of interest. Look to see if the area should be closer to 
O or closer to 1. In the preceding example, for instance, it should be obvious that 0.9177 is not 
the correct area. 


EXAMPLE |Can Spieth clear the trees? (>) 
Finding area to the right 


Christian Petersen/Getty Images 


PROBLEM: When professional golfer Jordan Spieth hits his driver, the distance the ball 
travels can be modeled by a Normal distribution with mean 304 yards and standard deviation 
8 yards. On a specific hole, Jordan would need to hit the ball at least 290 yards to have a 
clear second shot that avoids a large group of trees. What percent of Spieth’s drives travel at 
least 290 yards? Is he likely to have a clear second shot? 


SOLUTION: 


280 mn 256 304 312 320 328 
250 


Distance traveled (ya) 


1. Draw a Normal distribution. 


290-304 __y 7 


i 2 = 290 — 3048 — =i 75 8 


Using Table A: Area for Z < -1.75 ts 0.0401. Area for Z = -1.75 is 1-0.0401 = 0.9599. 
Using technology: normalcdf{(lower: -1.75, upper: 1000, mean: o, S‘D: 1) = 0.9599 
ie normalcdf{(lower: 290, upper: 1000, mean: 304, S'‘D: 8) = 0.9599 


2. Perform calculations—show your work! 


i. Standardize and use Table A or technology; or 
ll. Use technology without standardizing. 


About 96% of Jordan Spieth’s drives travel at least 290 yards. So he is likely to have a 
clear second shot. 


Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 55 


Think About It 


WHAT PROPORTION OF JORDAN SPIETH’S DRIVES GO EXACTLY 290 
YARDS? There is no area under the Normal density curve in the preceding example 
directly above the point 290.000000000... . So the answer to our question based on the 
Normal distribution is 0. One more thing: the areas under the curve with x>290% = 290 and 
x>2902 > 290 are the same. According to the Normal model, the proportion of Spieth’s 
drives that travel at least 290 yards is the same as the proportion that travel more than 290 
yards. 


FINDING AREAS BETWEEN TWO VALUES IN A NORMAL DISTRIBUTION How do 
you find the area in a Normal distribution that is between two values? For instance, suppose we 
want to estimate the proportion of Gary, Indiana, seventh-graders with ITBS vocabulary scores 
between 6 and 9. Figure 2.17(a) shows the desired area under the Normal curve with mean 
U=6.844 = 6.84 and standard deviation o=1.55¢ = 1.55, We can use Table A or technology to 
find the desired area. 


Option (i): If we standardize each boundary value, we get 


6—6.84 9—6.84 
=-0.54, z= 


Zz = 1.39 
z=6 — 6.841.55=-0.54, z=9 -6.841.55=1.39 1.55 1.55 


Figure 2.17(b) shows the corresponding area of interest in the standard Normal distribution. 


T T 
—2 ~1 _9.54 0 1 139 2 3 
(a) ITBS vocabulary score (b) z-score 


219 3.74 529 6 684 839 9 9.94 11.49 3 


FIGURE 2.17 (a) Normal distribution approximating the proportion of seventh-graders in Gary, 
Indiana, with ITBS vocabulary scores between 6 and 9. (b) The corresponding area in the 
standard Normal distribution. 


Using Table A: The table makes this process a bit trickier because it only shows areas to the 
left of a given z-score. The visual shows one way to think about the calculation. 


z-score 1.39 z-score z-score 


Area between z=—0.54 and z=1.39=Area to the left of z=1.39—Area to the left of z=—0.54=0.917 
Area between z = —0.54 and z = 1.39 


= Area to the left of z = 1.39 — Area to the left of z = —0.54 
= 0.9177 — 0.2946 
= 0.6231 


Using technology: normalcdf(lower: —0.54, upper: 1.39, mean: 0, SD: 1) = 0.6231 


Option (ii): normalcdf(lower: 6, upper: 9, mean: 6.84, SD: 1.55) = 0.6243. 


About 62% of Gary, Indiana, seventh-graders earned grade-equivalent scores between 6 


and 9. 


EXAMPLE |Can Spieth reach the green? (>) 
Finding areas between two values 


Ben Stansall/AFP/Getty Images 


PROBLEM: When professional golfer Jordan Spieth hits his driver, the distance the ball 
travels can be modeled by a Normal distribution with mean 304 yards and standard deviation 
8 yards. On another golf hole, Spieth has the opportunity to drive the ball onto the green if he 
hits the ball between 305 and 325 yards. What percent of Spieth’s drives travel a distance 
that falls in the interval? Is he likely to get the ball on the green with his drive? 


SOLUTION: 


280 288-296 sot 312 sz0} 328 
305 Bas 
Distance traveled (ya) 


1. Draw a Normal distribution. 


_ 305-304 __ 3$25—304 = 
i. Z=305-3048=0.13 iO =) SM BUNS 8 as 


Using Table A: 0.9957-0.5517 = 0.4440 
Using technology: normalcdf{(lower:0.13, upper:2.63, mean:o, SD:1) = 0.4440 


il. normalcdf(lower:305, upper:325, mean:304, SD:8) = 0.4459 


2. Perform calculations—show your work! 


1. Standardize and use Table A or technology; or 
li. Use technology without standardizing. 


About 45% of Spieth’s drives travel between 305 and 325 yards. He has a fairly good 
chance of getting the ball on the green—assuming he hits the shot straight. 


Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 57 


CHECK YOUR UNDERSTANDING 


‘. 


”, High levels of cholesterol in the blood increase the risk of heart disease. For 14-year-old 
boys, the distribution of blood cholesterol is approximately Normal with mean 
u=170 milligrams! = 170 milligrams of cholesterol per deciliter of blood (mg/dl) and 


standard deviation o=30 mg/dl? = 30 mg/dl 6 


1. Cholesterol levels higher than 240 mg/dl may require medical attention. What percent of 
14-year-old boys have more than 240 mg/dl of cholesterol? 


2. People with cholesterol levels between 200 and 240 mg/dl are at considerable risk for 
heart disease. What proportion of 14-year-old boys have blood cholesterol between 200 
and 240 mg/dl? 


Working Backward: Finding Values from Areas 


So far, we have focused on finding areas in Normal distributions that correspond to specific 
values. What if we want to find the value that corresponds to a particular area? For instance, 
suppose we want to estimate the 90th percentile of the distribution of ITBS vocabulary scores 
for Gary, Indiana, seventh-graders. Figure 2.18(a) shows the Normal curve with mean p=6.84 
}t = 6.84 and standard deviation o=1.55¢ = 1.55 that models this distribution. We’re looking 
for the ITBS score x that has 90% of the area to its left. Figure 2.18(b) shows the standard 
Normal distribution with the corresponding area shaded. 


Area = 0.90 


Area = 0.90 


219 374 529 684 839 ¥ 994 1149 3 2 4 0 i 


Nm 


(a) ITBS vocabulary score (b) . z-score 
FIGURE 2.18 (a) Normal distribution showing the 90th percentile of ITBS vocabulary scores for 
Gary, Indiana, seventh-graders. (b) The 90th percentile in the standard Normal distribution. 


We can use Table A or technology to find the z-score with an area of 0.90 to its left. 
Because Table A gives the area to the left of a specified z-score, all we have to do is find the 
value closest to 0.90 in the middle of the table. From the reproduced portion of Table A, you 
can see that the desired value is z=1.282 = 1.28. Then we “unstandardize” to get the 
corresponding ITBS vocabulary score x. 


Z .07 .08 .09 
1.1 .8790 .8810 .8830 
1.3 .9147 .9162 .9177 
_ @—H 
sali oO 
128 = z—6.84 
1.55 
1.28(1.55)+6.84 = x 
Z=x — 101.28=x — 6.841.551.28(1.55)+6.84=x8.824=x 8.824 = x 


So the 90th percentile of the distribution of ITBS vocabulary scores for Gary, Indiana, seventh- 
graders is 8.824. 

It is also possible to find the 90th percentile of either distribution in Figure 2.18 using 
technology. 


6. Technology Corner | FINDING VALUES FROM AREAS IN A 
NORMAL DISTRIBUTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The TI-83/84 invNorm command calculates the value corresponding to a given percentile in 
a Normal distribution. The syntax is invNorm(area to the left, mean, standard deviation). 
Let’s use this command to confirm the 90th percentile for the ITBS vocabulary scores in 


Gary, Indiana. Note that we can do the calculation using the standard Normal distribution or 
the Normal distribution with mean 6.84 and standard deviation 1.55. 


i 


iff, 


Using the standard Normal distribution: What is the 90th percentile of the standard 
Normal distribution? 


e Press (Distr) and choose invNorm(. 


OS 2.55 or later: In the dialog box, enter these values: area:0.90, 11:0 ° 0 o:10: 1, 
choose Paste, and then press [ENTER], 
Older OS: Complete the command invNorm(0.90,0,1) and press [ENTER], 


Note: The most recent TI-84 Plus CE OS has added an option for specifying area in the 
LEFT, CENTER, or RIGHT of the distribution. Choose LEFT in this case. 


This result matches what we got using Table A. Now “unstandardize” as shown preceding 
the Technology Corner to get x=8.824% = 8.824, 


NORMAL FLOAT AUTO REAL RADIAN MP HT 


invNorm(@.9@,0,1,LEFT) 
1.281551567 


Using the unstandardized Normal distribution: What is the 90th percentile of a Normal 
distribution with mean p = 6.84 and standard deviation o=1.550 = 1.55? 


e Press (Distr) and choose invNorm(. 


OS 2.55 or later: In the dialog box, enter these values: area:0.90, p1:6.84, 0:1.55, 


choose Paste, and then press |ENTER|. 


Older OS: Complete the command invNorm(0.90,6.84,1.55) and press [ENTER]. 


Note: The most recent TI-84 Plus CE OS has added an option for specifying area in the 
LEFT, CENTER, or RIGHT of the distribution. Choose LEFT in this case. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


invNorm(@.9@,6.84,1.55,LE) 
8. 826404928 


The following box summarizes the process of finding a value corresponding to a given area 
in a Normal distribution. In Step 2, each method of performing calculations has some 
advantages, so check with your teacher to see which option will be used in your class. 


HOW TO FIND VALUES FROM AREAS IN ANY NORMAL DISTRIBUTION 


Step 1: Draw a Normal distribution with the horizontal axis labeled and scaled using the 
mean and standard deviation, the area of interest shaded, and unknown boundary value 
clearly marked. 

Step 2: Perform calculations—show your work! Do one of the following: 

(i) Use Table A or technology to find the value of z with the indicated area under the 
standard Normal curve, then “unstandardize” to transform back to the original distribution; 
or 

(ii) Use technology to find the desired value without standardizing. 


Be sure to answer the question that was asked. 


AP® EXAM TIP 


As noted previously, to make sure that you get full credit on the AP® Statistics 
exam, do not use “calculator-speak” alone—for example, invNorm(0.90,6.84,1.55). 
This is not considered clear communication. To get full credit, follow the two-step 
process above, making sure to carefully label each of the inputs in the calculator 
command if you use technology in Step 2: invNorm(area: 0.90, mean: 6.84, 
SD:1.55). 


EXAMPLE | How tall are 3-year-old girls? (>) 


Finding a value from an area 


aly 7 
Cultura Creative (RF)/Alamy 
Stock Photo 
PROBLEM: According to www.cdc.gov/growthcharts/, the heights of 3-year-old females 
are approximately Normally distributed with a mean of 94.5 centimeters and a standard 
deviation of 4 centimeters. Seventy-five percent of 3-year-old girls are taller than what 
height? 


SOLUTION: 


Area = 0.75 
Area =1- 0.752025 


94.5 98.5 102.5 106.5 


Heights of 3-year-old girls (cm) 


If 75% of 3-year-old girls are taller than a certain height, then 25% of 3-year-old girls 
are shorter than that height. So we just need to find the 25th percentile of this 
distribution of height. 


1. Draw a Normal distribution. 
From the 68—95—99.7 rule, we know that about 16% of the observations ina 
Normal distribution will fall more than 1 standard deviation less than the mean. 
So the 25th percentile will be located slightly to the right of 90.5, as shown. 


i. Using Table A: 0.25 area to the left—z = = 0.07 


Using technology: invNorm(area: 0.25, mean: 0, SD: 1) = -0.67 
—0.67 = = 
—0.67(4) + 94.5 =x 


2=<x 


~0.67=x-94.54-0.67(4)+94.5=x91.82=x 91.8 
ii. invNorm(area: 0.25, mean: 94.5, S‘D: 4) = 91.80 
About 75% of 3-year-old girls are taller than 91.80 centimeters. 


2. Perform calculations—show your work! 
i. Use Table A or technology to find the value of z with the indicated area 


under the standard Normal curve, then “unstandardize” to transform back to 
the original distribution; or 
. Use technology to find the desired value without standardizing. 


Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 63 


Here’s an activity that gives you a chance to apply what you have learned so far in this 
section. 


ACTIVITY Team challenge: The vending machine problem 


Ed Honowitz/The Image Bank/Getty Images 


In this activity, you will work in a team of three or four students to resolve a real-world 
problem. 
Have you ever purchased a hot drink from a vending machine? The intended sequence of 


events goes something like this: You insert your money into the machine and select your 
preferred beverage. A cup falls out of the machine, landing upright. Liquid pours out until 
the cup is nearly full. You reach in, grab the piping-hot cup, and drink happily. 

Except sometimes, things go wrong. The machine might swipe your money. Or the cup 
might fall over. More frequently, everything goes smoothly until the liquid begins to flow. It 
might stop flowing when the cup is only half full. Or the liquid might keep coming until your 
cup overflows. Neither of these results leaves you satisfied. 

The vending machine company wants to keep its customers happy. So it has decided to 
hire you as a Statistical consultant. The company provides you with the following summary 
of important facts about the vending machine: 


e Cups will hold 8 ounces. 


e The amount of liquid dispensed varies according to a Normal distribution centered at the 
mean p/ that is set in the machine. 
e o-0.2 ounceso = 0.2 ounces, 

If a cup contains too much liquid, a customer may get burned from a spill. This could 
result in an expensive lawsuit for the company. On the other hand, customers may be 
irritated if they get a cup with too little liquid from the machine. 

Given these issues, what mean setting for the machine would you recommend? Provide 
appropriate graphical and numerical evidence to support your conclusion. Be prepared to 
defend your answer. 


CHECK YOUR UNDERSTANDING 


V High levels of cholesterol in the blood increase the risk of heart disease. For 14-year-old 
boys, the distribution of blood cholesterol is approximately Normal with mean p=170 

w= 170 milligrams of cholesterol per deciliter of blood (mg/dl) and standard deviation 
o=30 mg/dl? = 39 mg/dl What cholesterol level would place a 14-year-old boy at the 10th 
percentile of the distribution? 


Assessing Normality 


Normal distributions provide good models for some distributions of quantitative data. 
Examples include SAT and IQ test scores, the highway gas mileage of 2018 Corvette 
convertibles, weights of 9-ounce bags of potato chips, and heights of 3-year-old girls (see 


Figure 2.19). 


825 865 95 945 85 1025 1065 
Heights of 3-year-old girls (cm) 
FIGURE 2.19 The heights of 3-year-old girls are approximately Normally distributed with a mean 
of 94.5 centimeters and standard deviation of 4 centimeters. 


The distributions of other quantitative variables are skewed and therefore distinctly non- 
Normal. Examples include single-family home prices in a certain city, survival times of cancer 
patients after treatment, and number of siblings for students in a statistics class. 


While experience can suggest whether or not a Normal distribution is a reasonable model in 
a particular case, it is risky to assume that a distribution is approximately Normal without first 
analyzing the data. As in Chapter 1, we start with a graph and then add numerical summaries to 
assess the Normality of a distribution of quantitative data. 


If a graph of the data is clearly skewed, has multiple peaks, or isn’t bell-shaped, that’s 
evidence the distribution is not Normal. Here is a dotplot of the number of siblings reported by 
each student in a statistics class. This distribution is skewed to the right and therefore not 
approximately Normal. 
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Even if a graph of the data looks roughly symmetric and bell-shaped, we shouldn’t assume 
that the distribution is approximately Normal. The 68—95—99.7 rule can give additional 
evidence in favor of or against Normality. 

Figure 2.20 shows a dotplot and numerical summaries for data on calories per serving in 77 
brands of breakfast cereal-2 The graph is roughly symmetric, single-peaked, and somewhat 
bell-shaped. Let’s count the number of data values within 1, 2, and 3 standard deviations of the 
mean: 


: 


60 80 100 ~=120 140 160 
Calories 


n Mean SD Min Q, Med Q; Max 
77 106.883 19.484 50 100 110 110 160 


FIGURE 2.20 Dotplot and summary statistics for data on calories per serving in 77 different 
brands of breakfast cereal. 
Mean + 1 SD: 106.883 + 1(19.484)87.399 to 126.36763 out of 77 = 81.8%Mean + 2 SD: 106.88: 
Mean + 1 SD: 106.883 + 1(19.484) 87.399 to 126.367 63 out of 77 = 81.8% 
Mean + 2 SD: 106.883 + 2(19.484) 67.915 to 145.851 71 out of 77 = 92.2% 
Mean + 3 SD: 106.883 + 3(19.484) 48.431 to 165.335 77 out of 77 = 100.0% 


In a Normal distribution, about 68% of the values fall within 1 standard deviation of the 
mean. For the cereal data, almost 82% of the brands had between 87.399 and 126.367 calories. 
These two percentages are far apart. So this distribution of calories in breakfast cereals is not 
approximately Normal. 


EXAMPLE | Are IQ scores Normally distributed? (») 
Assessing Normality 


Monkey Business Images/Shutterstock.com 


PROBLEM: Many people believe that the distribution of IQ scores follows a Normal 
distribution. Is that really the case? To find out, researchers obtained the IQ scores of 60 
randomly selected fifth-grade students from one school. Here are the data:® 


81 82 89 90 94 96 97 100 101 101 101 
102 102 102 103 105 106 108 109 109 109 110 
110 110 112 112 113 113 114 114 114 115 116 
117 117 117 118 118 122 122 123 124 124 124 
125 126 127 127 128 130 131 133 134 134 136 
137 139 i379 142 145 


A histogram and summary statistics for the data are shown. Is this distribution of IQ scores 
of fifth-graders at this school approximately Normal? Justify your answer based on the graph 
and the 68—95-—99.7 rule. 


80 90 100 «=110 120 130~=«©140~—~=«#150 
IQ score 


n Mean SD Min Q; Med Q; Max 
60 114.983 14.801 81 104 114 125.5 145 


SOLUTION: 
The histogram looks roughly symmetric, single-peaked, and somewhat bell-shaped. The 


percents of values within ees and a standard deviations of the mean are 

Mean + 1 SD: 114.983 + 1(14.801) 100.182 to 129.78441 out of 60 = 68.3%Mean + 2 SD: 114. 
Mean + 1 SD: 114.983 + 1(14.801) 100.182 to 129.784 41 out of 60 = 68.3% 
Mean + 2 SD: 114.983 + 2(14.801) 85.381 to 144.585 57 out of 60 = 95.0% 
Mean + 3 SD: 114.983 + 3(14.801) 70.580 to 159.386 60 out of 60 = 100.0% 


These percents are very close to the 68%, 95%, and 99.7% targets for a Normal 
distribution. The graphical and numerical evidence suggests that this distribution of HOE 
scores 1s approximately Normal. 


FOR PRACTICE, TRY EXERCISE 75 


AP® EXAM TIP 


Never say that a distribution of quantitative data is Normal. Real-world data always 
show at least slight departures from a Normal distribution. The most you can Say is 
that the distribution is “approximately Normal.” 


Because the IQ data come from a random sample, we can use the sample mean IQ score to 
make an inference about the population mean IQ score of all fifth-graders at the school. As you 
will see in later chapters, the methods for inference about a population mean work best when 
the population distribution is Normal. Because the distribution of IQ scores in the sample is 
approximately Normal, it is reasonable to believe that the population distribution is 
approximately Normal. 


NORMAL PROBABILITY PLOTS A graph called a Normal probability plot (or a Normal 
quantile plot) provides a good assessment of whether or not a distribution of quantitative data is 
approximately Normal. 


DEFINITION Normal probability plot 


A Normal probability plot is a scatterplot of the ordered pair (data value, 
expected z-score) for each of the individuals in a quantitative data set. That is, the 
x-coordinate of each point is the actual data value and the y-coordinate is the 
expected z-score corresponding to the percentile of that data value in a standard 
Normal distribution. 


AP® EXAM TIP 


Normal probability plots are not included on the AP® Statistics topic outline. 


However, these graphs are very useful for assessing Normality. You may use them 
on the AP® Statistics exam if you wish—just be sure that you know what you're 
looking for. 


Some software plots the data values on the horizontal axis and the z-scores on the vertical axis, 
while other software does just the reverse. The TI-83/84 gives you both options. We prefer the 
data values on the horizontal axis, which is consistent with other types of graphs we have made. 


Technology Corner 7 at the end of this subsection shows you how to make a Normal 
probability plot. For now, let’s focus on how to interpret Normal probability plots. 

Figure 2.21 shows dotplots and Normal probability plots for each of the data sets in this 
subsection. 


e Panel (a): We confirmed earlier that the distribution of IQ scores is approximately Normal. 
Its Normal probability plot has a linear form. 


e Panel (b): The distribution of number of siblings is clearly right-skewed. Its Normal 
probability plot has a curved form. 


e Panel (c): We determined earlier that the distribution of calories in breakfast cereals is not 
approximately Normal, even though the graph looks roughly symmetric and somewhat bell- 
shaped. Its Normal probability plot has a different kind of nonlinear form. 
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FIGURE 2.21 Dotplot and Normal probability plot of (a) IQ scores for 60 randomly selected fifth- 
graders from one school, (6) Number of siblings for each student in a college statistics class, 
and (c) Calories per serving in 77 brands of breakfast cereal. The distribution of |Q scores in (a) 
is approximately Normal because the Normal probability plot has a linear form. The nonlinear 


Normal probability plots in (b) and (c) confirm that neither of these distributions is approximately 
Normal. 


HOW TO ASSESS NORMALITY WITH A NORMAL PROBABILITY PLOT 


If the points on a Normal probability plot lie close to a straight line, the data are 
approximately Normally distributed. A nonlinear form in a Normal probability plot indicates 
a non-Normal distribution. 


When examining a Normal probability plot, look for shapes that show clear departures from 
Normality. Don’t overreact to minor wiggles in the plot. We used a TI-84 to generate three 
different random samples of size 20 from a Normal distribution. The screen shots show Normal 
probability plots for each of the samples. Although none of the plots is perfectly linear, it is 
reasonable to believe that each sample came from a Normal population. 


Sample 1 Sample 2 Sample 3 


EXAMPLE | How Normal are survival times? (>) 
Interpreting Normal probability plots 


PROBLEM: Researchers recorded the survival times in days of 72 guinea pigs after they 
were injected with infectious bacteria in a medical experiment.2 A Normal probability plot of 
the data is shown. Use the graph to determine if the distribution of survival times is 
approximately Normal. 


Expected z-score 


1 
0 100 200 300 400 500 600 
Survival time (days) 


SOLUTION: 


The Normal probability plot ts clearly curved, indicating that the distribution of survival 
time for the 72 guinea pigs ts not approximately Normal. 


FOR PRACTICE, TRY EXERCISE 79 


Think About It 


HOW CAN WE DETERMINE SHAPE FROM A NORMAL PROBABILITY PLOT? 
Look at the Normal probability plot of the guinea pig survival data in the example. Imagine 
all the points falling down onto the horizontal axis. The resulting dotplot would have many 
values stacked up between 50 and 150 days, and fewer values that are further spread apart 
from 150 to about 600 days. The distribution would be skewed to the right due to the greater 
variability in the upper half of the data set. The dotplot of the data confirms our answer. 
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7. Technology Corner | MAKING NORMAL PROBABILITY PLOTS 
Ne 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 
Let’s use the TI-83/84 to make a Normal probability plot for the IQ score data (page 132). 


1. Enter the data values in list L1. 


e Press|STAT| and choose Edit.... 


e Type the values into list Lj. 


NORMAL FLOAT AUTO REAL DEGREE MP 


Plot2 Plots 

Off 

Type: ke i Jb te th a 
Data List:Li 


Data Axis: Y 


2. Set up a Normal probability plot in the statistics plots menu. 


° Press (STAT PLOT). 
e Press or |1| to go into Plot1. 


e Adjust the settings as shown. 


DRHAL FLORT AUTO REAL DEGREE MP 1 


PlotiLa 


Y="2.39398 


3. Use ZoomStat to see the finished graph. 


Interpretation: The Normal probability plot is quite linear, which confirms our earlier belief 
that the distribution of IQ scores is approximately Normal. 


Section 2.2 Summary 


e We can describe the overall pattern of a distribution by a density curve. A density curve 
always remains on or above the horizontal axis and has total area 1 underneath it. An area 
under a density curve estimates the proportion of observations that fall in an interval of 
values. 


e A density curve is an idealized description of the overall pattern of a distribution that 
smooths out the irregularities in the actual data. We write the mean of a density curve as [ 
and the standard deviation of a density curve as o to distinguish them from the mean x & 
and the standard deviation s, of the actual data. 


The mean and the median of a density curve can be located by eye. The mean p is the 
balance point of the curve. The median divides the area under the curve in half. The standard 
deviation o cannot be located by eye on most density curves. 


The mean and median are equal for symmetric density curves. The mean of a skewed density 
curve is located farther toward the long tail than the median is. 


Normal distributions are described by a special family of bell-shaped, symmetric density 
curves, called Normal curves. The mean p and standard deviation o completely specify a 
Normal distribution. The mean is the center of the curve, and a is the distance from p to the 
change-of-curvature points on either side. 


The 68—95-99.7 rule describes what percent of observations in any Normal distribution fall 
within 1, 2, and 3 standard deviations of the mean. 


All Normal distributions are the same when observations are standardized. If x follows a 
Normal distribution with mean p and standard deviation o, we can standardize using 


rp 


Z=X-ll0s 
The variable z has the standard Normal distribution with mean 0 and standard deviation 1. 
Table A in the back of the book gives percentiles for the standard Normal distribution. You 
can use Table A or technology to determine area for given values of the variable or the value 
that corresponds to a given percentile in any Normal distribution. 
To find the area in a Normal distribution corresponding to given values: 
Step 1: Draw a Normal distribution with the horizontal axis labeled and scaled using the 
mean and standard deviation, the boundary value(s) clearly identified, and the area of interest 
shaded. 
Step 2: Perform calculations—show your work! Do one of the following: 


i. Standardize each boundary value and use Table A or technology to find the desired area 
under the standard Normal curve; or 
ii. Use technology to find the desired area without standardizing. 


Be sure to answer the question that was asked. 


To find the value in a Normal distribution corresponding to a given percentile (area): 

Step 1: Draw a Normal distribution with the horizontal axis labeled and scaled using the 
mean and standard deviation, the area of interest shaded, and unknown boundary value 
clearly marked. 

Step 2: Perform calculations—show your work! Do one of the following: 


i. Use Table A or technology to find the value of z with the indicated area under the 
standard Normal curve, then “unstandardize” to transform back to the original 
distribution; or 

ii. Use technology to find the desired area without standardizing. 


Be sure to answer the question that was asked. 


e To assess Normality for a given set of quantitative data, we first observe the shape of a 
dotplot, stemplot, or histogram. Then we can check how well the data fit the 68—95—99.7 rule 
for Normal distributions. Another good method for assessing Normality is to construct a 
Normal probability plot. If the Normal probability plot has a linear form, then we can say 
that the distribution is approximately Normal. 


2.2 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


5. Finding areas from values in a Normal distribution Page 121 
6. Finding values from areas in a Normal distribution Page 128 
7. Making Normal probability plots Page 136 


Section 2.2 Exercises 


41. pg 111 |») Where’s the bus? Sally takes the same bus to work every morning. The 


amount of time (in minutes) that she has to wait for the bus can be modeled by a uniform 
distribution on the interval from 0 minutes to 10 minutes. 


a. Draw a density curve to model the amount of time that Sally has to wait for the bus. Be 
sure to include scales on both axes. 


b. On about what percent of days does Sally wait between 2.5 and 5.3 minutes for the 
bus? 


c. Find the 70th percentile of Sally’s wait times. 


42. Still waiting for the server? How does your web browser get a file from the Internet? 
Your computer sends a request for the file to a web server, and the web server sends back 
a response. For one particular web server, the time (in seconds) after the start of an hour at 
which a request is received can be modeled by a uniform distribution on the interval from 
0 to 3600 seconds. 


a. Draw a density curve to model the amount of time after an hour at which a request is 
received by the web server. Be sure to include scales on both axes. 


b. About what proportion of requests are received within the first 5 minutes (300 seconds) 
after the hour? 


c. Find the interquartile range of this distribution. 


43. Quick, click! An Internet reaction time test asks subjects to click their mouse button as 
soon as a light flashes on the screen. The light is programmed to go on at a randomly 
selected time after the subject clicks “Start.” The density curve models the amount of time 
the subject has to wait for the light to flash. 


0 1 2 3 2 5 
Time (sec) until the light flashes 


a. What height must the density curve have? Justify your answer. 


b. About what percent of the time will the light flash more than 3.75 seconds after the 
subject clicks “Start”? 


c. Calculate and interpret the 38th percentile of this distribution. 


44. Class is over! Mr. Shrager does not always let his statistics class out on time. In fact, he 
seems to end class according to his own “internal clock.” The density curve models the 
distribution of the amount of time after class ends (in minutes) when Mr. Shrager 
dismisses the class. (A negative value indicates he ended class early.) 


1 0 4 


Dismissal time (minutes after class ends) 
a. What height must the density curve have? Justify your answer. 


b. About what proportion of the time does Mr. Shrager dismiss class within 1 minute of its 
scheduled end time? 


c. Calculate and interpret the 20th percentile of the distribution. 


45. pg 112 |») Mean and median The figure displays two density curves that model 


different distributions of quantitative data. Identify the location of the mean and median 
by letter for each graph. Justify your answers. 


46. Mean and median The figure displays two density curves that model different 
distributions of quantitative data. Identify the location of the mean and median by letter 
for each graph. Justify your answers. 


(a) ABC (b) AB C 


47. pg 115 |») Potato chips The weights of 9-ounce bags of a particular brand of potato 


chips can be modeled by a Normal distribution with mean p=9.12/ = 9-12 ounces and 
standard deviation o=0.05 ounces = 0.05 ounce, Sketch the Normal density curve. Label 
the mean and the points that are 1, 2, and 3 standard deviations from the mean. 


48. Batter up! In baseball, a player’s batting average is the proportion of times the player gets 
a hit out of his total number of times at bat. The distribution of batting averages in a recent 
season for Major League Baseball players with at least 100 plate appearances can be 
modeled by a Normal distribution with mean p=0.261/4 — 0.261 and standard deviation 
0=0.0340 = 0.034, Sketch the Normal density curve. Label the mean and the points that 
are 1, 2, and 3 standard deviations from the mean. 


49. Normal curve Estimate the mean and standard deviation of the Normal density curve 
below. 


= ee aa i al ik ke at T _— =z! esl aS 
3 45 6 7 8 9 10 11 122 13 144 15 16 17 


50. Normal curve Estimate the mean and standard deviation of the Normal density curve 
below. 


10 15 20 25 30 35 40 45 50 


51. pg 118 |») Potato chips Refer to Exercise 47. Use the 68-95-99.7 rule to answer the 


following questions. 


a. About what percent of bags weigh less than 9.02 ounces? Show your method clearly. 


52. 


53. 


54. 


50. 


56. 


o7. 


58. 


59. 


b. A bag that weighs 9.07 ounces is at about what percentile in this distribution? Justify 
your answer. 


Batter up! Refer to Exercise 48. Use the 68—95—99.7 rule to answer the following 
questions. 


a. About what percent of Major League Baseball players with 100 plate appearances had 
batting averages of 0.363 or higher? Show your method clearly. 


b. A player with a batting average of 0.227 is at about what percentile in this distribution? 
Justify your answer. 


pg 122 |») Potato chips Refer to Exercise 47. About what percent of 9-ounce bags of 


this brand of potato chips weigh less than the advertised 9 ounces? Is this likely to pose a 
problem for the company that produces these chips? 


Batter up! Refer to Exercise 48. A player with a batting average below 0.200 is at risk of 
sitting on the bench during important games. About what percent of players are at risk? 


pg 124 |») Watch the salt! A study investigated about 3000 meals ordered from 


Chipotle restaurants using the online site Grubhub. Researchers calculated the sodium 
content (in milligrams) for each order based on Chipotle’s published nutrition information. 
The distribution of sodium content is approximately Normal with mean 2000 mg and 
standard deviation 500 mg.2 About what percent of the meals ordered exceeded the 
recommended daily allowance of 2400 mg of sodium? 


Blood pressure According to a health information website, the distribution of adults’ 
diastolic blood pressure (in millimeters of mercury) can be modeled by a Normal 
distribution with mean 70 and standard deviation 20. A diastolic pressure above 100 for an 
adult is classified as very high blood pressure. About what proportion of adults have very 
high blood pressure according to this criterion? 


pg 126 |») Watch the salt! Refer to Exercise 55. About what percent of the meals 
ordered contained between 1200 mg and 1800 mg of sodium? 


Blood pressure Refer to Exercise 56. According to the same health information website, a 
diastolic blood pressure between 80 and 90 indicates borderline high blood pressure. 
About what percent of adults have borderline high blood pressure? 


Standard Normal areas Find the proportion of observations in a standard Normal 
distribution that satisfies each of the following statements. 


a. 
7>-1.662 > —1.66 


~1.66<z<2.85—1.66 < z < 2.85 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


Standard Normal areas Find the proportion of observations in a standard Normal 
distribution that satisfies each of the following statements. 


a. 
7<—-2.462 < —2.46 


0.89<z<2.460.89 < z < 2.46 


Sudoku Mrs. Starnes enjoys doing Sudoku puzzles. The time she takes to complete an 
easy puzzle can be modeled by a Normal distribution with mean 5.3 minutes and standard 
deviation 0.9 minute. 


a. What proportion of the time does Mrs. Starnes finish an easy Sudoku puzzle in less 
than 3 minutes? 


b. How often does it take Mrs. Starnes more than 6 minutes to complete an easy puzzle? 


c. What percent of easy Sudoku puzzles take Mrs. Starnes between 6 and 8 minutes to 
complete? 


Hit an ace! Professional tennis player Novak Djokovic hits the ball extremely hard. His 
first-serve speeds can be modeled by a Normal distribution with mean 112 miles per hour 
(mph) and standard deviation 5 mph. 


a. How often does Djokovic hit his first serve faster than 120 mph? 
b. What percent of Djokovic’s first serves are slower than 100 mph? 


c. What proportion of Djokovic’s first serves have speeds between 100 and 110 mph? 
pg 129 |») Sudoku Refer to Exercise 61. Find the 20th percentile of Mrs. Starnes’s 
Sudoku times for easy puzzles. 


Hit an ace! Refer to Exercise 62. Find the 85th percentile of Djokovic’s first-serve 
speeds. 


Deciles The deciles of any distribution are the values at the 10th, 20th, ... , 90th 
percentiles. The first and last deciles are the 10th and the 90th percentiles, respectively. 
What are the first and last deciles of the standard Normal distribution? 


Outliers The percent of the observations that are classified as outliers by the 1.5xIQR 
1.5 x IQR rule is the same in any Normal distribution. What is this percent? Show your 
method clearly. 


IQ test scores Scores on the Wechsler Adult Intelligence Scale (an IQ test) for the 20- to 


34-year-old age group are approximately Normally distributed with p=1104 — 110 and 
o=250 = 25, 


a. What percent of people aged 20 to 34 have IQs between 125 and 150? 


b. MENSA is an elite organization that admits as members people who score in the top 
2% on IQ tests. What score on the Wechsler Adult Intelligence Scale would an 
individual aged 20 to 34 have to earn to qualify for MENSA membership? 


68. Post office A local post office weighs outgoing mail and finds that the weights of first- 
class letters are approximately Normally distributed with a mean of 0.69 ounce and a 
standard deviation of 0.16 ounce. 


a. Estimate the 60th percentile of first-class letter weights. 


b. First-class letters weighing more than 1 ounce require extra postage. What proportion 
of first-class letters at this post office require extra postage? 


Exercises 69 and 70 refer to the following setting. At some fast-food restaurants, customers 
who want a lid for their drinks get them from a large stack near the straws, napkins, and 
condiments. The lids are made with a small amount of flexibility so they can be stretched 
across the mouth of the cup and then snugly secured. When lids are too small or too large, 
customers can get very frustrated, especially if they end up spilling their drinks. At one 
particular restaurant, large drink cups require lids with a “diameter” of between 3.95 and 4.05 
inches. The restaurant’s lid supplier claims that the diameter of its large lids follows a Normal 
distribution with mean 3.98 inches and standard deviation 0.02 inch. Assume that the supplier’s 
claim is true. 


69. Put a lid on it! 
a. What percent of large lids are too small to fit? 
b. What percent of large lids are too big to fit? 


c. Compare your answers to parts (a) and (b). Does it make sense for the lid manufacturer 
to try to make one of these values larger than the other? Why or why not? 


70. Put a lid on it! The supplier is considering two changes to reduce to 1% the percentage of 
its large-cup lids that are too small. One strategy is to adjust the mean diameter of its lids. 
Another option is to alter the production process, thereby decreasing the standard 
deviation of the lid diameters. 


a. If the standard deviation remains at o=0.020 = 0.02 inch, at what value should the 
supplier set the mean diameter of its large-cup lids so that only 1% are too small to fit? 


b. If the mean diameter stays at p=3.984 — 3.98 inches, what value of the standard 
deviation will result in only 1% of lids that are too small to fit? 


71, 


72. 


73. 


74. 


iy 


c. Which of the two options in parts (a) and (b) do you think is preferable? Justify your 
answer. (Be sure to consider the effect of these changes on the percent of lids that are 
too large to fit.) 


Flight times An airline flies the same route at the same time each day. The flight time 
varies according to a Normal distribution with unknown mean and standard deviation. On 
15% of days, the flight takes more than an hour. On 3% of days, the flight lasts 75 minutes 
or more. Use this information to determine the mean and standard deviation of the flight 
time distribution. 


Brush your teeth The amount of time Ricardo spends brushing his teeth follows a 
Normal distribution with unknown mean and standard deviation. Ricardo spends less than 
1 minute brushing his teeth about 40% of the time. He spends more than 2 minutes 
brushing his teeth 2% of the time. Use this information to determine the mean and 
standard deviation of this distribution. 


Normal highway driving? The dotplot shows the EPA highway gas mileage estimates in 


miles per gallon (mpg) for a random sample of 21 model year 2018 midsize cars. 


Explain why this distribution of highway gas mileage is not approximately Normal. 


7 ee ee ee: ee a © 
25 30 35 40 45 50 
Highway gas mileage (mpg) 


Normal to be foreign born? The histogram displays the percent of foreign-born residents 
in each of the 50 states.42 Explain why this distribution of the percent of foreign-born 
residents in the states is not approximately Normal. 


Number of states 


0 5 10 15 20 25 30 
Percent foreign-born residents 


132 Refrigerators Consumer Reports magazine collected data on the usable 
pg. ey 


capacity (in cubic feet) of a sample of 36 side-by-side refrigerators. Here are the data: 


12.9 13.7 141 142 145 145 146 147 #151 #152 153 £153 
153 153 155 156 156 158 160 160 162 162 163 164 


165 166 166 166 168 170 170 172 #174 #174 #179 «184 


A histogram of the data and summary statistics are shown here. Is this distribution of 
refrigerator Capacities approximately Normal? Justify your answer based on the graph and 
the 68—95—99.7 rule. 


Usable capacity (ft) 


n Mean SD Min Q,; Med Q, Max 
36 15.825 1.217 12.9 15.15 159 166 18.4 


76. Big sharks Here are the lengths (in feet) of 44 great white sharks:42 


18.7 12.3 18.6 16.4 15.7 18.3 14.6 15.8 14.9 17.6 12.1 
16.4 16.7 17.8 16.2 12.6 17.8 13.8 iZ-2 1oee 14.7 12.4 
13.2 15.8 14.3 16.6 9.4 18.2 13.2 13.6 15.3 16.1 13.5 
od 16.2 22.8 16.8 13.6 13.2 15% 19.7 18.7 13.2 16.8 


A dotplot of the data and summary statistics are shown below. Is this distribution of shark 
length approximately Normal? Justify your answer based on the graph and the 68—95-99.7 


rule. 
— & 
es $3. ° a eseeoe : ses os ones J & = 
I i] 
8 10 12 14 16 18 20 22 
Length (ft) 
n Mean SD Min Q1 Med Q3 Max 


44 15.586 2.55 9.4 13.55 15.75 17.2 22.8 


77. Is Michigan Normal? We collected data on the tuition charged by colleges and 
universities in Michigan. Here are some numerical summaries for the data: 


Mean SD Min Max 
10,614 8049 1873 30,823 


Based on the relationship between the mean, standard deviation, minimum, and maximum, 
is it reasonable to believe that the distribution of Michigan tuitions is approximately 
Normal? Explain your answer. 


78. Are body weights Normal? The heights of people of the same gender and similar ages 
follow Normal distributions reasonably closely. How about body weights? The weights of 
women aged 20 to 29 have mean 141.7 pounds and median 133.2 pounds. The first and 
third quartiles are 118.3 pounds and 157.3 pounds. Is it reasonable to believe that the 
distribution of body weights for women aged 20 to 29 is approximately Normal? Explain 
your answer. 


79. pg 135 _») Runners’ heart rates The following figure is a Normal probability plot of 


the heart rates of 200 male runners after 6 minutes of exercise on a treadmill.14 Use the 
graph to determine if this distribution of heart rates is approximately Normal. 


Expected z-score 


i I 
70 80 90 100 110 120 130 140 150 
Heart rate (beats per minute) 
80. Carbon dioxide emissions The following figure is a Normal probability plot of the 
emissions of carbon dioxide (CO) per person in 48 countries./2 Use the graph to 


determine if this distribution of CO, emissions is approximately Normal. 


Expected z-score 


| 
0 2 4 6 8 10 12 14 16 18 20 
CO, emissions (metric tons per person) 


81. Normal states? The Normal probability plot displays data on the areas (in thousands of 
square miles) of each of the 50 states. Use the graph to determine if the distribution of 
land area is approximately Normal. 


Expected z-score 


T “T it 
200 400 600 


Area (thousands of mi’) 


— ae ae) 


82. Density of the earth In 1798, the English scientist Henry Cavendish measured the density 
of the earth several times by careful work with a torsion balance. The variable recorded 
was the density of the earth as a multiple of the density of water. A Normal probability 
plot of the data is shown.1® Use the graph to determine if this distribution of density 
measurement is approximately Normal. 


Expected z-score 


5.0 5.2 5.4 5.6 5.8 
Density of earth 


83. Refrigerators Refer to Exercise 75. 


a. Use your calculator to make a Normal probability plot of the data. Sketch this graph on 
your paper. 


b. What does the graph in part (a) imply about whether the distribution of refrigerator 
capacity is approximately Normal? Explain. 


84. Big sharks Refer to Exercise 76. 


a. Use your calculator to make a Normal probability plot of the data. Sketch this graph on 
your paper. 


b. What does the graph in part (a) imply about whether the distribution of shark length is 
approximately Normal? Explain. 


Multiple Choice Select the best answer for Exercises 85—90. 


85. Two measures of center are marked on the density curve shown. Which of the following is 
correct? 


a 


a. The median is at the yellow line and the mean is at the red line. 


b. The median is at the red line and the mean is at the yellow line. 
c. The mode is at the red line and the median is at the yellow line. 
d. The mode is at the yellow line and the median is at the red line. 


e. The mode is at the red line and the mean is at the yellow line. 


Exercises 86—88 refer to the following setting. The weights of laboratory cockroaches 
can be modeled with a Normal distribution having mean 80 grams and standard 
deviation 2 grams. The following figure is the Normal curve for this distribution of 
weights. 


A B Cc D E F G 


86. Point C on this Normal curve corresponds to 
a. 84 grams. 
b. 82 grams. 


c. 78 grams. 


d 


e 


. 76 grams. 


. 74 grams. 


87. About what percent of the cockroaches have weights between 76 and 84 grams? 


88. 


89. 


90. 


a. 
b. 
et 
d. 


e. 


99.7% 
95% 
68% 
47.5% 


34% 


About what proportion of the cockroaches will have weights greater than 83 grams? 


a. 
b. 
ci 
d. 


e. 


0.0228 
0.0668 
0.1587 
0.9332 
0.0772 


A different species of cockroach has weights that are approximately Normally distributed 
with a mean of 50 grams. After measuring the weights of many of these cockroaches, a lab 
assistant reports that 14% of the cockroaches weigh more than 55 grams. Based on this 
report, what is the approximate standard deviation of weight for this species of 
cockroaches? 


a. 


b 


et 
d. 


e. 


4.6 
yo) 
6.2 
14.0 


Cannot determine without more information. 


The following Normal probability plot shows the distribution of points scored for the 551 
players in a single NBA season. 
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Points 


If the distribution of points was displayed in a histogram, what would be the best 
description of the histogram’s shape? 


a. Approximately Normal 

b. Symmetric but not approximately Normal 
c. Skewed left 

d. Skewed right 


e. Cannot be determined 
Recycle and Review 


91. Making money (2.1) The parallel dotplots show the total family income of randomly 
chosen individuals from Indiana (38 individuals) and New Jersey (44 individuals). Means 
and standard deviations are given below the dotplots. 


Indiana **** 


New Jersey — i atest . ™ —s . 
0 25,000 50,000 75,000 100,000 125,000 150,000 175,000 
Total family income ($) 
tandard 
Mean s =e 
deviation 


Indiana $47,400 $29,400 
New Jersey $58,100 $41,900 


Consider individuals in each state with total family incomes of $95,000. Which individual 
has a higher income, relative to others in his or her state? Use percentiles and z-scores to 
support your answer. 


92. More money (1.3) Refer to Exercise 91. 


a. How do the ranges of the two distributions compare? Justify your answer. 


b. Explain why the standard deviation of the total family incomes in the New Jersey 
sample is so much larger than for the Indiana sample. 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


The distribution of scores on a recent test closely followed a Normal distribution with a 
mean of 22 points and a standard deviation of 4 points. 


a. What proportion of the students scored at least 25 points on this test? 
b. What is the 31st percentile of the distribution of test scores? 


c. The teacher wants to transform the test scores so that they have an approximately 
Normal distribution with a mean of 80 points and a standard deviation of 10 points. 
To do this, she will use a formula in the form 


new score=atb (old score)’ 8©ore = @ + b(old score) 


Find the values of a and b that the teacher should use to transform the distribution 
of test scores. 


. Before the test, the teacher gave a review assignment for homework. The maximum 
score on the assignment was 10 points. The distribution of scores on this 
assignment had a mean of 9.2 points and a standard deviation of 2.1 points. Would 
it be appropriate to use a Normal distribution to calculate the proportion of students 
who scored below 7 points on this assignment? Explain your answer. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own score. 


Chapter 2 Review 


Section 2.1: Describing Location in a Distribution 


In this section, you learned two different ways to describe the location of individuals in a 
distribution: percentiles and standardized scores (z-scores). Percentiles describe the location of 
an individual by measuring what percent of the observations in the distribution have a value 
less than the individual’s value. A cumulative relative frequency graph is a handy tool for 
identifying percentiles in a distribution. You can use it to estimate the percentile for a particular 
value of a variable or to estimate the value of the variable at a particular percentile. 


Standardized scores (z-scores) describe the location of an individual in a distribution by 
measuring how many standard deviations the individual is above or below the mean. To find 
the standardized score for a particular observation, transform the value by subtracting the mean 
and then dividing the difference by the standard deviation. Besides describing the location of an 
individual in a distribution, you can also use z-scores to compare observations from different 
distributions—standardizing the values puts them on a standard scale. 


You also learned to describe the effects on the shape, center, and variability of a 
distribution when transforming data from one scale to another. Adding a positive constant to 
(or subtracting it from) each value in a data set changes the measures of center and location, but 
not the shape or variability of the distribution. Multiplying or dividing each value in a data set 
by a positive constant changes the measures of center and location and measures of variability, 
but not the shape of the distribution. 


Section 2.2: Density Curves and Normal Distributions 


In this section, you learned how density curves are used to model distributions of data. An area 
under a density curve estimates the proportion of observations that fall in a specified interval of 
values. The total area under a density curve is 1, or 100%. 


The most commonly used density curve is called a Normal curve. The Normal curve is 
symmetric, single-peaked, and bell-shaped with mean p and standard deviation o. For any 
distribution of data that is approximately Normal in shape, about 68% of the observations will 
be within 1 standard deviation of the mean, about 95% of the observations will be within 2 
standard deviations of the mean, and about 99.7% of the observations will be within 3 standard 
deviations of the mean. Conveniently, this relationship is called the 68—95—99.7 rule. 


When observations do not fall exactly 1, 2, or 3 standard deviations from the mean, you 
learned how to use Table A or technology to identify the proportion of values in any specified 
interval under a Normal curve. You also learned how to use Table A or technology to 
determine the value of an individual that falls at a specified percentile in a Normal distribution. 
On the AP® Statistics exam, it is extremely important that you clearly communicate your 
methods when answering questions that involve a Normal distribution. Shading a Normal curve 


with the mean, standard deviation, and boundaries clearly identified is a great start. If you use 
technology to perform calculations, be sure to label the inputs of your calculator commands. 

Finally, you learned how to determine if a distribution of data is approximately Normal 
using graphs (dotplots, stemplots, histograms) and the 68—95-—99.7 rule. You also learned that a 
Normal probability plot is a great way to determine whether the shape of a distribution is 
approximately Normal. The more linear the Normal probability plot, the more Normal the 
distribution of the data. 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 
Learning Target Section Page(s) Exercise(s) 
Find and interpret the percentile of an individual value rant 92 R2.1, 
in a distribution of data. R2.3(C) 
Estimate percentiles and individual values using a 24 94 Rei 
cumulative relative frequency graph. 
Find and interpret the standardized score (z-score) of 2.1 96 R2.1 


an individual value in a distribution of data. 
Describe the effect of adding, subtracting, multiplying 2.1 99, 100, R23 


by, or dividing by a constant on the shape, center, and 101 

variability of a distribution of data. 

Use a density curve to model distributions of 2.2 111 R2.4 
quantitative data. 

Identify the relative locations of the mean and median 2.2 112 R2.4 
of a distribution from a density curve. 

Use the 68—95-—99.7 rule to estimate (i) the proportion 2.2 118 R2.5 


of values in a specified interval, or (ii) the value that 
corresponds to a given percentile in a Normal 


distribution. 

Find the proportion of values in a specified interval ina 2.2 122,124, AZ 5, 
Normal distribution using Table A or technology. 126 R26, R27 
Find the value that corresponds to a given percentile in 2.2 129 AZ, 

a Normal distribution using Table A or technology. R2.6, R2.7 
Determine whether a distribution of data is 2.2 132,135 R28, R29 


approximately Normal from graphical and numerical 
evidence. 


Chapter 2 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R2.1 |») Is Paul tall? According to the National Center for Health Statistics, the distribution 


of heights for 15-year-old males has a mean of 170 centimeters (cm) and a standard 
deviation of 7.5 cm. Paul is 15 years old and 179 cm tall. 


a. Find the z-score corresponding to Paul’s height. Explain what this value means. 


b. Paul’s height puts him at the 85th percentile among 15-year-old males. Explain what 
this means to someone who knows no statistics. 


R2.2 |») Computer use Mrs. Causey asked her students how much time they had spent 


watching television during the previous week. The figure shows a cumulative relative 
frequency graph of her students’ responses. 


a. At what percentile is a student who watched TV for 7 hours last week? 


b. Estimate from the graph the interquartile range (IQR) for time spent watching TV. 


Cumulative relative frequency (%) 


Time spent watching TV (h/wk) 


R2.3 |») Aussie, Aussie, Aussie A group of Australian students were asked to estimate the 


width of their classroom in feet. Use the dotplot and summary statistics to answer the 
following questions. 
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Guess (ft) 
Variable no Mean SD Min = Q, Med Q3; Max 
Guess (ft) 66 43.70 12.50 24.00 35.50 42.00 48.00 94.00 


a. Suppose we converted each student’s guess from feet to meters (3.28 ft=1 m 
3.28 ft = 1m), How would the shape of the distribution be affected? Find the mean, 
median, standard deviation, and JQR for the transformed data. 


b. The actual width of the room was 42.6 feet. Suppose we calculated the error in each 
student’s guess as follows: guess — 42.6. Find the mean and standard deviation of the 
errors in feet. 


c. Find the percentile for the student who estimated the classroom width as 63 feet. 
R2.4 |») Density curves The following figure is a density curve that models a distribution of 


quantitative data. Trace the curve onto your paper. 


Area = 0.08 


0 10 = 13 20 


a. What percent of observations have values less than 13? Justify your answer. 

b. Mark the approximate location of the median. Explain your choice of location. 

c. Mark the approximate location of the mean. Explain your choice of location. 
R2.5 |») Low-birth-weight babies Researchers in Norway analyzed data on the birth 


weights of 400,000 newborns over a 6-year period. The distribution of birth weights is 
approximately Normal with a mean of 3668 grams and a standard deviation of 511 
grams. Babies that weigh less than 2500 grams at birth are classified as “low birth 
weight.” 


a. Fill in the blanks: About 99.7% of the babies had birth weights between and 
grams. 


b. What percent of babies will be identified as low birth weight? 
c. Find the quartiles of the birth weight distribution. 


R2.6 ») Acing the GRE The Graduate Record Examinations (GREs) are widely used to help 
predict the performance of applicants to graduate schools. The scores on the GRE 


Chemistry test are approximately Normal with mean = 694 and standard deviation = 112. 


a. Approximately what percent of test takers earn a score less than 500 or greater than 
900 on the GRE Chemistry test? 


b. Estimate the 99th percentile score on the GRE Chemistry test. 


R2.7 \») Ketchup A fast-food restaurant has just installed a new automatic ketchup dispenser 


for use in preparing its burgers. The amount of ketchup dispensed by the machine can be 
modeled by a Normal distribution with mean 1.05 ounces and standard deviation 0.08 
ounce. 


a. If the restaurant’s goal is to put between 1 and 1.2 ounces of ketchup on each burger, 
about what percent of the time will this happen? 


b. Suppose that the manager adjusts the machine’s settings so that the mean amount of 
ketchup dispensed is 1.1 ounces. How much does the machine’s standard deviation 
have to be reduced to ensure that at least 99% of the restaurant’s burgers have 
between 1 and 1.2 ounces of ketchup on them? 


R2.8 |») Where the old folks live Here are a stemplot and numerical summaries of the 


percents of residents aged 65 and older in the 50 states and the District of Columbia. Is 
this distribution of the percent of state residents who are age 65 and older approximately 
Normal? Justify your answer based on the graph and the 68—95-99.7 rule. 


11 ¢ Key: 12|2 represents 
8 a state in which 

9; 0 12.2% of residents 
10) 379 are 65 and older. 
11| 44 

12) 02333445899 

13) 02234555557788889 

14) 012334445689 

15) 49 

16; 0 

171 3 


nm Mean SD Min Q; Med Q3 Max 
51 13.255 1.668 7.7 12.4 13.5 143 17.3 


R2.9 |») Assessing Normality Catherine and Ana gave an online reflex test to 33 varsity 


athletes at their school. The following Normal probability plot displays the data on 
reaction times (in milliseconds) for these students. Is the distribution of reaction times for 
these athletes approximately Normal? Why or why not? 


31098-2 payedxy 


Reaction time (ms) 


Chapter 2 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T2.1 Many professional schools require applicants to take a standardized test. Suppose that 
1000 students take such a test. Several weeks after the test, Pete receives his score 
report: he got a 63, which placed him at the 73rd percentile. This means that 


a. Pete’s score was below the median. 
b. 

C. 
d. 


e. 


Pete did worse than about 63% of the test takers. 
Pete did worse than about 73% of the test takers. 
Pete did better than about 63% of the test takers. 
Pete did better than about 73% of the test takers. 


T2.2 For the Normal distribution shown, the standard deviation is closest to 


e. 


oT p 


0 
1. 
2 
3 


Di 


T2.3 Rainwater was collected in water containers at 30 different sites near an industrial 
complex, and the amount of acidity (pH level) was measured. The mean and standard 
deviation of the values are 4.60 and 1.10, respectively. When the pH meter was 
recalibrated back at the laboratory, it was found to be in error. The error can be 
corrected by adding 0.1 pH unit to all of the values and then multiplying the result by 
1.2. What are the mean and standard deviation of the corrected pH measurements? 


a. 


5.64, 1.44 


b. 5.64, 1.32 


c. 5.40, 1.44 
d. 5.40, 1.32 
e. 5.64, 1.20 


T2.4 The figure shows a cumulative relative frequency graph of the number of ounces of 
alcohol consumed per week in a sample of 150 adults who report drinking alcohol 
occasionally. About what percent of these adults consume between 4 and 8 ounces per 


week? 
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Consumption (0z) 
a. 20% 
b. 40% 
c. 50% 
d. 60% 
e. 80% 


T2.5 The average yearly snowfall in Chillyville is approximately Normally distributed with 
a mean of 55 inches. If the snowfall in Chillyville exceeds 60 inches in 15% of the 
years, what is the standard deviation? 


a. 4.83 inches 
b. 5.18 inches 
c. 6.04 inches 
d. 8.93 inches 
e. The standard deviation cannot be computed from the given information. 


T2.6 The figure shown is the density curve of a distribution. Seven values are marked on the 
density curve. Which of the following statements is true? 


ry B C D EFG 


The mean of the distribution is E. 


. The area between B and F is 0.50. 


7 pf 


c. The median of the distribution is C. 
d. The 3rd quartile of the distribution is D. 
e. The area under the curve between A and G is 1. 


T2.7 If the heights of a population of men are approximately Normally distributed, and the 
middle 99.7% have heights between 5'0” and 7'0", what is the standard deviation of the 
heights in this population? 


a 
b. 3” 
c. 4” 
d. 6” 
e, 12” 


T2.8 The distribution of the time it takes for different people to solve a certain crossword 
puzzle is strongly skewed to the right with a mean of 30 minutes and a standard 
deviation of 15 minutes. The distribution of z-scores for those times is 


a. Normally distributed with mean 30 and SD 15. 
b. skewed to the right with mean 30 and SD 15. 


ie) 


Normally distributed with mean 0 and SD 1. 
d. skewed to the right with mean 0 and SD 1. 


e. skewed to the right, but the mean and standard deviation cannot be determined 
without more information. 


T2.9 The Environmental Protection Agency (EPA) requires that the exhaust of each model 
of motor vehicle be tested for the level of several pollutants. The level of oxides of 
nitrogen (NOX) in the exhaust of one light truck model was found to vary among 
individual trucks according to an approximately Normal distribution with mean p=1.45 
t= 1.45 grams per mile driven and standard deviation o=0.40c = 0.40 gram per mile. 


Which of the following best estimates the proportion of light trucks of this model with 
NOX levels greater than 2 grams per mile? 


a. 0.0228 
b. 0.0846 
c. 0.4256 
d. 0.9154 
e. 0.9772 


T2.10 Until the scale was changed in 1995, SAT scores were based on a scale set many 
years ago. For Math scores, the mean under the old scale in the early 1990s was 470 
and the standard deviation was 110. In 2016, the mean was 510 and the standard 
deviation was 103. Gina took the SAT in 1994 and scored 500. Her cousin Colleen 
took the SAT in 2016 and scored 530. Who did better on the exam, and how can you 
tell? 


a. Colleen—she scored 30 points higher than Gina. 
b. Colleen—her standardized score is higher than Gina’s. 
c. Gina—her standardized score is higher than Colleen’s. 
d. Gina—the standard deviation was larger in 2016. 


e. The two cousins did equally well—their z-scores are the same. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T2.11 The dotplot gives the sale prices for 40 houses in Ames, Iowa, sold during a recent 
month. The mean sale price was $203,388 with a standard deviation of $87,609. 
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a. Find the percentile of the house indicated in red on the dotplot. 


b. Calculate and interpret the standardized score (z-score) for the house indicated by 
the red dot, which sold for $234,000. 


T2.12 A study of 12,000 able-bodied male students at the University of Illinois found that 


their times for the mile run were approximately Normally distributed with mean 7.11 


minutes and standard deviation 0.74 minute.1® 


a. About how many students ran the mile in less than 6 minutes? 
b. Approximately how long did it take the slowest 10% of students to run the mile? 


c. Suppose that these mile run times are converted from minutes to seconds. Estimate 


the percent of students who ran the mile in between 400 and 500 seconds. 


T2.13 A study recorded the amount of oil recovered from the 64 wells in an oil field, in 
thousands of barrels. Here are descriptive statistics for that set of data from statistical 


software. 

Descriptive Statistics: Oilprod 
Variable N Mean Median StDev Min Max Q1 Q3 
Oilprod 64 48.25 37.80 40.24 2.00 204.90 21.40 60.75 


Based on the summary statistics, is the distribution of amount of oil recovered from the 
wells in this field approximately Normal? Justify your answer. 


Chapter 3 Describing Relationships 
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INTRODUCTION 


Investigating relationships between variables is central to what we do in statistics. When we 
understand the relationship between two variables, we can use the value of one variable to help 
us make predictions about the other variable. In Section 1.1, we explored relationships between 
categorical variables, such as membership in an environmental club and snowmobile use for 
visitors to Yellowstone National Park. The association between these two variables suggests 
that members of environmental clubs are less likely to own or rent snowmobiles than 
nonmembers. 

In this chapter, we investigate relationships between two quantitative variables. Does 
knowing how much money a Major League Baseball team spent on its players tell us anything 
about how many wins that team will have? What can we learn about the price of a used car 
from the number of miles it has been driven? Can students with larger hands grab more candy? 
The following activity will help you explore the last question. 


ACTIVITY Candy grab 


In this activity, you will investigate if students with a larger hand span can grab more candy 
than students with a smaller hand span.4 


Josh Tabor 


1. Measure the span of your dominant hand to the nearest half-centimeter (cm). Hand span is 


the distance from the tip of the thumb to the tip of the pinkie finger on your fully 
stretched-out hand. 


. One student at a time, go to the front of the class and use your dominant hand to grab as 
many candies as possible from the container. You must grab the candies with your fingers 
pointing down (no scooping!) and hold the candies for 2 seconds before counting them. 
After counting, put the candy back into the container. 


. On the board, record your hand span and number of candies in a table with the following 
headings: 


Hand span (cm) Number of candies 


. While other students record their values on the board, copy the table onto a piece of paper 
and make a graph. Begin by constructing a set of coordinate axes. Label the horizontal 
axis “Hand span (cm)” and the vertical axis “Number of candies.” Choose an appropriate 
scale for each axis and plot each point from your class data table as accurately as you can 
on the graph. 


. What does the graph tell you about the relationship between hand span and number of 
candies? Summarize your observations in a sentence or two. 


SECTION 3.1 Scatterplots and Correlation 


LEARNING TARGETS By the end of the section, you should be able to: 
Distinguish between explanatory and response variables for quantitative data. 


Make a scatterplot to display the relationship between two quantitative variables. 


Describe the direction, form, and strength of a relationship displayed ina 


scatterplot and identify unusual features. 
Interpret the correlation. 


Understand the basic properties of correlation, including how the correlation is 
influenced by outliers. 


Distinguish correlation from causation. 


Most statistical studies examine data on more than one variable. Fortunately, analysis of 
relationships between two variables builds on the same tools we used to analyze one variable. 
The principles that guide our work also remain the same: 


e Plot the data, then look for overall patterns and departures from those patterns. 
e Add numerical summaries. 


e When there’s a regular overall pattern, use a simplified model to describe it. 


Explanatory and Response Variables 


In the “Candy grab” activity, the number of candies is the response variable. Hand span is the 
explanatory variable because we anticipate that knowing a student’s hand span will help us 
predict the number of candies that student can grab. 


DEFINITION Response variable, Explanatory variable 


A response variable measures an outcome of a study. An explanatory variable 
may help predict or explain changes in a response variable. 


You will often see explanatory variables called independent variables and response variables 


called dependent variables. Because the words independent and dependent have other meanings 
in statistics, we won’t use them here. 


It is easiest to identify explanatory and response variables when we initially specify the 
values of one variable to see how it affects another variable. For instance, to study the effect of 


alcohol on body temperature, researchers gave several different amounts of alcohol to mice. 
Then they measured the change in each mouse’s body temperature 15 minutes later. In this 
case, amount of alcohol is the explanatory variable, and change in body temperature is the 
response variable. When we don’t specify the values of either variable before collecting the 
data, there may or may not be a clear explanatory variable. 


EXAMPLE | Diamonds and the SAT (>) 
Explanatory or response? 


stori/Deposit Photos 


PROBLEM: Identify the explanatory variable and response variable for the following 
relationships, if possible. Explain your reasoning. 
a. The weight (in carats) and the price (in dollars) for a sample of diamonds. 


b. The SAT math score and the SAT evidence-based reading and writing score for a sample 
of students. 


SOLUTION: 


a. Explanatory: weight; ‘Response: price. The weight of a diamond helps explain how 
expensive ats: 

b. Either variable could be the explanatory variable because each one could be used to 
predict or explain the other. 


FOR PRACTICE, TRY EXERCISE 1 


In many studies, the goal is to show that changes in one or more explanatory variables 
actually cause changes in a response variable. However, other explanatory—response 
relationships don’t involve direct causation. In the alcohol and mice study, alcohol actually 
causes a change in body temperature. But there is no cause-and-effect relationship between 
SAT math and evidence-based reading and writing scores. 


Displaying Relationships: Scatterplots 


Although there are many ways to display the distribution of a single quantitative variable, a 
scatterplot is the best way to display the relationship between two quantitative variables. 


DEFINITION Scatterplot 


A scatterplot shows the relationship between two quantitative variables 
measured on the same individuals. The values of one variable appear on the 
horizontal axis, and the values of the other variable appear on the vertical axis. 
Each individual in the data set appears as a point in the graph. 


Figure 3.1 shows a scatterplot that displays the relationship between hand span (cm) and 
number of Starburst™ candies for the 24 students in Mr. Tyson’s class who did the “Candy 
grab” activity. As you can see, students with larger hand spans were typically able to grab more 
candies. 
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FIGURE 3.1 Scatterplot of hand span (cm) and number of Starburst candies grabbed by 24 
students. Only 23 points appear because two students had hand spans of 19 cm and grabbed 
28 Starburst candies. 


After collecting data for two quantitative variables, it is easy to make a scatterplot. 


HOW TO MAKE A SCATTERPLOT 


e Label the axes. Put the name of the explanatory variable under the horizontal axis and the 
name of the response variable to the left of the vertical axis. If there is no explanatory 
variable, either variable can go on the horizontal axis. 


e Scale the axes. Place equally spaced tick marks along each axis, beginning at a 
convenient number just below the smallest value of the variable and continuing until you 


exceed the largest value. 


e Plot individual data values. For each individual, plot a point directly above that 
individual’s value for the variable on the horizontal axis and directly to the right of that 
individual’s value for the variable on the vertical axis. 


The following example illustrates the process of constructing a scatterplot. 


EXAMPLE | Buying wins (>) 
Making a scatterplot 


Ezra Shaw/Getty Images 


PROBLEM: Do baseball teams that spend more money on players also win more games? 
The table shows the payroll (in millions of dollars) and number of wins for each of the 30 

Major League Baseball teams during the 2016 regular season.2 Make a scatterplot to show 
the relationship between payroll and wins. 


Team Payroll Wins 
Arizona Diamondbacks 103 69 
Atlanta Braves 122 68 
Baltimore Orioles 157 89 
Boston Red Sox 245 93 
Chicago Cubs 182 103 
Chicago White Sox 141 78 
Cincinnati Reds 114 68 
Cleveland Indians 114 94 
Colorado Rockies 121 15 


Detroit Tigers 206 86 


Houston Astros 118 84 


Kansas City Royals 145 81 
Los Angeles Angels 181 74 
Los Angeles Dodgers 274 91 
Miami Marlins 81 79 
Milwaukee Brewers 75 73 
Minnesota Twins 112 59 
New York Mets 150 87 
New York Yankees 22h 84 
Oakland Athletics 98 69 
Philadelphia Phillies 117 71 
Pittsburgh Pirates 106 78 
San Diego Padres dy, 68 
San Francisco Giants 181 87 
Seattle Mariners 155 86 
St. Louis Cardinals 167 86 
Tampa Bay Rays 71 68 
Texas Rangers 169 95 
Toronto Blue Jays 159 89 
Washington Nationals 163 95 
SOLUTION: 
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e Label the axes. The explanatory variable is payroll because we think it might help 
explain the number of wins. 


e Scale the axes. 
e Plot individual data values. 


FOR PRACTICE, TRY EXERCISE 3 


Describing a Scatterplot 


To describe a scatterplot, follow the basic strategy of data analysis from Chapter 1: look for 
patterns and important departures from those patterns. 


The scatterplot in Figure 3.2(a) shows a positive association between wins and payroll for 
MLB teams in 2016. That is, teams that spent more money typically won more games. Other 
scatterplots, such as the one in Figure 3.2(b), show a negative association. Teams that allow 


their opponent to score more runs typically win fewer games. 
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FIGURE 3.2 Scatterplots using data from the 30 Major League Baseball teams in 2016. (a) 
There is a positive association between payroll (in millions of dollars) and number of wins. (b) 
There is a negative association between runs allowed and number of wins. 


In some cases, there is no association between two variables. For example, the following 
scatterplot shows the relationship between height (in centimeters) and the typical amount of 
sleep on a non-school night (in hours) for a sample of students.2 Knowing the height of a 
student doesn’t help predict how much he or she likes to sleep on Saturday night! 


Amount of sleep (h) 


Height (cm) 


DEFINITION Positive association, Negative association, No 
association 


Two variables have a positive association when above-average values of one 
variable tend to accompany above-average values of the other variable and when 
below-average values also tend to occur together. 


Two variables have a negative association when above-average values of one 
variable tend to accompany below-average values of the other variable. 


There is no association between two variables if knowing the value of one 
variable does not help us predict the value of the other variable. 


Identifying the direction of an association in a scatterplot is a good start, but there are 
several other characteristics that we need to address when describing a scatterplot. 


HOW TO DESCRIBE A SCATTERPLOT 


To describe a scatterplot, make sure to address the following four characteristics in the 
context of the data: 


e Direction: A scatterplot can show a positive association, negative association, or no 
association. 


Form: A scatterplot can show a linear form or a nonlinear form. The form is linear if the 
overall pattern follows a straight line. Otherwise, the form is nonlinear. 


Strength: A scatterplot can show a weak, moderate, or strong association. An association 
is strong if the points don’t deviate much from the form identified. An association is weak 
if the points deviate quite a bit from the form identified. 


Unusual features: Look for outliers that fall outside the overall pattern and distinct 
clusters of points. 


AP® EXAM TIP 


When you are asked to describe the association shown in a scatterplot, you are 
expected to discuss the direction, form, and strength of the association, along with 
any unusual features, in the context of the problem. This means that you need to 
use both variable names in your description. 


Even though they have opposite directions, both associations in Figure 3.2 on page 156 
have a linear form. However, the association between runs allowed and wins is stronger than 
the relationship between payroll and wins because the points in Figure 3.2(b) deviate less from 
the linear pattern. Each scatterplot has one potential outlier: In Figure 3.2(a), the Los Angeles 


Dodgers spent $274 million and had “only” 91 wins. In Figure 3.2(b), the Texas Rangers gave 
up 757 runs but had 95 wins. 
o@ Even when there is a clear relationship between two variables in a scatterplot, the 


direction of the association describes only the overall trend—not an absolute relationship. 
For example, even though teams that spend more generally have more wins, there are plenty of 
exceptions. The Minnesota Twins spent more money than six other teams, but had fewer wins 
than any team in the league. 


EXAMPLE | Old Faithful and fertility (») 
Describing a scatterplot 


BigshotD3/iStock/Getty Images 
PROBLEM: Describe the relationship in each of the following contexts. 


a. The scatterplot on the left shows the relationship between the duration (in minutes) of an 
eruption and the interval of time until the next eruption (in minutes) of Old Faithful during 
a particular month. 


b. The scatterplot on the right shows the relationship between the average income (gross 
domestic product per person, in dollars) and fertility rate (number of children per woman) 


in 187 countries.4 
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SOLUTION: 


a. There isa strong, positive linear relationship between the duration of an eruption and 
the interval of time until the next eruption. There are two main clusters of ‘points: one 
cluster has durations around 2 minutes with intervals around 55 minutes, and the 
other cluster has durations around 4.5 minutes with intervals around 90 minutes. 


Even with the clusters, the overall direction is still positive. In some cases, 
however, the points in a cluster go in the opposite direction of the overall 
association. 


b. There isa moderately strong, negative nonlinear relationship between average income 
and fertility rate in these countries. There is a potential outlier with an average income 
around $30,000 and a fertility rate around Au, 


The association is called “nonlinear” because the pattern of points is clearly 


curved. 


FOR PRACTICE, TRY EXERCISE 5 


CHECK YOUR UNDERSTANDING 


i. 


,, Is there a relationship between the amount of sugar (in grams) and the number of 
calories in movie-theater candy? Here are the data from a sample of 12 types of candy.2 


Name Sugar (g) Calories 


Butterfinger Minis 45 450 


Junior Mints 107 570 
M&M’S® 62 480 
Milk Duds 44 370 
Peanut M&M’S® 79 790 
Raisinets 60 420 
Reese’s Pieces 61 580 
Skittles 87 450 
Sour Patch Kids 92 490 
Sweetarts 136 680 
Twizzlers 59 460 
Whoppers 48 350 


1. Identify the explanatory and response variables. Explain your reasoning. 


2. Make a scatterplot to display the relationship between amount of sugar and the number of 
calories in movie-theater candy. 


3. Describe the relationship shown in the scatterplot. 


8. Technology Corner | MAKING SCATTERPLOTS 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


Making scatterplots with technology is much easier than constructing them by hand. We’ || 
use the MLB data from page 155 to show how to construct a scatterplot on a TI-83/84. 
1. Enter the payroll values in L1 and the number of wins in L2. 


e Press |STAT| and choose Edit.... 


e Type the values into L1 and L2. 


L1CD=103 


2. Set up a scatterplot in the statistics plots menu. 


° Press (STAT PLOT). 


e Press or |1| to go into Plot1. 


e Adjust the settings as shown. 


NORMAL FLOAT AUTO REAL RADIAN HP fl 
PRESS (<] OR (>] TO SELECT AN OPTION 


Plot2 Plot3 


Calor BLUE K> 


3. Use ZoomStat to let the calculator choose an appropriate window. 


e Press |ZOOM| and choose ZoomStat. 


AP® EXAM TIP 


If you are asked to make a scatterplot, be sure to label and scale both axes. 
Don’t just copy an unlabeled calculator graph directly onto your paper. 


Measuring Linear Association: Correlation 


A scatterplot displays the direction, form, and strength of a relationship between two 
quantitative variables. Linear relationships are particularly important because a straight line is a 
simple pattern that is quite common. A linear relationship is considered strong if the points lie 
close to a straight line and is considered weak if the points are widely scattered about the line. 
Unfortunately, our eyes are not the most reliable tools when it comes to judging the strength of 
a linear relationship. When the association between two quantitative variables is linear, we can 
use the correlation r to help describe the strength and direction of the association. 


Some people refer to r as the “correlation coefficient.” 


DEFINITION Correlation r 


For a linear association between two quantitative variables, the correlation r 
measures the direction and strength of the association. 


Here are some important properties of the correlation r: 
e The correlation r is always a number between —1 and 1 (-1 <r< 1). 


e The correlation r indicates the direction of a linear relationship by its sign: r > O for a 
positive association and r < 0 for a negative association. 


e The extreme values r = —1 andr = 1 occur only in the case of a perfect linear relationship, 
when the points lie exactly along a straight line. 


e If the linear relationship is strong, the correlation r will be close to 1 or -1. If the linear 
relationship is weak, the correlation r will be close to 0. 
Oo It is only appropriate to use the correlation to describe strength and direction for 
a linear relationship. This is why the word linear kept appearing in the list above! 


Figure 3.3 shows six scatterplots that correspond to various values of r. To make the 
meaning of r clearer, the standard deviations of both variables in these plots are equal, and the 
horizontal and vertical scales are the same. The correlation r describes the direction and 
strength of the linear relationship in each scatterplot. 


Correlation r = —0.7 Correlation r= 0.9 Correlation r = —0.99 


FIGURE 3.3 How correlation measures the strength and direction of a linear relationship. When 


the dots are tightly packed around a line, the correlation will be close to 1 or -1. 
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ACTIVITY Guess the correlation ¢€ 


In this activity, we will have a class competition to see who can best guess the correlation. 


1. Load the Guess the Correlation applet at www.rossmanchance.com/applets. 


2. The teacher will press the “New Sample” button to see a “random” scatterplot. As a class, 
try to guess the correlation. Type the guess in the “Correlation guess” box and press 
“Check Guess” to see how the class did. Repeat several times to see more examples. For 
the competition, there will be two rounds. 


3. Starting on one side of the classroom and moving in order to the other side, the teacher 
will give each student one new sample and have him or her guess the correlation. The 
teacher will then record how far off the guess was from the true correlation. 


4. Once every student has made an attempt, the teacher will give each student a second 
sample. This time, the students will go in reverse order so that the student who went first 
in Round 1 will go last in Round 2. The student who has the closest guess in either round 
wins a prize! 


Correlation Guessing Game 


Number of points: |25 100 isin 
New Sample 
© Edit/Paste Data li ee 
60 © 
Correlation guess: fo | > = 
Check Guess 40 ¢ Ma & 
20 s 


The following example illustrates how to interpret the correlation. 


EXAMPLE | Payroll and wins (>) 


Interpreting a correlation 


jonathansloane/E+/Getty Images 


PROBLEM: Here is the scatterplot showing the relationship between payroll (in millions of 
dollars) and wins for MLB teams in 2016. For these data, r = 0.613. Interpret the value of r. 


Number of wins 


50 100 150 200 250 300 
Payroll ($ millions) 


SOLUTION: 


The correlation of = Oe 2 confirms that the linear association between payroll and 
number of wins 1s moderately strong and positive. 


Always include context by using the variable names in your answer. 


FOR PRACTICE, TRY EXERCISE 15 


CHECK YOUR UNDERSTANDING 


A 


y The scatterplot shows the 40-yard-dash times (in seconds) and long-jump distances (in 
inches) for a small class of 12 students. The correlation for these data is r = —0.838. Interpret 


this value. 


Long-jump distance (in.) 


5.0 5.5 6.0 6.5 7.0 7.5 
Dash time (sec) 


Cautions about Correlation 


While the correlation is a good way to measure the strength and direction of a linear 
relationship, it has several limitations. 

@ Correlation doesn’t imply causation. In many cases, two variables might have a 
strong correlation, but changes in one variable are very unlikely to cause changes in the other 
variable. Consider the following scatterplot showing total revenue generated by skiing facilities 
in the United States and the number of people who died by becoming tangled in their bedsheets 
in 10 recent years.© The correlation for these data is r = 0.97. Does an increase in skiing 
revenue cause more people to die by becoming tangled in their bedsheets? We doubt it! 


= 


= 


from tangling 
e & 


Number of deaths 


a 


1600 1800 2000 2200 2400 
Revenue from skiing ($ millions) 


Even though we shouldn’t automatically conclude that there is a cause-and-effect 
relationship between two variables when they have an association, in some cases there might 
actually be a cause-and-effect relationship. You will learn how to distinguish these cases in 
Chapter 4. 

The following activity helps you explore some additional limitations of the correlation. 


pePLe> 


ACTIVITY Correlation and Regression applet % 


In this activity, you will use an applet to investigate some important properties of the 
correlation. Go to the book’s website at highschool.bfwpub.com/tps6e and launch the 
Correlation and Regression applet. 


1. You are going to use the Correlation and Regression applet to make several scatterplots 
that have correlation close to 0.7 


a. Start by putting two points on the graph. What’s the value of the correlation? Why does 
this make sense? 


b. Make a lower-left to upper-right pattern of 10 points with correlation about r = 0.7. You 
can drag points up or down to adjust r after you have 10 points. 


c. Make a new scatterplot, this time with 9 points in a vertical stack at the left of the plot. 
Add 1 point far to the right and move it until the correlation is close to 0.7. 


d. Make a third scatterplot, this time with 10 points in a curved pattern that starts at the 
lower left and rises to the right. Adjust the points up or down until you have a smooth 
curve with correlation close to 0.7. 

Summarize: If you know only that the correlation between two variables is r = 0.7, what can 
you say about the form of the relationship? 


2. Click on the scatterplot to create a group of 7 points in a U shape so that there is a strong 
nonlinear association. What is the correlation? 

Summarize: If you know only that the correlation between two variables is r = 0, what can 

you say about the strength of the relationship? 


3. Click on the scatterplot to create a group of 10 points in the lower-left corner of the 
scatterplot with a strong linear pattern (correlation about 0.9). 


a. Add 1 point at the upper right that is in line with the first 10. How does the correlation 
change? 

b. Drag this last point straight down. How small can you make the correlation? Can you 
make the correlation negative? 


Summarize: What did you learn from Step 3 about the effect of an outlier on the correlation? 


The activity highlighted some important cautions about correlation. oOo Correlation does 
not measure form. Here is a scatterplot showing the speed (in miles per hour) and the distance 
(in feet) needed to come to a complete stop when a motorcycle’s brake was applied.Z The 
association is clearly curved, but the correlation is quite large: r = 0.98. In fact, the correlation 
for this nonlinear association is much greater than the correlation of r = 0.613 for the MLB 
payroll data, which had a clear linear association. 


Braking distance (ft) 


0 10 20 30 40 50 
Speed (mph) 
Oo Correlation should only be used to describe linear relationships. The association 


displayed in the following scatterplot is extremely strong, but the correlation is r = 0. This isn’t 
a contradiction because correlation doesn’t measure the strength of nonlinear relationships. 


0 1 2 3 4 5 6 7 8 


yp The correlation is not a resistant measure of strength. In the following scatterplot, 
the correlation is r = —0.13. But when the outlier is excluded, the correlation becomes r = 0.72. 


Like the mean and the standard deviation, the correlation can be greatly influenced by 


outliers. 


EXAMPLE |Nobel chocolate (>) 
Cautions about correlation 


amphotora/Getty Images 


PROBLEM: Most people love chocolate for its great taste. But does it also make you 
smarter? A scatterplot like this one recently appeared in the New England Journal of 
Medicine.® The explanatory variable is the chocolate consumption per person for a sample of 
countries. The response variable is the number of Nobel Prizes per 10 million residents of 


that country. 
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a. If people in the United States started eating more chocolate, could we expect more Nobel 
Prizes to be awarded to residents of the United States? Explain. 


b. What effect does Switzerland have on the correlation? Explain. 
SOLUTION 


a. No: even though there is a strong correlation between chocolate consumption and the 
number of Nobel laureates in a country, causation should not be inferred. It is possible 
that both of these variables are changing due to another variable, such as per capita 
income. 


Not all questions about cause and effect include the word cause. Make sure to 


read questions—and reports in the media—very carefully. 


b. When Switzerland is included with the rest of the points, it makes the association 
stronger because it doesn’t vary much from the linear pattern. This makes the 
correlation closer to 1. 


FOR PRACTICE, TRY EXERCISES 17 AND 19 


Calculating Correlation 


Now that you understand the meaning and limitations of the correlation, let’s look at how it’s 
calculated. 


HOW TO CALCULATE THE CORRELATION r 


Suppose that we have data on variables x and y for n individuals. The values for the first 
individual are x; and y,, the values for the second individual are xj and y>, and so on. The 


means and standard deviations of the two variables are x © and s, for the x-values, and y ¥ 


and sy for the y-values. The correlation r between x and y is 


r=1n- 1[ (x1-x sx)(yl-y sy)+(x2- x sx)(y2-y sy)+...+(xn-x sx)(yn-y sy) ] 


al CS) 
(=) (4) 


r= D( 


or, More compactly, 


r=1n- 1)\(xi- x sx)(yi-y sy) 


The formula for the correlation r is a bit complex. It helps us understand some properties of 
correlation, but in practice you should use your calculator or software to find r. Exercises 21 
and 22 ask you to calculate a correlation step by step from the definition to solidify its 
meaning. 

Figure 3.4 shows the relationship between the payroll (in millions of dollars) and the 
number of wins for the 30 MLB teams in 2016. The red dot on the right represents the Los 
Angeles Dodgers, whose payroll was $274 million and who won 91 games. 


Number of wins 


The Los Angeles Dodgers had a 
payroll of $274 million and won 91 
games. 


50 100 150 200 250 300 
Payroll ($ millions) 
FIGURE 3.4 Scatterplot showing the relationship between payroll (in millions of dollars) and 
number of wins for 30 Major League Baseball teams in 2016. The point representing the Los 


Angeles Dodgers is highlighted in red. 


The formula for r begins by standardizing the observations. The value 


a 


xi-x sx 82 
in the correlation formula is the standardized payroll (z-score) of the ith team. In 2016, the 
mean payroll was x = $145.033% = $145.033 million with a standard deviation of 
sx = $46.87982 = $46.879 million. For the Los Angeles Dodgers, the corresponding z-score is 

__ 274—145.033 


2: > = 2.75 
ZX=274 — 145.03346.879=2.75 46.879 


The Dodgers’ payroll is 2.75 standard deviations above the mean. 
Likewise, the value 


yi—-Y 


yi-y sy *Y 
in the correlation formula is the standardized number of wins for the ith team. In 2016, the 
mean number of wins was y_ = 80.99 = 80-9 with a standard deviation of sx = 10.669. 
8, = 10.669. For the Los Angeles Dodgers, the corresponding z-score is 


zy=91 - 80.910.669=0.95" 10.669 


ZUMA Press Inc/Alamy 


The Dodgers’ number of wins is 0.95 standard deviation above the mean. 

Multiplying the Dodgers’ two z-scores, we get a product of (2.75)(0.95) = 2.6125. The 
correlation r is an “average” of the products of the standardized scores for all the teams. Just as 
in the case of the standard deviation s,, we divide by 1 fewer than the number of individuals to 
find the average. Finishing the calculation reveals that r = 0.613 for the 30 MLB teams. 


Some people like to write the correlation formula as 


r=—— Jnr 


r=1n -1) zxzy n-1 


to emphasize the product of standardized scores in the calculation. 


To understand what correlation measures, consider the graphs in Figure 3.5 on the next 
page. At the left is a scatterplot of the MLB data with two lines added—a vertical line at the 
mean payroll and a horizontal line at the mean number of wins. Most of the points fall in the 
upper-right or lower-left “quadrants” of the graph. Teams with above-average payrolls tend to 
have above-average numbers of wins, like the Dodgers. Teams with below-average payrolls 
tend to have numbers of wins that are below average. This confirms the positive association 
between the variables. 
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FIGURE 3.5 (a) Scatterplot showing the relationship between payroll (in millions of dollars) and 
number of wins for 30 Major League Baseball teams in 2016, with lines showing the mean of 
each variable. (b) Scatterplot showing the relationship between the standardized values of 
payroll and the standardized values of number of wins for the same 30 teams. 


Below on the right is a scatterplot of the standardized scores. To get this graph, we 
transformed both the x- and the y-values by subtracting their mean and dividing by their 
standard deviation. As we saw in Chapter 2, standardizing a data set converts the mean to O and 
the standard deviation to 1. That’s why the vertical and horizontal lines in the right-hand graph 
are both at 0. 

For the points in the upper-right quadrant and the lower-left quadrant, the products of the 
standardized values will be positive. Because most of the points are in these two quadrants, the 
sum of the z-score products will also be positive, resulting in a positive correlation r. 

What if there was a negative association between two variables? Most of the points would 
be in the upper-left and lower-right quadrants and their z-score products would be negative, 
resulting in a negative correlation. 


Additional Facts about Correlation 


Now that you have seen how the correlation is calculated, here are some additional facts about 
correlation. 


.. Correlation requires that both variables be quantitative, so that it makes sense to do the 


arithmetic indicated by the formula for r. We cannot calculate a correlation between the 
incomes of a group of people and what city they live in because city is a categorical variable. 
When one or both of the variables are categorical, use the term association rather than 
correlation. 


’. Correlation makes no distinction between explanatory and response variables. When 
calculating the correlation, it makes no difference which variable you call x and which you 
call y. Can you see why from the formula? 


AE (2) (2 
r=In-1¥ (xi-x sx)(yi-y sy) ™ + Sx Sy 


3. Because r uses the standardized values of the observations, r does not change when we 
change the units of measurement of x, y, or both. The correlation between height and weight 
won’t change if we measure height in centimeters rather than inches and measure weight in 
kilograms rather than pounds. 


|. The correlation r has no unit of measurement. It is just a number. 


EXAMPLE |Long strides (>) 
More about correlation 


Monkey Business Images/Shutterstock.com 


PROBLEM: The following scatterplot shows the height (in inches) and number of steps 
needed for a random sample of 36 students to walk the length of a school hallway. The 
correlation is r = —0.632. 
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a. Explain why it isn’t correct to say that the correlation is —0.632 steps per inch. 


b. What would happen to the correlation if number of steps was used as the explanatory 
variable and height was used as the response variable? 


c. What would happen to the correlation if height was measured in centimeters instead of 
inches? Explain. 


SOLUTION: 


a. Because correlation ts calculated using standardized values, it doesn’t have units. 


Although it is unlikely that you will need to calculate the correlation by hand, 


understanding how the formula works makes it easier to answer questions like these. 


b. The correlation would be the same because correlation doesn’t make a distinction 
between explanatory and response variables. 


c. The correlation would be the same. Because r ts calculated using standardized values, 
changes of units don’t affect correlation. 


Changing from inches to centimeters won’t change the locations of the points, only 


the numbers on the horizontal scale. 


FOR PRACTICE, TRY EXERCISE 23 


Section 3.1 Summary 


e A scatterplot displays the relationship between two quantitative variables measured on the 


same individuals. Mark values of one variable on the horizontal axis (x axis) and values of 
the other variable on the vertical axis (y axis). Plot each individual’s data as a point on the 
graph. 


e If we think that a variable x may help predict, explain, or even cause changes in another 
variable y, we call x an explanatory variable and y a response variable. Always plot the 
explanatory variable on the x axis of a scatterplot. Plot the response variable on the y axis. 


e When describing a scatterplot, look for an overall pattern (direction, form, strength) and 
departures from the pattern (unusual features) and always answer in context. 


: Direction: A relationship can have a positive association (when larger values of one 
variable tend to be paired with larger values of the other variable), a negative association 
(when larger values of one variable tend to be paired with smaller values of the other 
variable), or no association (when knowing the value of one variable doesn’t help predict 
the value of the other variable). 


: Form: The form of a relationship can be linear or nonlinear (curved). 


« Strength: The strength of a relationship is determined by how close the points in the 
scatterplot lie to a simple form such as a line. 


: Unusual features: Look for outliers that fall outside the pattern and distinct clusters of 
points. 


e For linear relationships, the correlation r measures the strength and direction of the 
association between two quantitative variables x and y. 


e Correlation indicates the direction of a linear relationship by its sign: r > 0 for a positive 
association and r < 0 for a negative association. Correlation always satisfies -1 <r < 1 with 
values of r closer to 1 and —1 indicating stronger associations. Correlations of r = 1 and r = 
—1 occur only when the points on a scatterplot lie exactly on a straight line. 


e Remember these limitations of r: Correlation does not imply causation. The correlation is not 
resistant, so outliers can greatly change the value of r. The correlation should only be used to 
describe linear relationships. 


e Correlation ignores the distinction between explanatory and response variables. The value of 
r does not have units and is not affected by changes in the unit of measurement of either 
variable. 


3.1 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


8. Making scatterplots Page 159 


Section 3.1 Exercises 


1. pg 154 |») Coral reefs and cell phones Identify the explanatory variable and the 
response variable for the following relationships, if possible. Explain your reasoning. 


a. The weight gain of corals in aquariums where the water temperature is controlled at 
different levels 


b. The number of text messages sent and the number of phone calls made in a sample of 
100 students 
2. Teenagers and corn yield Identify the explanatory variable and the response variable for 
the following relationships, if possible. Explain your reasoning. 


a. The height and arm span of a sample of 50 teenagers 


b. The yield of corn in bushels per acre and the amount of rain in the growing season 
3. PQ 155 |») Heavy backpacks Ninth-grade students at the Webb Schools go on a 


backpacking trip each fall. Students are divided into hiking groups of size 8 by selecting 
names from a hat. Before leaving, students and their backpacks are weighed. The data here 
are from one hiking group. Make a scatterplot by hand that shows how backpack weight 
relates to body weight. 


Body weight (Ib) 120 187 109 103 131 #165 158 #116 
Backpack weight (Ib) 26 30 26 24 29 35 31 28 


4. Putting success How well do professional golfers putt from various distances to the hole? 
The data show various distances to the hole (in feet) and the percent of putts made at each 
distance for a sample of golfers.2 Make a scatterplot by hand that shows how the percent of 
putts made relates to the distance of the putt. 


Distance (ft) Percent made 
2 93.3 
3 83.1 
4 74.1 
5 58.9 
6 54.8 
7 5o.1 
8 46.3 
9 31.8 

10 33.5 

A. 31.6 

12 25.7 

13 24.0 

14 31.0 


15 16.8 


16 13.4 


17 15.9 
18 17.3 
19 13.6 
20 15.8 


5. PQ 158 |») Olympic athletes The scatterplot shows the relationship between height (in 


inches) and weight (in pounds) for the members of the United States 2016 Olympic Track 
and Field team.22 Describe the relationship between height and weight for these athletes. 
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6. Starbucks The scatterplot shows the relationship between the amount of fat (in grams) and 
number of calories in products sold at Starbucks./4 Describe the relationship between fat 
and calories for these products. 


Calories 


0 5 10 15 20 25 30 35 
Fat (g) 


7. More heavy backpacks Refer to your graph from Exercise 3. Describe the relationship 
between body weight and backpack weight for this group of hikers. 

8. More putting success Refer to your graph from Exercise 4. Describe the relationship 
between distance from hole and percent of putts made for the sample of professional 
golfers. 


9. Does fast driving waste fuel? How does the fuel consumption of a car change as its speed 
increases? Here are data for a British Ford Escort. Speed is measured in kilometers per 
hour, and fuel consumption is measured in liters of gasoline used per 100 kilometers 


traveled 12 
Speed (km/h) Fuel used (L/100 km) 
10 21.00 
20 13.00 
30 10.00 
AO 8.00 
50 7.00 
60 5.90 
70 6.30 
80 6.95 
90 tor 
100 8.27 
110 9.03 
120 9.87 
130 10.79 
140 L177 
150 12.83 


a. Make a scatterplot to display the relationship between speed and fuel consumption. 


b. Describe the relationship between speed and fuel consumption. 

10. Do muscles burn energy? Metabolic rate, the rate at which the body consumes energy, is 
important in studies of weight gain, dieting, and exercise. We have data on the lean body 
mass and resting metabolic rate for 12 women who are subjects in a study of dieting. Lean 
body mass, given in kilograms, is a person’s weight leaving out all fat. Metabolic rate is 
measured in calories burned per 24 hours. The researchers believe that lean body mass is an 
important influence on metabolic rate. 


Mass 36.1 546 485 420 506 420 403 33.1 424 345 51.1 41.2 
Rate 995 1425 1396 1418 1502 1256 1189 913 1124 1052 1347 1204 


a. Make a scatterplot to display the relationship between lean body mass and metabolic 
rate. 


b. Describe the relationship between lean body mass and metabolic rate. 

11. More Olympics Athletes who participate in the shot put, discus throw, and hammer throw 
tend to have different physical characteristics than other track and field athletes. The 
scatterplot shown here enhances the scatterplot from Exercise 5 by plotting these athletes 
with blue squares. How are the relationships between height and weight the same for the 
two groups of athletes? How are the relationships different? 
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12. More Starbucks How do the nutritional characteristics of food products differ from drink 


products at Starbucks? The scatterplot shown here enhances the scatterplot from Exercise 6 
by plotting the food products with blue squares. How are the relationships between fat and 
calories the same for the two types of products? How are the relationships different? 


Calories 


13. Manatees Manatees are large, gentle, slow-moving sea creatures found along the coast of 


Florida. Many manatees are injured or killed by boats. Here is a scatterplot showing the 
relationship between the number of boats registered in Florida (in thousands) and the 
number of manatees killed by boats for the years 1977 to 2015.48 Is r > 0 or r < 0? Closer 
to r= 0 orr=+1? Explain your reasoning. 


100 4 
90 + 
80 4 e Us 
70 4 . eae 
60 + > 
50 + 
40 = © 
30 : See 
20 + “"s 
J 

10 +— 

400 500 600 700 800 900 1000 1100 

Boats registered in Florida (1000s) 


Manatees killed 


14. Windy city Is it possible to use temperature to predict wind speed? Here is a scatterplot 


showing the average temperature (in degrees Fahrenheit) and average wind speed (in miles 
per hour) for 365 consecutive days at O’Hare International Airport in Chicago./4 Is r > 0 or 
r < 0? Closer to r = 0 or r = +1? Explain your reasoning. 


Average wind speed (mph) 


0 20 40 60 80 
Average temperature (°F) 
15. pg 162 |») Points and turnovers Here is a scatterplot showing the relationship between 


the number of turnovers and the number of points scored for players in a recent NBA 
season./2 The correlation for these data is r = 0.92. Interpret the correlation. 


Points 


0) 50 100 =150 200 ©6250 = 8=6300 
Turnovers 


16. Oh, that smarts! Infants who cry easily may be more easily stimulated than others. This 
may be a sign of higher IQ. Child development researchers explored the relationship 
between the crying of infants 4 to 10 days old and their IQ test scores at age 3 years. A 
snap of a rubber band on the sole of the foot caused the infants to cry. The researchers 
recorded the crying and measured its intensity by the number of peaks in the most active 20 
seconds. The correlation for these data is r = 0.45.4© Interpret the correlation. 


IQ at age 3 years 
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17. pg 165 |») More turnovers? Refer to Exercise 15. Does the fact that r = 0.92 suggest 


that an increase in turnovers will cause NBA players to score more points? Explain your 
reasoning. 


18. More crying? Refer to Exercise 16. Does the fact that r = 0.45 suggest that making an 
infant cry will increase his or her IQ later in life? Explain your reasoning. 
19. pg 165 |») Hot dogs Are hot dogs that are high in calories also high in salt? The 


following scatterplot shows the calories and salt content (measured in milligrams of 
sodium) in 17 brands of meat hot dogs.12 
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a. The correlation for these data is r = 0.87. Interpret this value. 


b. What effect does the hot dog brand with the smallest calorie content have on the 
correlation? Justify your answer. 
20. All brawn? The following scatterplot plots the average brain weight (in grams) versus 


average body weight (in kilograms) for 96 species of mammals.48 There are many small 
mammals whose points overlap at the lower left. 
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a. The correlation between body weight and brain weight is r = 0.86. Interpret this value. 


b. What effect does the human have on the correlation? Justify your answer. 

21. Dem bones Archaeopteryx is an extinct beast that had feathers like a bird but teeth and a 
long bony tail like a reptile. Only six fossil specimens are known to exist today. Because 
these specimens differ greatly in size, some scientists think they are different species rather 


than individuals from the same species. If the specimens belong to the same species and 
differ in size because some are younger than others, there should be a positive linear 
relationship between the lengths of a pair of bones from all individuals. An outlier from 
this relationship would suggest a different species. Here are data on the lengths (in 
centimeters) of the femur (a leg bone) and the humerus (a bone in the upper arm) for the 


five specimens that preserve both bones:/2 
Femur (x) 38 56 59 64 74 
Humerus (y) 41 63 70 72 84 


a. Make a scatterplot. Do you think that all five specimens come from the same species? 
Explain. 


b. Find the correlation r step by step, using the formula on page 166. Explain how your 
value for r matches your graph in part (a). 
22. Data on dating A student wonders if tall women tend to date taller men than do short 
women. She measures herself, her dormitory roommate, and the women in the adjoining 
dorm rooms. Then she measures the next man each woman dates. Here are the data (heights 


in inches): 
Women (x) 66 64 66 65 70 65 
Men (y) 72 68 70 68 71 65 


a. Make a scatterplot of these data. Describe what you see. 


b. Find the correlation r step by step, using the formula on page 166. Explain how your 
value for r matches your description in part (a). 
23. pg 169 |») More hot dogs Refer to Exercise 19. 


a. Explain why it isn’t correct to say that the correlation is 0.87 mg/cal. 


b. What would happen to the correlation if the variables were reversed on the scatterplot? 
Explain your reasoning. 


c. What would happen to the correlation if sodium was measured in grams instead of 
milligrams? Explain your reasoning. 
24. More brains Refer to Exercise 20. 


a. Explain why it isn’t correct to say that the correlation is 0.86 g/kg. 


b. What would happen to the correlation if the variables were reversed on the scatterplot? 
Explain your reasoning. 


c. What would happen to the correlation if brain weight was measured in kilograms instead 
of grams? Explain your reasoning. 
25. Rank the correlations Consider each of the following relationships: the heights of fathers 
and the heights of their adult sons, the heights of husbands and the heights of their wives, 
and the heights of women at age 4 and their heights at age 18. Rank the correlations 


26. 


ai, 


28. 


20. 


30. 


between these pairs of variables from largest to smallest. Explain your reasoning. 
Teaching and research A college newspaper interviews a psychologist about student 
ratings of the teaching of faculty members. The psychologist says, “The evidence indicates 
that the correlation between the research productivity and teaching rating of faculty 
members is close to zero.” The paper reports this as “Professor McDaniel said that good 
researchers tend to be poor teachers, and vice versa.” Explain why the paper’s report is 
wrong. Write a statement in plain language (don’t use the word correlation) to explain the 
psychologist’s meaning. 

Correlation isn’t everything Marc and Rob are both high school English teachers. 
Students think that Rob is a harder grader, so Rob and Marc decide to grade the same 10 
essays and see how their scores compare. The correlation is r = 0.98, but Rob’s scores are 
always lower than Marc’s. Draw a possible scatterplot that illustrates this situation. 
Limitations of correlation A carpenter sells handmade wooden benches at a craft fair 
every week. Over the past year, the carpenter has varied the price of the benches from $80 
to $120 and recorded the average weekly profit he made at each selling price. The prices of 
the bench and the corresponding average profits are shown in the table. 


Price $80 $90 $100 $110 $120 
Average profit $2400 $2800 $3000 $2800 $2400 


a. Make a scatterplot to show the relationship between price and profit. 


b. The correlation for these data is r = 0. Explain how this can be true even though there is 
a strong relationship between price and average profit. 


Multiple Choice Select the best answer for Exercises 29-34. 


You have data for many years on the average price of a barrel of oil and the average retail 
price of a gallon of unleaded regular gasoline. If you want to see how well the price of oil 
predicts the price of gas, then you should make a scatterplot with as the 
explanatory variable. 


a. the price of oil 

b. the price of gas 

c. the year 

d. either oil price or gas price 


e. time 
In a scatterplot of the average price of a barrel of oil and the average retail price of a gallon 
of gas, you expect to see 


a. very little association. 
b. a weak negative association. 


c. a strong negative association. 


d. a weak positive association. 


e. a strong positive association. 

31. The following graph plots the gas mileage (in miles per gallon) of various cars from the 
same model year versus the weight of these cars (in thousands of pounds). The points 
marked with red dots correspond to cars made in Japan. From this plot, we may conclude 
that 
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there is a positive association between weight and gas mileage for Japanese cars. 


oT p 


the correlation between weight and gas mileage for all the cars is close to 1. 
c. there is little difference between Japanese cars and cars made in other countries. 
d. Japanese cars tend to be lighter in weight than other cars. 


e. Japanese cars tend to get worse gas mileage than other cars. 
If women always married men who were 2 years older than themselves, what would be the 
correlation between the ages of husband and wife? 


a. 2 
b. 1 
c. 0.5 
d. 0 


e. Can’t tell without seeing the data 

The scatterplot shows reading test scores against IQ test scores for 14 fifth-grade children. 
There is one low outlier in the plot. What effect does this low outlier have on the 
correlation? 


32. 


33. 


. It makes the correlation closer to 1. 


a 
b. 


aA 9 


It makes the correlation closer to 0 but still positive. 
It makes the correlation equal to 0. 
It makes the correlation negative. 


It has no effect on the correlation. 
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34. If we leave out the low outlier, the correlation for the remaining 13 points in the preceding 
figure is closest to 


a. —0.95. 
b. -0.65. 

c. 0. 

d. 0.65. 

e. 0.95. 

Recycle and Review 


35. Big diamonds (1.2) Here are the weights (in milligrams) of 58 diamonds from a nodule 
carried up to the earth’s surface in surrounding rock. These data represent a population of 


diamonds formed in a single event deep in the earth.22 

13.8 oA 33.8 11.8 27.0 18.9 19.3 20.8 25.4 Zoid 7.8 
10.9 9.0 9.0 14.4 6.5 iso 5.6 18.5 1.1 11.2 7.0 
7.6 9.0 9.5 7.0 7.6 ue 6.5 5.4 7.2 7.8 ce) 
5.4 5.1 50 3.8 2.1 ZA 4.7 if 3.8 4.9 2.4 
1.4 Oa 4.7 1.5 2.0 0.1 0.1 1.6 a0 cm 2.6 
4.0 20 4.5 


Make a histogram to display the distribution of weight. Describe the distribution. 

36. Fruit fly thorax lengths (2.2) Fruit flies are used frequently in genetic research because of 
their quick reproductive cycle. The length of the thorax (in millimeters) for male fruit flies 
is approximately Normally distributed with a mean of 0.80 mm and a standard deviation of 


0.08 mm.2! 
a. What proportion of male fruit flies have a thorax length greater than 1 mm? 


b. What is the 30th percentile for male fruit fly thorax lengths? 


SECTION 3.2 Least-Squares Regression 


LEARNING TARGETS By the end of the section, you should be able to: 


Make predictions using regression lines, keeping in mind the dangers of 
extrapolation. 


Calculate and interpret a residual. 
Interpret the slope and y intercept of a regression line. 


Determine the equation of a least-squares regression line using technology or 
computer output. 


Construct and interpret residual plots to assess whether a regression model is 
appropriate. 


Interpret the standard deviation of the residuals and r@ and use these values to 
assess how well a least-squares regression line models the relationship 
between two variables. 


Describe how the least-squares regression line, standard deviation of the 
residuals, and r are influenced by outliers. 


Find the slope and y intercept of the least-squares regression line from the 
means and standard deviations of x and y and their correlation. 


Linear (straight-line) relationships between two quantitative variables are fairly common. In 
the preceding section, we found linear relationships in settings as varied as Major League 
Baseball, geysers, and Nobel prizes. Correlation measures the strength and direction of these 
relationships. When a scatterplot shows a linear relationship, we can summarize the overall 
pattern by drawing a line on the scatterplot. A regression line summarizes the relationship 
between two variables, but only in a specific setting: when one variable helps explain the other. 
Regression, unlike correlation, requires that we have an explanatory variable and a response 
variable. 


DEFINITION Regression line 


A regression line is a line that describes how a response variable y changes as 
an explanatory variable x changes. Regression lines are expressed in the form 


y=b0+b1x* = bo + bi Where y(pronounced “y-hat”) is the predicted value of y for 
a given value of x. 


It is common knowledge that cars and trucks lose value the more they are driven. Can we 
predict the price of a used Ford F-150 SuperCrew 4 x 4 truck if we know how many miles it 


has on the odometer? A random sample of 16 used Ford F-150 SuperCrew 4 x 4s was selected 
from among those listed for sale at autotrader.com. The number of miles driven and price (in 
dollars) were recorded for each of the trucks.?2 Here are the data: 


Tim Graham/Alamy 


Miles driven 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 


Price (S$) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price ($) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


Figure 3.6 is a scatterplot of these data. The plot shows a moderately strong, negative linear 
association between miles driven and price. There are no outliers; however, there are two 
distinct clusters of trucks: a group of 12 trucks between 0 and 80,000 miles driven and a group 
of 4 trucks between 120,000 and 160,000 miles driven. The correlation is r = —0.815. The line 
on the plot is a regression line for predicting price from miles driven. 
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FIGURE 3.6 Scatterplot showing the price and miles driven of used Ford F-150s, along with a 
regression line. 


Prediction 


We can use a regression line to predict the value of the response variable for a specific value of 
the explanatory variable. For the Ford F-150 data, the equation of the regression line is 


price’ _ 38,257-0.1629 (miles driven)Price = 38,257 — 0.1629 (miles driven) 


If a used Ford F-150 has 100,000 miles driven, substitute x = 100,000 in the equation. The 
predicted price is 


price’ = 38,257-0.1629 (100,000) = $21,967Price = 38,257 — 0.1629 (100,000) = $21,967 


a 


When we want to refer to the predicted value of a variable, we add a hat on top. Here, price‘PTice 
refers to the predicted price of a used Ford F-150. 


This prediction is illustrated in Figure 3.7. 
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FIGURE 3.7 Using the regression line to predict price for a Ford F-150 with 100,000 miles 
driven. 


Even though the value y=$2 1,967" = $21,967 is unlikely to be the actual price of a truck 
that has been driven 100,000 miles, it’s our best guess based on the linear model using x = 


miles driven. We can also think of §=$21,967" = $21,967 


trucks that have each been driven 100,000 miles. 


as the average price for a sample of 


Can we predict the price of a Ford F-150 with 300,000 miles driven? We can certainly 
substitute 300,000 into the equation of the line. The prediction is 


price\=38,257—-0.1629(300,000)=-$10,613Price = 38,257 — 0.1629(300,000) —— —$10,613 


The model predicts that we would need to pay $10,613 just to have someone take the truck off 
our hands! 


A negative price doesn’t make much sense in this context. Look again at Figure 3.7. A 
truck with 300,000 miles driven is far outside the set of x values for our data. We can’t say 
whether the relationship between miles driven and price remains linear at such extreme values. 
Predicting the price for a truck with 300,000 miles driven is an extrapolation of the 


relationship beyond what the data show. 


DEFINITION Extrapolation 


Extrapolation is the use of a regression line for prediction far outside the interval 
of x values used to obtain the line. Such predictions are often not accurate. 


Few relationships are linear for all values of the explanatory variable. @ Don’t make 


predictions using values of x that are much larger or much smaller than those that 
actually appear in your data. 


EXAMPLE | How much candy can you grab? (>) 
Prediction 


a, | 


Josh Tabor 


PROBLEM: The scatterplot below shows the hand span (in cm) and number of Starburst™ 


candies grabbed by each student when Mr. Tyson’s class did the “Candy grab” activity. The 


= —29.8 + 2.832 


regression line §=-29.8+2.83x" has been added to the scatterplot. 


Number of Starburst candies 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 
a. Andres has a hand span of 22 cm. Predict the number of Starburst™ candies he can grab. 


b. Mr. Tyson’s young daughter McKayla has a hand span of 12 cm. Predict the number of 
Starburst candies she can grab. 


c. How confident are you in each of these predictions? Explain. 
SOLUTION: 


a. Y=-29.842.83(22)" 29.8 + 2.83(22) 


iJ 


§/=32.46 Starburst andj 32.46 Starburst candies 


Don't worry that the predicted number of Starburst candies isn’t an integer. Think 


of 32.46 as the average number of Starburst candies that a group of students, 
each with a hand span of 22 cm, could grab. 


b. Y=-209.8 ee 2.83(12)" — —29.8 = 2.83(12) 


—= —4.16 Starburst candies 


A 


_ A 
Y=-4.16 Starburst candies 


c. The prediction for Andres is believable because x = 22 is within the interval of x-values 
used to create the model. However, the prediction for McKayla 1s not trustworthy 


because oe = 1 is far outside of the x-values used to create the regression line. The linear 


form may not extend to hand spans this small. 


FOR PRACTICE, TRY EXERCISE 37 


Residuals 


In most cases, no line will pass exactly through all the points in a scatterplot. Because we use 
the line to predict y from x, the prediction errors we make are errors in y, the vertical direction 
in the scatterplot. 


Figure 3.8 shows a scatterplot of the Ford F-150 data with a regression line added. The 
prediction errors are marked as bold vertical segments in the graph. These vertical deviations 


represent “leftover” variation in the response variable after fitting the regression line. For that 
reason, they are called residuals. 
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FIGURE 3.8 Scatterplot of the Ford F-150 data with a regression line added. A good regression 
line should make the residuals (Shown as bold vertical segments) as small as possible. 


DEFINITION Residual 
A residual is the difference between the actual value of y and the value of y 
predicted by the regression line. That is, 

residual = actual y — predicted y 


a 
« 


residual=actual y—predicted y=y-y = se 


In Figure 3.8 above, the highlighted data point represents a Ford F-150 that had 70,583 
miles driven and a price of $21,994. The regression line predicts a price of 

price = 38,257-0.1629 (70,583) = $26,759Price = 38,257 — 0.1629 (70,583) = $26,759 
for this truck, but its actual price was $21,994. This truck’s residual is 


residual=actual y—predicted y=y—y4=21,994-26,759=-$4765 
residual = actualy — predicted y 
y-y 
= 21,994 — 26,759 = —$4765 
The actual price of this truck is $4765 less than the cost predicted by the regression line 
with x = miles driven. Why is the actual price less than predicted? There are many possible 


reasons. Perhaps the truck needs body work, has mechanical issues, or has been in an accident. 


EXAMPLE |Can you grab more than expected? (>) 
Calculating and interpreting a residual 


Josh Tabor 


PROBLEM: Here again is the scatterplot showing the hand span (in cm) and number of 
Starburst™ candies grabbed by each student in Mr. Tyson’s class. The regression line is 
yA=-29.8+2.83xY = —29.8 + 2.832, 


Number of Starburst candies 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 


Find and interpret the residual for Andres, who has a hand span of 22 cm and grabbed 36 
Starburst candies. 


SOLUTION: 


YA=-29.8+2.83(22)=32.469 = —29.8 + 2.83(22) = 32.46 crarburst candies 
Residual = 36 — 32.46 = 3,54Residual = 36 — 32.46 = 3.54 Starburst candies 
Andres grabbed 3.54 more Starburst candies than the number predicted by the regression 


line with x = hand span. 


Residual = actualy — predicted y 


Residual=actual y — predicted y=y — y” = y-7y 


FOR PRACTICE, TRY EXERCISE 39 


CHECK YOUR UNDERSTANDING 


. 


y Some data were collected on the weight of a male white laboratory rat for the first 25 
weeks after its birth. A scatterplot of y = weight (in grams) and x = time since birth (in 
weeks) shows a fairly strong, positive linear relationship. The regression equation 
y\=100+40xY = 100 + 402 models the data fairly well. 


1. Predict the rat’s weight at 16 weeks old. 
2. Calculate and interpret the residual if the rat weighed 700 grams at 16 weeks old. 


3. Should you use this line to predict the rat’s weight at 2 years old? Use the equation to 
make the prediction and discuss your confidence in the result. (There are 454 grams in a 


pound.) 


Interpreting a Regression Line 


A regression line is a model for the data, much like the density curves of Chapter 2. The y 
intercept and slope of the regression line describe what this model tells us about the 
relationship between the response variable y and the explanatory variable x. 


DEFINITION $y intercept, Slope 


In the regression equation y*=b0+b1x:¥ = 50 + 6X : 


e bo is the y intercept, the predicted value of y when x = 0 


e b, is the slope, the amount by which the predicted value of y changes when x 
increases by 1 unit 


You are probably accustomed to the form y = mx + b for the equation of a line. However, 
statisticians prefer the form yA=b0+b1x¥ = 0 + 61% because it extends easily to settings where 


multiple explanatory variables are used to predict the value of a response variable. Either form 
can be used, but we will stick with y\=b0+b1x¥ = 4 + 412, In both cases, the slope is the 
coefficient of x. 


Many graphing calculators use the form yA=atbx¥ = a+ b2 to avoid using subscripts. The slope 


is always the coefficient of x, no matter which form is used. 


Let’s return to the Ford F-150 data. The equation of the regression line for these data is 
y\=38,257-0.1629xY = 38, 257 — 0.1629, | where x = miles driven and y = price. The slope 
b1= -0.1629%1= — 9.1629 tells us that the predicted price of a used Ford F-150 goes down by 
$0.1629 (16.29 cents) for each additional mile that the truck has been driven. The y intercept 
bO= 38,2570= 38,257 is the predicted price (in dollars) of a used Ford F-150 that has been 
driven 0 miles. 


AP® EXAM TIP 


When asked to interpret the slope or y intercept, it is very important to include the 
word predicted (or equivalent) in your response. Otherwise, it might appear that you 
believe the regression equation provides actual values of y. 


The slope of a regression line is an important numerical description of the relationship 
between the two variables. Although we need the value of the y intercept to draw the line, it is 
statistically meaningful only when the explanatory variable can actually take values close to 0, 
as in the Ford F-150 data. In other cases, using the regression line to make a prediction for x = 
O is an extrapolation. 


EXAMPLE |Grabbing more candy (>) 
Interpreting the slope and y intercept 


PROBLEM: The scatterplot shows the hand span (in cm) and number of Starburst™ 
candies grabbed by each student in Mr. Tyson’s class, along with the regression line 
yA=-29.8+2.83xY = —29.8 + 2.832 


a. Interpret the slope of the regression line. 


b. Does the value of the y intercept have meaning in this context? If so, interpret the y 
intercept. If not, explain why. 


Number of Starburst candies 


17 18 19 20 21 22 23 24 25 


Hand span (cm) 
SOLUTION: 


a. The predicted number of Starburst candies grabbed goes up by 2.83 for each increase of 


1 cm in hand span. 


Remember that the slope describes how the predicted value of y changes, not the 
actual value of y. 


b. The y intercept does not have meaning in this case, as it is impossible to have a hand 
span of ocm. 


Predicting the number of Starburst candies when x = 0 is an extrapolation—and 


results in an unrealistic prediction of -29.8. 


FOR PRACTICE, TRY EXERCISE 41 


For the Ford F-150 data, the slope b1= —0.162991= ~ 9.1629 is very close to 0. This does 
not mean that change in miles driven has little effect on price. The size of the slope depends on 
the units in which we measure the two variables. In this setting, the slope is the predicted 
change in price (in dollars) when the distance driven increases by 1 mile. There are 100 cents in 
a dollar. If we measured price in cents instead of dollars, the slope would be 100 times steeper, 
b1= -16.29%1= ~ 16.29. You can’t say how strong a relationship is by looking at the slope of 
the regression line. 


CHECK YOUR UNDERSTANDING 


\ 


y Some data were collected on the weight of a male white laboratory rat for the first 25 
weeks after its birth. A scatterplot of y = weight (in grams) and x = time since birth (in 
weeks) shows a fairly strong, positive linear relationship. The regression equation 

y\ = 100 + 40x¥ = 100 + 402 models the data fairly well. 


1. Interpret the slope of the regression line. 


2. Does the value of the y intercept have meaning in this context? If so, interpret the y 
intercept. If not, explain why. 


The Least-Squares Regression Line 


There are many different lines we could use to model the association in a particular scatterplot. 
A good regression line makes the residuals as small as possible. 


Price ($) 


(a) 


Duncan Selby/Alamy 


In the F-150 example, the regression line we used is y4 = 38,257 — 0.1629x 

y = 38,257 — 0.16292, How does this line make the residuals “as small as possible”? Maybe 
this line minimizes the sum of the residuals. If we add the prediction errors for all 16 trucks, the 
positive and negative residuals cancel out, as shown in Figure 3.9(a). That’s the same issue we 
faced when we tried to measure deviation around the mean in Chapter 1. We’ll solve the 
current problem in much the same way—by squaring the residuals. 
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FIGURE 3.9 Scatterplots of the Ford F-150 data with the regression line added. (a) The 
residuals will add to approximately O when using a good regression line. (b) A good regression 
line should make the sum of squared residuals as small as possible. 


A good regression line will have a sum of residuals near 0. But the regression line we prefer 
is the one that minimizes the sum of the squared residuals. That’s what the line shown in Figure 
3.9(b) does for the Ford F-150 data, which is why we call it the least-squares regression line. 
No other regression line would give a smaller sum of squared residuals. 


DEFINITION Least-squares regression line 


The least-squares regression line is the line that makes the sum of the squared 
residuals as small as possible. 


Your calculator or statistical software will give the equation of the least-squares line from 
data that you enter. Then you can concentrate on understanding and using the regression line. 


AP® EXAM TIP 


When displaying the equation of a least-squares regression line, the calculator will 
report the slope and intercept with much more precision than we need. There is no 


firm rule for how many decimal places to show for answers on the AP® Statistics 
exam. Our advice: decide how much to round based on the context of the problem 
you are working on. 


9. Technology Corner | CALCULATING LEAST-SQUARES 
REGRESSION LINES |») 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


Let’s use the Ford F-150 data to show how to find the equation of the least-squares 
regression line on the TI-83/84. Here are the data again: 


Miles driven 70,583 129,484 29,932 29,953 24495 75,678 8359 4447 


Price (S$) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price ($) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


1. Enter the miles driven data into L1 and the price data into L2. 


2. To determine the least-squares regression line, press |STAT| choose CALC and then 
LinReg(a+bx). Note: The TI-83/84 uses the form y\=a+bx¥ =4@ + 62 instead of 


y\ =b0 + blx¥ = 50 + 2, but they are equivalent. 
e OS 2.55 or later: In the dialog box, enter the following: Xlist:L1, Ylist:L2, FreqList 
(leave blank), Store RegEQ (leave blank), and choose Calculate. 
e Older OS: Finish the command to read LinReg(at+bx) L1,L2 and press. 
y=atbx 


a=38257.13507 
b=~. 1629185531 


r?=,664247901 
r=~. 8150140496 


Note: If r2 and r do not appear on the TI-83/84 screen, do this one-time series of keystrokes: 


e OS 2.55 or later: Press and set STAT DIAGNOSTICS to ON. Then redo Step 2 


to calculate the least-squares line. The r2 and r values should now appear. 


e Older OS: Press [o| (CATALOG), scroll down to DiagnosticOn, and press 


ENTER|. Press |ENTER| again to execute the command. The screen should say “Done.” 


Then redo Step 2 to calculate the least-squares line. The r2 and r values should now 
appear. 
To graph the least-squares regression line on the scatterplot: 


1. Set up a scatterplot (see Technology Corner 8 on page 159). 
2. Press and enter the equation of the least-squares regression line in Y 1. 


3. Press |ZOOM| and choose ZoomStat to see the scatterplot with the least-squares regression 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


line. 


Note: When you calculate the equation of the least-squares regression line, you can have the 


calculator store the equation to Y1. When setting up the calculation, enter Y1 for the 
StoreRegEq prompt blank (OS 2.55 or later) or use the following command (older OS): 
LinReg(a+bx) L1,L2,Y1. Y1 is found by pressing and selecting Y-VARS, then 
Function, then Y1. 


Determining if a Linear Model Is Appropriate: Residual Plots 


One of the first principles of data analysis is to look for an overall pattern and for striking 
departures from the pattern. A regression line describes the overall pattern of a linear 
relationship between an explanatory variable and a response variable. We see departures from 
this pattern by looking at a residual plot. 


DEFINITION Residual plot 


A residual plot is a scatterplot that displays the residuals on the vertical axis and 
the explanatory variable on the horizontal axis. 


Some software packages prefer to plot the residuals against the predicted values ¥ instead of 


against the values of the explanatory variable. The basic shape of the two plots is the same 
because ¥ is linearly related to x. 


Residual plots help us assess whether or not a linear model is appropriate. In Figure 3.10(a), 
the scatterplot shows the relationship between the average income (gross domestic product per 
person, in dollars) and fertility rate (number of children per woman) in 187 countries, along 
with the least-squares regression line. The residual plot in Figure 3.10(b) shows the average 
income for each country and the corresponding residual. 
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FIGURE 3.10 The (a) scatterplot and (b) residual plot for the relationship between fertility rate 
and average income for a sample of countries. 


The least-squares regression line clearly doesn’t fit this association very well! For most 
countries with average incomes under $5000, the actual fertility rates are greater than predicted, 


resulting in positive residuals. For countries with average incomes between $5000 and $60,000, 
the actual fertility rates tend to be smaller than predicted, resulting in negative residuals. 
Countries with average incomes above $60,000 all have fertility rates greater than predicted, 
again resulting in positive residuals. This U-shaped pattern in the residual plot indicates that the 
linear form of our model doesn’t match the form of the association. A curved model might be 
better in this case. 


In Figure 3.11(a), the scatterplot shows the Ford F-150 data, along with the least-squares 
regression line. The corresponding residual plot is shown in Figure 3.11(b). 
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FIGURE 3.11 The (a) scatterplot and (b) residual plot for the relationship between price and 
miles driven for Ford F-150s. 


Looking at the scatterplot, the line seems to be a good fit for this relationship. You can 
“see” that the line is appropriate by the lack of a leftover curved pattern in the residual plot. In 
fact, the residuals look randomly scattered around the residual = 0 line. 


HOW TO INTERPRET A RESIDUAL PLOT 


To determine whether the regression model is appropriate, look at the residual plot. 


e If there is no leftover curved pattern in the residual plot, the regression model is 
appropriate. 


e If there is a leftover curved pattern in the residual plot, consider using a regression model 
with a different form. 


EXAMPLE | Pricing diamonds (>) 
Interpreting a residual plot 


JG\/Getty Images 


PROBLEM: Is a linear model appropriate to describe the relationship between the weight 
(in carats) and price (in dollars) of round, clear, internally flawless diamonds with excellent 
cuts? We calculated a least-squares regression line using x = weight and y = price and made 
the corresponding residual plot shown.22 Use the residual plot to determine if the linear 
model is appropriate. 


100,000 


50,000 


Residual ($) 


—50,000 


—100,000 


Weight (carats) 


SOLUTION: 


The linear model relating price to carat weight 1s not appropriate because there is a ‘U- 


shaped pattern left over in the residual plot. 


FOR PRACTICE, TRY EXERCISE 47 


Think About It 


WHY DO WE LOOK FOR PATTERNS IN RESIDUAL PLOTS? The word residual 
comes from the Latin word residuum, meaning “left over.” When we calculate a residual, we 


are calculating what is left over after subtracting the predicted value from the actual value: 
residual = actual y — predicted yresidual = actual y — predicted y 


Likewise, when we look at the form of a residual plot, we are looking at the form that is left 
over after subtracting the form of the model from the form of the association: 

form of residual plot = form of association — form of model 

form of residual plot = form of association — form of model 
When there is a leftover form in the residual plot, the form of the association and form of the 
model are not the same. However, if the form of the association and form of the model are 
the same, the residual plot should have no form, other than random scatter. 


10. Technology Corner |MAKING RESIDUAL PLOTS (@& 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tpsé6e. 

Let’s continue the analysis of the Ford F-150 miles driven and price data from Technology 
Corner 9 (page 184). You should have already made a scatterplot, calculated the equation of 
the least-squares regression line, and graphed the line on the scatterplot. Now, we want to 
calculate residuals and make a residual plot. Fortunately, your calculator has already done 
most of the work. Each time the calculator computes a regression line, it computes the 
residuals and stores them in a list named RESID. 


1. Set up a scatterplot in the statistics plots menu. 


° Press (STAT PLOT). 
e Press to |1| go into Plott. 


e Adjust the settings as shown. The RESID list is found in the List menu by pressing 
[STAT] Note: You have to calculate the equation of the least-squares regression 
line using the calculator before making a residual plot. Otherwise, the RESID list will 
include the residuals from a different least-squares regression line. 

press (<iGR 01 To scuect an orrzoN Ll] 
PRESS (<] OR [>] TO SELECT AN OPTION 


Plot2 Plots 


Ylist:RESID 
Mark :‘fl + - 


Color: Mts > 


2. Use ZoomStat to let the calculator choose an appropriate window. 


e Press |ZOOM| and choose 9: ZoomStat. 


Note: If you want to see the values of the residuals, you can have the calculator put them in 
L3 (or any list). In the list editor, highlight the heading of L3, choose the RESID list from 


the LIST menu, and press |ENTER|. 


CHECK YOUR UNDERSTANDING 


\ 


y In Exercises 3 and 7, we asked you to make and describe a scatterplot for the hiker data 
shown in the table. 


Body weight (Ib) 120" 457 109" "103 431, 465. 158 116 
Backpack weight (Ib) 26 30 26 24 29 35 31 28 


1. Calculate the equation of the least-squares regression line. 
2. Make a residual plot for the linear model in Question 1. 


3. What does the residual plot indicate about the appropriateness of the linear model? 
Explain your answer. 


How Well the Line Fits the Data: The Role of s and /* in Regression 


We use a residual plot to determine if a least-squares regression line is an appropriate model for 
the relationship between two variables. Once we determine that a least-squares regression line 
is appropriate, it makes sense to ask a follow-up question: How well does the line work? That 
is, if we use the least-squares regression line to make predictions, how good will these 


predictions be? 


THE STANDARD DEVIATION OF THE RESIDUALS We already know that a residual 


measures how far an actual y value is from its corresponding predicted value y. Earlier in this 
section, we calculated the residual for the Ford F-150 with 70,583 miles driven and price 
$21,994. As shown in Figure 3.12, the residual was —-$4765, meaning that the actual price was 
$4765 less than we predicted. 
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FIGURE 3.12 Scatterplot of the Ford F-150 data with a regression line added. Residuals for 
each truck are shown with vertical line segments. 


To assess how well the line fits all the data, we need to consider the residuals for each of 


the trucks, not just one. Here are the residuals for all 16 trucks: 


—4765 —7664 —3506 8617 1728 3057 —5004 458 
2289 1183 —8121 6858 2110 nof2 —5973 —2857 


Using these residuals, we can estimate the “typical” prediction error when using the least- 
squares regression line. To do this, we calculate the standard deviation of the residuals s. 


DEFINITION Standard deviation of residuals s 


The standard deviation of the residuals s measures the size of a typical 
residual. That is, S measures the typical distance between the actual y values and 
the predicted y values. 


To calculate s, use the following formula: 
s=sum of squared residualsn—2=)' (yi-y/i)2n-2 


' PN . CES: WEG IM, seers, oe 
/sum of squared residuals 


a i 
For the Ford F-150 data, the standard deviation of the residuals is 
s=(—4765)2+(—7664)2+---+(—2857)216-2=461,264,13614=$5740 


| (4765)? +(—7664)?+---+(—2857)? [461,264,136 
= i6—2 VO 44 
= $5740 


Interpretation: The actual price of a Ford F-150 is typically about $5740 away from the price 
predicted by the least-squares regression line with x = miles driven. If we look at the residual 
plot in Figure 3.11, this seems like a reasonable value. Although some of the residuals are close 
to 0, others are close to $10,000 or —$10,000. 


Think About It 


DOES THE FORMULA FOR s LOOK SLIGHTLY FAMILIAR? It should. In Chapter 1, 
we defined the standard deviation of a set of quantitative data as 


= ae = 
Sx=)' (xi-x )2n-1 


We interpreted the resulting value as the “typical” distance of the data points from the 
mean. In the case of two-variable data, we’re interested in the typical (vertical) distance of 
the data points from the regression line. We find this value in much the same way: first add 
up the squared deviations, then average them (again, in a funny way), and take the square 
root to get back to the original units of measurement. 


THE COEFFICIENT OF DETERMINATION r? There is another numerical quantity that 
tells us how well the least-squares line predicts values of the response variable y. It is r2, the 
coefficient of determination. Some computer packages call it “R-sq.” You may have noticed 
this value in some of the output that we showed earlier. Although it’s true that r2 is equal to the 
square of the correlation r, there is much more to this story. 


DEFINITION The coefficient of determination r 


The coefficient of determination r2 measures the percent reduction in the sum 
of squared residuals when using the least-squares regression line to make 
predictions, rather than the mean value of y. In other words, r2 measures the 
percent of the variability in the response variable that is accounted for by the 
least-squares regression line. 


Holly Albrecht 


Suppose we wanted to predict the price of a particular used Ford F-150, but we didn’t know 
how many miles it had been driven. Our best guess would be the average cost of a used Ford F- 
150, y = $27,834Y = $27,834, Of course, this prediction is unlikely to be very good, as the 
prices vary quite a bit from the mean (sy=$9570°" — $9570), If we knew how many miles the 
truck had been driven, we could use the least-squares regression line to make a better 
prediction. How much better are predictions that use the least-squares regression line with x = 


miles driven, rather than predictions that use only the average price? The answer is r. 


The scatterplot in Figure 3.13(a) shows the squared residuals along with the sum of squared 
residuals (approximately 1,374,000,000) when using the average price as the predicted value. 
The scatterplot in Figure 3.13(b) shows the squared residuals along with the sum of squared 
residuals (approximately 461,300,000) when using the least-squares regression line with x = 
miles driven to predict the price. Notice that the squares in part (b) are quite a bit smaller. 
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FIGURE 3.13 (a) The sum of squared residuals is about 1,374,000,000 if we use the mean price 
as our prediction for all 16 trucks. (b) The sum of squared residuals from the least-squares 
regression line is about 461,300,000. 


To find r2, calculate the percent reduction in the sum of squared residuals: 
12=1,374,000,000-—461,300,0001,374,000,000=912,700,0001,374,000,000=0.66 
» _ 1,374,000,000—461,300,000 = 912,700,000 _ 1.66 
= 1,374,000,000 ~ 1,374,000,000 — 


The sum of squared residuals has been reduced by 66%. 


Interpretation: About 66% of the variability in the price of a Ford F-150 is accounted for by the 
least-squares regression line with x = miles driven. The remaining 34% is due to other factors, 
including age, color, and condition. 

If all the points fall directly on the least-squares line, the sum of squared residuals is 0 and 
r2 = 1. Then all the variation in y is accounted for by the linear relationship with x. In the 
worst-case scenario, the least-squares line does no better at predicting y than y=y ¥ = Y does. 
Then the two sums of squared residuals are the same and r2 = 0. 


It’s fairly remarkable that the coefficient of determination r? is actually the square of the 


correlation. This fact provides an important connection between correlation and regression. 
When you see a linear association, square the correlation to get a better feel for how well the 
least-squares line fits the data. 


Think About It 


WHAT’S THE RELATIONSHIP BETWEEN s AND 2? Both s and r? are calculated 
from the sum of squared residuals. They also both measure how well the line fits the data. 
The standard deviation of the residuals reports the size of a typical prediction error, in the 
same units as the response variable. In the truck example, s=$5740s = $5740, The value of 
r2, however, does not have units and is usually expressed as a percentage between 0% and 
100%, such as r2=66%r” = 66%. Because these values assess how well the line fits the data 
in different ways, we recommend you follow the example of most statistical software and 
report both. 


Knowing how to interpret s and r2 is much more important than knowing how to calculate 
them. Consequently, we typically let technology do the calculations. 


EXAMPLE |Grabbing candy, again (>) 
Interpreting s and r2 


Kylie McManis 


PROBLEM: The scatterplot shows the hand span (in centimeters) and number of 
Starburst™ candies grabbed by each student in Mr. Tyson’s class, along with the regression 
line yA=-29.8+2.83xY = —29.8 + 2.83”. For this model, technology gives s = 4.03 and r2 = 
0.697. 

a. Interpret the value of s. 


b. Interpret the value of r2. 


Number of Starburst candies 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 


SOLUTION: 
a. The actual number of Starburst™ candies grabbed is typically about 4.03 away from 


the number predicted by the least-squares regression line with = hand span. 


b. About 69.7% of the variability in number of Starburst candies grabbed is accounted for 
by the least-squares regression line with x = hand span. 


FOR PRACTICE, TRY EXERCISE 55 


Interpreting Computer Regression Output 


Figure 3.14 displays the basic regression output for the Ford F-150 data from two statistical 
software packages: Minitab and JMP. Other software produces very similar output. Each output 
records the slope and y intercept of the least-squares line. The software also provides 
information that we don’t yet need, although we will use much of it later. Be sure that you can 
locate the slope, the y intercept, and the values of s (called root mean square error in JMP) and 
r2 on both computer outputs. Once you understand the statistical ideas, you can read and work 
with almost any software output. 


Minitab JMP 
Summary of Fit F r 
Slope y intercept RSquare 0.664248 Standard 
Predictor \ ‘ coef / SE Coef T P RSquare Adj 0.640266 |deviation 
Constant _38257| 2446 15.64 0.000 Root Mean Square Error of the residuals 
Miles Driven -~0.16292 0.03096 -5.26 0.000 Mean of Response 27833.69 
. r2 Observations (or Sum Wgts) 16 
S = 5740.13 R-Sq = 66.4% R-Sq(adj) = 64.0% 
a Parameter Estimates 
Standard deviation of the residuals Term Estimate = Std Error t Ratio Prob>|t| 
Intercept | 38257.135]| 2445.813 15.64 <.0001 
Miles Driven /(-0.162919] 0.030956 -5.26 0.0001 
y intercept Slope 


FIGURE 3.14 Least-squares regression results for the Ford F-150 data from Minitab and JMP 
statistical software. Other software produces similar output. 


EXAMPLE | Using feet to predict height (>) 
Interpreting regression output 


© Fancy/Alamy 


PROBLEM: A random sample of 15 high school students was selected from the U.S. 
Census At School database. The foot length (in centimeters) and height (in centimeters) of 
each student in the sample were recorded. Here are a scatterplot with the least-squares 
regression line added, a residual plot, and some computer output: 


195 
190 - 


Height (cm) 
a 


Foot length (cm) 


Residual (cm) 


22 24 26 28 30 

Foot length (cm) 
Predictor Coef SE Coef T P 
Constant 103.4100 19.5000 5.30 0.000 
Foot length 2.7469 0.7833 6,5 0.004 
S = 7.95126 R-Sq = 48.6% R-Sq(adj) = 44.7% 


a. Is a line an appropriate model to use for these data? Explain how you know. 
b. Find the correlation. 


c. What is the equation of the least-squares regression line that describes the relationship 
between foot length and height? Define any variables that you use. 


d. By about how much do the actual heights typically vary from the values predicted by the 
least-squares regression line with x = foot length? 

SOLUTION: 

a. Because the scatterplot shows a linear association and the residual plot has no obvious 
leftover curved patterns, a line is an appropriate model to use for these data. 

6. 1=0.486=0.697" = V0.486 = 0.697 


The correlation r is the square root of r?, where r? is a value between 0 and 1. 
Because the square root function on your calculator will always give a positive 


result, make sure to consider whether the correlation is positive or negative. If the 
slope is negative, so is the correlation. 


e height’ = 103.41 + 2.7469 (foot length)height = 103.41 + 2.7469 (foot length) 


We could also write the equation as y*=103.41+2.7469xY = 103.41 + 2.74692, 


where y = predicted height (cm) and x is foot length (cm). 


d. s =7.95, so the actual heights typically vary by about 7.95 cm from the values 
predicted by the regression line with x = foot length. 


FOR PRACTICE, TRY EXERCISE 59 


CHECK YOUR UNDERSTANDING 


V In Section 3.1, you read about the Old Faithful geyser in Yellowstone National Park. 
The computer output shows the results of a regression of y = interval of time until the next 
eruption (in minutes) and x = duration of the most recent eruption (in minutes) for each 
eruption of Old Faithful in a particular month. 


Summary of Fit 


RSquare 0.853725 

RSquare Adj 0.853165 

Root Mean Square Error 6.493357 

Mean of Response 77.543730 

Observations (or Sum 263 .000000 

wgts) 
Parameter Estimates 
Term Estimate Std Error t Ratio Prob>|t | 
Intercept 33.347442 1.201081 27.76 <.0001* 
Duration 13.285406 0.340393 39.03 <.0001* 


1. What is the equation of the least-squares regression line that describes the relationship 
between interval and duration? Define any variables that you use. 


2. Interpret the slope of the least-squares regression line. 


3. Identify and interpret the standard deviation of the residuals. 


4. What percent of the variability in interval is accounted for by the least-squares regression 
line with x = duration? 


Regression to the Mean 


Using technology is often the most convenient way to find the equation of a least-squares 
regression line. It is also possible to calculate the equation of the least-squares regression line 
using only the means and standard deviations of the two variables and their correlation. 
Exploring this method will highlight an important relationship between the correlation and the 
slope of a least-squares regression line—and reveal why we include the word regression in the 
expression least-squares regression line. 


HOW TO CALCULATE THE LEAST-SQUARES REGRESSION LINE 
USING SUMMARY STATISTICS 


We have data on an explanatory variable x and a response variable y for n individuals. From 
the data, calculate the means x and y ¥ and the standard deviations s, and sy of the two 


variables and their correlation r. The least-squares regression line is the line y\=b0+b1x 
y = bo + 61 @ with slope 


and y intercept 
b0=y —-bix 6% = 9 — hz 


The formula for the slope reminds us that the distinction between explanatory and response 
variables is important in regression. Least-squares regression makes the distances of the data 
points from the line small only in the y direction. If we reverse the roles of the two variables, 
the values of s, and sy will reverse in the slope formula, resulting in a different least-squares 
regression line. This is not true for correlation: switching x and y does not affect the value of r. 

The formula for the y intercept comes from the fact that the least-squares regression line 
always passes through the point (xy ¥). Once we know the slope (b;) and that the line 
goes through the point (xy) y), we can use algebra to solve for the y intercept. 
Substituting (xy ¥) into the equation y\=b0+b1x¥ = 40 + 61% produces the equation 
y =b0+b1ix ¥ = 50 + 51%, Solving this equation for bg gives the equation shown in the 
definition box, bo=y —b1x 6» = ¥ — 51%, To see how these formulas work in practice, let’s 
look at an example. 


EXAMPLE |More about feet and height (>) 
Calculating the least-squares regression line 


panpote/Getty Images 


PROBLEM: In the preceding example, we used data from a random sample of 15 high 
school students to investigate the relationship between foot length (in centimeters) and 
height (in centimeters). The mean and standard deviation of the foot lengths are x _=24.76 
& = 24.76 and s, = 2.71. The mean and standard deviation of the heights are y =171.43 


y = 171.43 and sy = 10.69. The correlation between foot length and height is r = 0.697. Find 


the equation of the least-squares regression line for predicting height from foot length. 
SOLUTION: 


b; = 0.697- 
b1=0.69710.692.71=2.75 2.71 


b0=171.43-2.75(24.76)=103.34% = 171.43 — 2.75 (24.76) = 103.34 


b1=rsysx 
b0=y -bix bo = 9 — hz 


The equation of the least-squares regression line is yA=103.3442.75xY = 103.34 + 2.752, 


FOR PRACTICE, TRY EXERCISE 63 


There is a close connection between the correlation and the slope of the least-squares 
regression line. The slope is 


bh =r—= 
b1=rsysx=r-sysx Sr 8z 
This equation says that along the regression line, a change of 1 standard deviation in x 
corresponds to a change of r standard deviations in y. When the variables are perfectly 
correlated (r = 1 or r = —1), the change in the predicted response y¥ is the same (in standard 
deviation units) as the change in x. For example, if r = 1 and x is 2 standard deviations above 
x #, then the corresponding value of y¥ will be 2 standard deviations above y ¥. 


However, if the variables are not perfectly correlated (-1 < r < 1), the change in y/¥ is less 
than the change in x, when measured in standard deviation units. To illustrate this property, 
let’s return to the foot length and height data from the preceding example. 

Figure 3.15 shows the scatterplot of height versus foot length and the regression line 
y\=103.34+2.75xY = 103.34 + 2.752 We have added four more lines to the graph: a vertical 
line at the mean foot length x , a vertical line at x +sx® + Sx (1 standard deviation above the 
mean foot length), a horizontal line at the mean height y ¥, and a horizontal line at y +sy 
Y + By (1 standard deviation above the mean height). 


y = mean height yp = 103.34 + 2.75x 
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3 


x = mean foot length 
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FIGURE 3.15 Scatterplot showing the relationship between foot length and height for a sample 
of students, along with lines showing the means of x and y and the values 1 standard deviation 
above each mean. 


When a student’s foot length is 1 standard deviation above the mean foot length x @, the 
predicted height y/¥ is above the mean height y ¥—but not an entire standard deviation above 
the mean. How far above the mean is the value of y\¥? 

From the graph, we can see that 


changeiny ?? 
b; = slope =————— = — 
b1=slope=change in ychange in x=??sx changeing sx 


From earlier, we know that 


Psy 


1 
b1=r-sysx $x 
Setting these two equations equal to each other, we have 


2? rsy 


PPsx=rsysx $a 8a 


Thus, y\¥ must be r - Sy above the mean y ¥. 


In other words, for an increase of 1 standard deviation in the value of the explanatory 
variable x, the least-squares regression line predicts an increase of only r standard deviations in 
the response variable y. When the correlation isn’t r = 1 or r = -1, the predicted value of y is 
closer to its mean y Y than the value of x is to its mean x ©. This is called regression to the 
mean, because the values of y “regress” to their mean. 


Sir Francis Galton (1822-1911) is often credited with discovering the idea of regression to 
the mean. He looked at data on the heights of children versus the heights of their parents. He 
found that taller-than-average parents tended to have children who were taller than average but 
not quite as tall as their parents. Likewise, shorter-than-average parents tended to have children 
who were shorter than average but not quite as short as their parents. Galton used the symbol r 
for the correlation because of its important relationship to regression. 


Correlation and Regression Wisdom 


Correlation and regression are powerful tools for describing the relationship between two 
variables. When you use these tools, you should be aware of their limitations. 


CORRELATION AND REGRESSION LINES DESCRIBE ONLY LINEAR 
RELATIONSHIPS You can calculate the correlation and the least-squares line for any 
relationship between two quantitative variables, but the results are useful only if the scatterplot 
shows a linear pattern. Always plot your data first! 


The following four scatterplots show very different relationships. Which one do you think 
shows the greatest correlation? 


8 10 12 14 16 18 20 


Answer: All four have the same correlation, r = 0.816. Furthermore, the least-squares 
regression line for each relationship is exactly the same, y\=3+0.5x¥ = 3 + 0.52, These four 
data sets, developed by statistician Frank Anscombe, illustrate the importance of graphing data 
before doing calculations.24 
CORRELATION AND LEAST-SQUARES REGRESSION LINES ARE NOT 
RESISTANT You already know that the correlation r is not resistant. One unusual point in a 
scatterplot can greatly change the value of r. Is the least-squares line resistant? The following 
activity will help you answer this question. 


ACTIVITY Investigating properties of the least-squares regression 


pePLe> 


line ¢€ 


In this activity, you will use the Correlation and Regression applet to explore some 
properties of the least-squares regression line. 


Points: 


10 


Correlation coefficient: 
0.4000 
@ = sat 
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@ Show least-squares line = 
@ Show mean X & Y lines 


I} Show residuals 


1. Launch the applet at highschool.bfwpub.com/tps6e. 

2. Click on the graphing area to add 10 points in the lower-left corner so that the correlation 
is about r = 0.40. Also, check the boxes to show the “Least-Squares Line” and the “Mean 
X & Y” lines as in the screen shot. Notice that the least-squares regression line goes 
though the point (xy )* y). 

3. If you were to add a point on the least-squares regression line at the right edge of the 
graphing area, what do you think would happen to the least-squares regression line? Add 
the point to see if you were correct. 


4. Click on the point you just added, and drag it up and down along the right edge of the 
graphing area. What happens to the least-squares regression line? 


5. Now, move this point so that it is on the vertical x line. Drag the point up and down on 
the x £ line. What happens to the least-squares regression line? Do outliers in the vertical 
direction have as much influence on the least-squares regression line as outliers in the 
horizontal direction? 


6. Briefly summarize how outliers influence the least-squares regression line. 


Least-squares regression lines make the sum of the squared residuals as small as possible, 
so outliers can be very influential in regression calculations. Depending on where they are, 
outliers can greatly affect the equation of the least-squares regression line, along with other 
summary statistics such as r, r2, and s. The best way to investigate the influence of outliers is to 
do regression calculations with and without the outlier and see how much the results differ. 

Does the age at which a child begins to talk predict a later score on a test of mental ability? 
A study of the development of young children recorded the age in months at which each of 21 
children spoke their first word and their Gesell Adaptive Score, the result of an aptitude test 
taken much later.22 A scatterplot of the data appears in Figure 3.16, along with a residual plot, 
and computer output. Two outliers, child 18 and child 19, are indicated on each plot. 
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FIGURE 3.16 (a) Scatterplot of Gesell Adaptive Scores versus the age at first word for 21 
children, along with the least-squares regression line. (6) Residual plot for the linear model. 
Child 18 and Child 19 are outliers. Each purple point in the graphs stands for two individuals. 


Clover No.7 Photography/Getty Images 


Child 19 has a very large residual because this point lies far from the regression line. 
However, Child 18’s point is close to the line and has a small residual. How do these two 
outliers affect the regression? Figure 3.17 shows the results of removing each of these points on 
the equation of the least-squares regression line, the standard deviation of the residuals, and r2. 
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FIGURE 3.17 Three least-squares regression lines of Gesell score on age at first word. The 
green line is calculated from all the data. The dark blue line is calculated leaving out only Child 
18. The red line is calculated leaving out only Child 19. 


You can see that removing the point for Child 18 moves the line quite a bit. Because of 
Child 18’s extreme position on the age (x) scale, removing this point makes the slope closer to 
O and the y intercept smaller. Removing Child 18 also increases the standard deviation of the 
residuals because its small residual was making the typical distance from the regression line 
smaller. Finally, removing Child 18 also decreases r2 (and makes the correlation closer to 0) 
because the linear association is weaker without this point. 


Child 19’s Gesell score was far above the least-squares regression line, but this child’s age 
(17 months) is very close to x _=14.4% = 14.4 months. Thus, removing Child 19 has very little 
effect on the least-squares regression line. The line shifts down slightly from the original 
regression line, but not by much. Child 19 has a bigger influence on the standard deviation of 
the residuals: without Child 19’s big residual, the size of the typical residual goes from s = 
11.02 to s = 8.63. Likewise, without Child 19, the strength of the linear association increases 
and r2 goes from 0.410 to 0.572. 


Think About It 


WHAT SHOULD WE DO WITH OUTLIERS? The strong influence of Child 18 makes 
the original regression of Gesell score on age at first word misleading. The original data 
have r2 = 0.41. That is, the least-squares line with x = age at which a child begins to talk 
accounts for 41% of the variability in Gesell score. This relationship is strong enough to be 
interesting to parents. If we leave out Child 18, r2 drops to only 11%. The apparent strength 
of the association was largely due to a single influential observation. 

What should the child development researcher do? She must decide whether Child 18 is 
so slow to speak that this individual should not be allowed to influence the analysis. If she 


excludes Child 18, much of the evidence for a connection between the age at which a child 
begins to talk and later ability score vanishes. If she keeps Child 18, she needs data on other 
children who were also slow to begin talking, so the analysis no longer depends as heavily 


on just one child. 


EXAMPLE | Dodging the pattern? (») 
Outliers and influential points 


Robert J Daveant/Shutterstock.com 


PROBLEM: The scatterplot shows the payroll (in millions of dollars) and number of wins 

for Major League Baseball teams in 2016, along with the least-squares regression line. The 

points highlighted in red represent the Los Angeles Dodgers (far right) and the Cleveland 

Indians (upper left). 

a. Describe what influence the point representing the Los Angeles Dodgers has on the 
equation of the least-squares regression line. Explain your reasoning. 

b. Describe what influence the point representing the Cleveland Indians has on the standard 


deviation of the residuals and r2. Explain your reasoning. 


Number of wins 


50 100 150 200 250 300 
Payroll (millions of dollars) 


SOLUTION: 


a. Because the point for the Los Angeles Dodgers is on the right and below the least- 
Squares regression line, It is making the slope of the line closer to o and the y intercept 
greater. af the Dodgers’ point was removed, the line would be steeper. 


b. Because the point for the Cleveland Indians has a large residual, it is making the 
standard deviation of the residuals greater and the value of r2 smaller. 


FOR PRACTICE, TRY EXERCISE 67 


ASSOCIATION DOES NOT IMPLY CAUSATION When we study the relationship 
between two variables, we often hope to show that changes in the explanatory variable cause 
changes in the response variable. oOo A strong association between two variables is not 


enough to draw conclusions about cause and effect. Sometimes an observed association 
really does reflect cause and effect. A household that heats with natural gas uses more gas in 
colder months because cold weather requires burning more gas to stay warm. In other cases, an 
association is explained by other variables, and the conclusion that x causes y is not valid. 

A study once found that people with two cars live longer than people who own only one 
car.2© Owning three cars is even better, and so on. There is a substantial positive association 
between number of cars x and length of life y. Can we lengthen our lives by buying more cars? 
No. The study used number of cars as a quick indicator of wealth. Well-off people tend to have 
more cars. They also tend to live longer, probably because they are better educated, take better 
care of themselves, and get better medical care. The cars have nothing to do with it. There is no 
cause-and-effect link between number of cars and length of life. 


Remember: It only makes sense to talk about the correlation between two quantitative variables. If 


one or both variables are categorical, you should refer to the association between the two 
variables. To be safe, use the more general term association when describing the relationship 
between any two variables. 


Section 3.2 Summary 


e A regression line is a line that describes how a response variable y changes as an 
explanatory variable x changes. You can use a regression line to predict the value of y for 
any value of x by substituting this x value into the equation of the line. 

e The slope b, of a regression line y\=b0+b1x¥ = 0 + 61% describes how the predicted value 
of y changes for each increase of 1 unit of x. 

e The y intercept bo of a regression line y\=b0+b1x¥ = 40 + 612 js the predicted value of y 


when the explanatory variable x equals 0. This prediction is of no statistical use unless x can 
actually take values near 0. 


e Avoid extrapolation, using a regression line to make predictions using values of the 
explanatory variable outside the values of the data from which the line was calculated. 


e The most common method of fitting a line to a scatterplot is least squares. The least-squares 
regression line is the line that minimizes the sum of the squares of the vertical distances of 
the observed points from the line. 


e You can examine the fit of a regression line by studying the residuals, which are the 
differences between the actual values of y and predicted values of y. Be on the lookout for 
curved patterns in the residual plot, which indicate that a linear model may not be 
appropriate. 


e The standard deviation of the residuals s measures the typical size of a residual when 
using the regression line. 


e The coefficient of determination r2 is the percent of the variation in the response variable 
that is accounted for by the least-squares regression line using a particular explanatory 
variable. 


e The least-squares regression line of y on x is the line with slope eae Sx and 
intercept bO=y —b1x 6» = ¥ — 61©, This line always passes through the point (xy )® y). 

e Correlation and regression must be used with caution. Plot the data to be sure that the 
relationship is roughly linear and to detect outliers. Also look for influential observations, 
individual points that substantially change the correlation or the regression line. Outliers in x 
are often influential. 


e Most of all, be careful not to conclude that there is a cause-and-effect relationship between 
two variables just because they are strongly associated. 


3.2 Technology Corners 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


9. Calculating least-squares regression lines Page 184 
10. Making residual plots Page 187 


Section 3.2 Exercises 


37. PQ 178 |) Predicting wins Earlier we investigated the relationship between x = 


payroll (in millions of dollars) and y = number of wins for Major League Baseball teams 
in 2016. Here is a scatterplot of the data, along with the regression line y\=60.7+0.139x 


j = 60.7 + 0.139. 
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a. Predict the number of wins for a team that spends $200 million on payroll. 
b. Predict the number of wins for a team that spends $400 million on payroll. 


c. How confident are you in each of these predictions? Explain your reasoning. 


38. How much gas? Joan is concerned about the amount of energy she uses to heat her home. 
The scatterplot shows the relationship between x = mean temperature in a particular month 
and y = mean amount of natural gas used per day (in cubic feet) in that month, along with 
the regression line y\=1425-19.87xY = 1425 — 19.872, 


a. Predict the mean amount of natural gas Joan will use per day in a month with a mean 
temperature of 30°F. 
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b. Predict the mean amount of natural gas Joan will use per day in a month with a mean 
temperature of 65°F. 


c. How confident are you in each of these predictions? Explain your reasoning. 


39. pg 180 _») Residual wins Refer to Exercise 37. The Chicago Cubs won the World 


40. 


41. 


42. 


43. 


Series in 2016. They had 103 wins and spent $182 million on payroll. Calculate and 
interpret the residual for the Cubs. 


Residual gas Refer to Exercise 38. During March, the average temperature was 46.4°F 
and Joan used an average of 490 cubic feet of gas per day. Calculate and interpret the 
residual for this month. 


pg 182 |») More wins? Refer to Exercise 37. 


a. Interpret the slope of the regression line. 


b. Does the value of the y intercept have meaning in this context? If so, interpret the y 
intercept. If not, explain why. 


Less gas? Refer to Exercise 38. 

a. Interpret the slope of the regression line. 

b. Does the value of the y intercept have meaning in this context? If so, interpret the y 
intercept. If not, explain why. 


Long strides The scatterplot shows the relationship between x = height of a student (in 
inches) and y = number of steps required to walk the length of a school hallway, along 
with the regression line y\=113.6—-0.921xY = 113.6 — 0.9212, 
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a. Calculate and interpret the residual for Kiana, who is 67 inches tall and took 49 steps to 
walk the hallway. 


b. Matthew is 10 inches taller than Samantha. About how many fewer steps do you expect 
Matthew to take compared to Samantha? 


44. Crickets chirping The scatterplot shows the relationship between x = temperature in 


degrees Fahrenheit and y = chirps per minute for the striped ground cricket, along with the 
regression line y\=—0.31+0.212x¥ = —0.31 + 0.212 27 


Chirps per minute 
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Temperature (°F) 


a. Calculate and interpret the residual for the cricket who chirped 20 times per minute 
when the temperature was 88.6°F. 


b. About how many additional chirps per minute do you expect a cricket to make if the 
temperature increases by 10°F? 


45. More Olympic athletes In Exercises 5 and 11, you described the relationship between 
height (in inches) and weight (in pounds) for Olympic track and field athletes. The 
scatterplot shows this relationship, along with two regression lines. The regression line for 
the shotput, hammer throw, and discus throw athletes (blue squares) is yA=—115+5.13x 


y = —115 + 5.132. The regression line for the remaining athletes (black dots) is 
y\=-297+6.41xY = —297 + 6.412, 


Weight (Ib) 
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a. How do the regression lines compare? 


b. How much more do you expect a 72-inch discus thrower to weigh than a 72-inch 
sprinter? 


46. More Starbucks In Exercises 6 and 12, you described the relationship between fat (in 


grams) and the number of calories in products sold at Starbucks. The scatterplot shows 
this relationship, along with two regression lines. The regression line for the food products 
(blue squares) is y\=170+11.8xY = 170 + 11.82. The regression line for the drink products 
(black dots) is yA=88+24.5x¥ = 88 + 24.52, 
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a. How do the regression lines compare? 


b. How many more calories do you expect to find in a food item with 5 grams of fat 
compared to a drink item with 5 grams of fat? 


47. 186 Infant weights in Nahya A study of nutrition in developing countries 
pg. Wo] 


collected data from the Egyptian village of Nahya. Researchers recorded the mean weight 
(in kilograms) for 170 infants in Nahya each month during their first year of life. A hasty 
user of statistics enters the data into software and computes the least-squares line without 


looking at the scatterplot first. The result is weight/=4.88+0.267Weight = 4.88 + 0.267 
(age). Use the residual plot to determine if this linear model is appropriate. 


Residual 
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48. Driving speed and fuel consumption Exercise 9 (page 171) gives data on the fuel 


49. 


50. 


ol. 


consumption y of a car at various speeds x. Fuel consumption is measured in liters of 
gasoline per 100 kilometers driven, and speed is measured in kilometers per hour. A 
Statistical software package gives the least-squares regression line y4=11.058—0.01466x 
y = 11.058- 0.014662 | Use the residual plot to determine if this linear model is 
appropriate. 


Residual 
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Actual weight Refer to Exercise 47. Use the equation of the least-squares regression line 
and the residual plot to estimate the actual mean weight of the infants when they were 1 
month old. 


Actual consumption Refer to Exercise 48. Use the equation of the least-squares 
regression line and the residual plot to estimate the actual fuel consumption of the car 
when driving 20 kilometers per hour. 


Movie candy Is there a relationship between the amount of sugar (in grams) and the 
number of calories in movie-theater candy? Here are the data from a sample of 12 types of 
candy: 


Name Sugar (g) Calories 
Butterfinger Minis 45 450 
Junior Mints 107 570 
M&M’S® 62 480 
Milk Duds 44 370 
Peanut M&M’S® 19 790 
Raisinets 60 420 
Reese’s Pieces 61 580 
Skittles 87 450 
Sour Patch Kids 92 490 
Sweetarts 136 680 


Twizzlers 59 460 


Whoppers 48 350 


a. Sketch a scatterplot of the data using sugar as the explanatory variable. 


b. Use technology to calculate the equation of the least-squares regression line for 
predicting the number of calories based on the amount of sugar. Add the line to the 
scatterplot from part (a). 


c. Explain why the line calculated in part (b) is called the “least-squares” regression line. 


52. Long jumps Here are the 40-yard-dash times (in seconds) and long-jump distances (in 


53. 


54. 


50. 


56. 


inches) for a small class of 12 students: 


Dash time (sec) 5.41 5.05 7.01 7.17 6.73 5.68 
Long-jump distance (in.) 171 184 90 65 78 130 
Dash time (sec) 5.78 6.31 6.44 6.50 6.80 7.25 
Long-jump distance (in.) 173 143 92 139 120 110 


a. Sketch a scatterplot of the data using dash time as the explanatory variable. 


b. Use technology to calculate the equation of the least-squares regression line for 
predicting the long-jump distance based on the dash time. Add the line to the scatterplot 
from part (a). 

c. Explain why the line calculated in part (b) is called the “least-squares” regression line. 


More candy Refer to Exercise 51. Use technology to create a residual plot. Sketch the 
residual plot and explain what information it provides. 


More long jumps Refer to Exercise 52. Use technology to create a residual plot. Sketch 
the residual plot and explain what information it provides. 


pg 191 |») Longer strides In Exercise 43, we summarized the relationship between x = 


height of a student (in inches) and y = number of steps required to walk the length of a 
school hallway, with the regression line y\=113.6—-0.921xY = 113.6 — 0.9212 Foy this 
model, technology gives s = 3.50 and r2 = 0.399. 


a. Interpret the value of s. 


b. Interpret the value of r2. 


Crickets keep chirping In Exercise 44, we summarized the relationship between x = 
temperature in degrees Fahrenheit and y = chirps per minute for the striped ground cricket, 
with the regression line y\=-0.31+0.212x¥ =~ 0.31 + 0.2122, For this model, technology 
gives s = 0.97 and r2 = 0.697. 


a. Interpret the value of s. 


b. Interpret the value of r2. 


57. Olympic figure skating For many people, the women’s figure skating competition is the 


58. 


highlight of the Olympic Winter Games. Scores in the short program x and scores in the 
free skate y were recorded for each of the 24 skaters who competed in both rounds during 
the 2010 Winter Olympics in Vancouver, Canada.22 Here is a scatterplot with least- 
squares regression line yA=-16.2+2.07x¥ = —16.2 + 2.07%. For this model, s = 10.2 and 
r? = 0.736. 


Free skate score 


Short program score 


a. Calculate and interpret the residual for the 2010 gold medal winner Yu-Na Kim, who 
scored 78.50 in the short program and 150.06 in the free skate. 


b. Interpret the slope of the least-squares regression line. 


c. Interpret the value of s. 
d. Interpret the value of r2. 


Age and height A random sample of 195 students was selected from the United Kingdom 
using the Census At School data selector. The age x (in years) and height y (in 
centimeters) were recorded for each student. Here is a scatterplot with the least-squares 
regression line y\=106.1+4.21xY = 106.1 + 4.212, For this model, s = 8.61 and r2 = 
0.274. 
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59. 


C. 


d. 


Calculate and interpret the residual for the student who was 141 cm tall at age 10. 
Interpret the slope of the least-squares regression line. 
Interpret the value of s. 


Interpret the value of r2. 


pg 192 |») More mess? When Mentos are dropped into a newly opened bottle of Diet 


Coke, carbon dioxide is released from the Diet Coke very rapidly, causing the Diet Coke 
to be expelled from the bottle. To see if using more Mentos causes more Diet Coke to be 
expelled, Brittany and Allie used twenty-four 2-cup bottles of Diet Coke and randomly 
assigned each bottle to receive either 2, 3, 4, or 5 Mentos. After waiting for the fizzing to 
stop, they measured the amount expelled (in cups) by subtracting the amount remaining 
from the original amount in the bottle.22 Here is some computer output from a regression 
of y = amount expelled on x = number of Mentos: 


Amount expelled (cups) 


2 3 4 5 
Number of Mentos 


zB 
& 
a 
= 
3 
% 
= 
2 3 4 5 
Number of Mentos 
Term Coef SE Coef T-Value P-Value 


Constant 1.0021 0.0451 22.21 0.000 


60. 


Mentos 0.0708 0.0123 5.77 0.000 
S = 0.06724 R-Sq = 60.21% R-Sq(adj) = 58.40% 


a. Is a line an appropriate model to use for these data? Explain how you know. 


b. Find the correlation. 


c. What is the equation of the least-squares regression line? Define any variables that you 
use. 


d. Interpret the values of s and r?. 


Less mess? Kerry and Danielle wanted to investigate whether tapping on a can of soda 
would reduce the amount of soda expelled after the can has been shaken. For their 
experiment, they vigorously shook 40 cans of soda and randomly assigned each can to be 
tapped for 0 seconds, 4 seconds, 8 seconds, or 12 seconds. After waiting for the fizzing to 
stop, they measured the amount expelled (in milliliters) by subtracting the amount 
remaining from the original amount in the can.22 Here is some computer output from a 
regression of y = amount expelled on x = tapping time: 


E 100 ° 
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90 : 
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Tapping time (sec) 


15 . 

a 

a 

= 

= 

Ff 

x 

0 2 4 6 8 10 12 
Tapping time (sec) 

Term Coef SE Coef T-Value P-Value 
Constant 106.360 1.320 80.34 0.000 


Tapping _time -2.635 Q.177 -14.90 0.000 


61. 


62. 


S 55.00347 R-Sq 585.38% R-Sq(adj) 584.99% 


a. Is a line an appropriate model to use for these data? Explain how you know. 
b. Find the correlation. 


c. What is the equation of the least-squares regression line? Define any variables that you 
use. 


d. Interpret the values of s and r?. 


Temperature and wind The average temperature (in degrees Fahrenheit) and average 
wind speed (in miles per hour) were recorded for 365 consecutive days at Chicago’s 
O’Hare International Airport. Here is computer output for a regression of y = average 
wind speed on x = average temperature: 


Summary of Fit 


RSquare 0.047874 

RSquare Adj 0.045251 

Root Mean Square Error 3.655950 

Mean of Response 9.826027 

Observations (or Sum 365 

wgts) 
Parameter Estimates 
Term Estimate Std Error t Ratio Prob> |t| 
Intercept 11.897762 0.521320 22.82 <.0001* 
Avg temp -0.041077 0.009615 -4,27 <.0001* 


a. Calculate and interpret the residual for the day where the average temperature was 42°F 
and the average wind speed was 2.2 mph. 


b. Interpret the slope. 


c. By about how much do the actual average wind speeds typically vary from the values 
predicted by the least-squares regression line with x = average temperature? 


d. What percent of the variability in average wind speed is accounted for by the least- 
squares regression line with x = average temperature? 


Beetles and beavers Do beavers benefit beetles? Researchers laid out 23 circular plots, 
each 4 meters in diameter, in an area where beavers were cutting down cottonwood trees. 
In each plot, they counted the number of stumps from trees cut by beavers and the number 
of clusters of beetle larvae. Ecologists believe that the new sprouts from stumps are more 
tender than other cottonwood growth, so beetles prefer them. If so, more stumps should 
produce more beetle larvae.24 Here is computer output for a regression of y = number of 
beetle larvae on x = number of stumps: 


Summary of Fit 


63. 


64. 


RSquare 0.839144 


RSquare Adj 0.831484 

Root Mean Square Error 6.419386 

Mean of Response 25.086960 

Observations (or Sum 23 

wgts) 
Parameter Estimates 
Term Estimate Std Error t Ratio- Prob>|t| 
Intercept -1.286104 2.853182 -0.45 0.6568 
Number of stumps 11.893733 1.136343 10.47 <.0001* 


a. Calculate and interpret the residual for the plot that had 2 stumps and 30 beetle larvae. 


b. Interpret the slope. 


c. By about how much do the actual number of larvae typically vary from the values 
predicted by the least-squares regression line with x = number of stumps? 


d. What percent of the variability in number of larvae is accounted for by the least-squares 
regression line with x = number of stumps? 


pg 195 |») Husbands and wives The mean height of married American women in their 


early 20s is 64.5 inches and the standard deviation is 2.5 inches. The mean height of 
married men the same age is 68.5 inches with standard deviation 2.7 inches. The 
correlation between the heights of husbands and wives is about r = 0.5. 


a. Find the equation of the least-squares regression line for predicting a husband’s height 
from his wife’s height for married couples in their early 20s. 


b. Suppose that the height of a randomly selected wife was 1 standard deviation below 
average. Predict the height of her husband without using the least-squares line. 


The stock market Some people think that the behavior of the stock market in January 
predicts its behavior for the rest of the year. Take the explanatory variable x to be the 
percent change in a stock market index in January and the response variable y to be the 
change in the index for the entire year. We expect a positive correlation between x and y 
because the change during January contributes to the full year’s change. Calculation from 
data for an 18-year period gives 


x =1.75% sx=5.36% y =9.07% sy=15.35% r=0.596 
z= 1.75%s,=5.36%y = 9.07% sy = 15.35% r = 0.596 


a. Find the equation of the least-squares line for predicting full-year change from January 
change. 


b. Suppose that the percent change in a particular January was 2 standard deviations above 
average. Predict the percent change for the entire year without using the least-squares 
line. 


65. 


66. 


67. 


Will I bomb the final? We expect that students who do well on the midterm exam in a 
course will usually also do well on the final exam. Gary Smith of Pomona College looked 
at the exam scores of all 346 students who took his statistics class over a 10-year period.22 
Assume that both the midterm and final exam were scored out of 100 points. 


a. State the equation of the least-squares regression line if each student scored the same on 
the midterm and the final. 


b. The actual least-squares line for predicting final-exam score y from midterm-exam 


score x was y\=46.6+0.41xY = 46.6 + 0.412. Predict the score of a student who scored 
50 on the midterm and a student who scored 100 on the midterm. 


c. Explain how your answers to part (b) illustrate regression to the mean. 


It’s still early We expect that a baseball player who has a high batting average in the first 
month of the season will also have a high batting average the rest of the season. Using 66 
Major League Baseball players from a recent season,=* a least-squares regression line was 
calculated to predict rest-of-season batting average y from first-month batting average x. 
Note: A player’s batting average is the proportion of times at bat that he gets a hit. A 
batting average over 0.300 is considered very good in Major League Baseball. 


a. State the equation of the least-squares regression line if each player had the same 
batting average the rest of the season as he did in the first month of the season. 


b. The actual equation of the least-squares regression line is y\=0.245+0.109x 


y = 0.245 + 0.1092, Predict the rest-of-season batting average for a player who had a 
0.200 batting average the first month of the season and for a player who had a 0.400 
batting average the first month of the season. 


c. Explain how your answers to part (b) illustrate regression to the mean. 


pg 200 |») Who’s got hops? Haley, Jeff, and Nathan measured the height (in inches) 


and vertical jump (in inches) of 74 students at their school.4 Here is a scatterplot of the 
data, along with the least-squares regression line. Jacob (highlighted in red) had a vertical 
jump of nearly 3 feet! 


Vertical jump (in.) 
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a. Describe the influence that Jacob’s point has on the equation of the least-squares 
regression line. 


b. Describe the influence that Jacob’s point has on the standard deviation of the residuals 
2 
and r-. 


68. Stand mixers The scatterplot shows the weight (in pounds) and cost (in dollars) of 11 
stand mixers.22 The mixer from Walmart (highlighted in red) was much lighter—and 
cheaper—than the other mixers. 


Price ($) 


5 10 15 20 25 30 35 
Weight (Ib) 


a. Describe what influence the highlighted point has on the equation of the least-squares 
regression line. 


b. Describe what influence the highlighted point has on the standard deviation of the 
residuals and r?. 


69. Managing diabetes People with diabetes measure their fasting plasma glucose (FPG, 
measured in milligrams per milliliter) after fasting for at least 8 hours. Another 


measurement, made at regular medical checkups, is called HbA. This is roughly the 
percent of red blood cells that have a glucose molecule attached. It measures average 
exposure to glucose over a period of several months. The table gives data on both HbA 
and FPG for 18 diabetics five months after they had completed a diabetes education 


class.28 
Subject HbA (%) FPG (mg/ml) 
1 6.1 141 
2 6.3 158 
3 6.4 112 
4 6.8 153 
5 7.0 134 
6 7.1 95 
7 7.5 96 
8 if 78 
9 7.9 148 
10 8.7 172 
11 9.4 200 
12 10.4 271 
13 10.6 103 
14 10.7 172 
15 10.7 359 
16 11.2 145 
17 13.7 147 
18 19.3 255 


a. Make a scatterplot with HbA as the explanatory variable. Describe what you see. 


b. Subject 18 is an outlier in the x direction. What effect do you think this subject has on 
the correlation? What effect do you think this subject has on the equation of the least- 
squares regression line? Calculate the correlation and equation of the least-squares 
regression line with and without this subject to confirm your answer. 


c. Subject 15 is an outlier in the y direction. What effect do you think this subject has on 
the correlation? What effect do you think this subject has on the equation of the least- 
squares regression line? Calculate the correlation and equation of the least-squares 
regression line with and without this subject to confirm your answer. 


70. Rushing for points What is the relationship between rushing yards and points scored in 
the National Football League? The table gives the number of rushing yards and the 
number of points scored for each of the 16 games played by the Jacksonville Jaguars in a 


recent season.24 


Game Rushing yards Points scored 


a. 


a 163 16 
2 Ii? 3 
3 128 10 
4 104 10 
5 96 20 
6 133 13 
7 132 12 
8 84 14 
9 141 17 
10 108 10 
11 105 13 
12 129 14 
13 116 41 
14 116 14 
15 113 lg 
16 190 19 


Make a scatterplot with rushing yards as the explanatory variable. Describe what you 
see. 


The number of rushing yards in Game 16 is an outlier in the x direction. What effect do 
you think this game has on the correlation? On the equation of the least-squares 
regression line? Calculate the correlation and equation of the least-squares regression 
line with and without this game to confirm your answers. 


The number of points scored in Game 13 is an outlier in the y direction. What effect do 
you think this game has on the correlation? On the equation of the least-squares 
regression line? Calculate the correlation and equation of the least-squares regression 
line with and without this game to confirm your answers. 


Multiple Choice Select the best answer for Exercises 71—78. 


71. Which of the following is not a characteristic of the least-squares regression line? 


a. 


b. 
C; 


d. 


e. 


The slope of the least-squares regression line is always between —1 and 1. 
The least-squares regression line always goes through the point (xy y). 
The least-squares regression line minimizes the sum of squared residuals. 


The slope of the least-squares regression line will always have the same sign as the 
correlation. 


The least-squares regression line is not resistant to outliers. 


72. Each year, students in an elementary school take a standardized math test at the end of the 
school year. For a class of fourth-graders, the average score was 55.1 with a standard 


dios 


74. 


deviation of 12.3. In the third grade, these same students had an average score of 61.7 with 
a standard deviation of 14.0. The correlation between the two sets of scores is r = 0.95. 
Calculate the equation of the least-squares regression line for predicting a fourth-grade 
score from a third-grade score. 


a. yA=3.58+0.835xI = 3.58 + 0.835 

b. yA=15.69+0.835xY = 15.69 + 0.8352 
Cc. yA=2.19+1.08xY = 2.19 + 1.082 

d. yA=-11.54+1.08xY = —11.54 + 1.082 


e. Cannot be calculated without the data. 


Using data from the LPGA tour, a regression analysis was performed using x = average 
driving distance and y = scoring average. Using the output from the regression analysis 
shown below, determine the equation of the least-squares regression line. 


Predictor Coef SE Coef T P 
Constant 87.974000 2.391000 36.78 0.000 
Driving Distance -0.060934 0.009536 -6.39 0.000 
S = 1.01216 R-Sq = 22.1% R-Sq(adj) = 21.6% 


a. y\=87.974+2.391xy = 87.974 + 2.3912 
b. yA=87.974+1.01216xY = 87.974 + 1.01216x 

c. y\=87.974-0.060934xY = 87.974 — 0.060934x 

d. y\=-0.060934+1.01216xY = —0.060934 + 1.01216x 


e. yA=-0.060934+87.947xY = —9.060934 + 87.9472 
Exercises 74 to 78 refer to the following setting. Measurements on young children in 
Mumbai, India, found this least-squares line for predicting y = height (in cm) from x = arm 
span (in cm):38 

y\=6.4+0.93xY = 6.44 0.932 


By looking at the equation of the least-squares regression line, you can see that the 
correlation between height and arm span is 


a. greater than zero. 
b. less than zero. 

c. 0.93. 

d. 6.4. 


e. Can’t tell without seeing the data. 


75. In addition to the regression line, the report on the Mumbai measurements says that r2 = 
0.95. This suggests that 


76. 


77. 


78. 


a. 


although arm span and height are correlated, arm span does not predict height very 
accurately. 


height increases by 0.95=0.97 cmv 9-95 = 0.97 cm for each additional centimeter of arm 
span. 


. 95% of the relationship between height and arm span is accounted for by the regression 


line. 


. 95% of the variation in height is accounted for by the regression line with x = arm span. 


. 95% of the height measurements are accounted for by the regression line with x = arm 


span. 


One child in the Mumbai study had height 59 cm and arm span 60 cm. This child’s 
residual is 


a. 
b. 
c 
d. 


e. 


—3.2 cm. 
—2.2 cm. 
—1.3 cm. 
3.2 cm. 


62.2 cm. 


Suppose that a tall child with arm span 120 cm and height 118 cm was added to the 
sample used in this study. What effect will this addition have on the correlation and the 
slope of the least-squares regression line? 


a. 
b. 
ci 
d. 


e. 


Correlation will increase, slope will increase. 
Correlation will increase, slope will stay the same. 
Correlation will increase, slope will decrease. 
Correlation will stay the same, slope will stay the same. 


Correlation will stay the same, slope will increase. 


Suppose that the measurements of arm span and height were converted from centimeters 
to meters by dividing each measurement by 100. How will this conversion affect the 


values of r2 and s? 


a. 


bi 


r2 will increase, s will increase. 


r2 will increase, s will stay the same. 


79. 


80. 


c. r2 will increase, s will decrease. 
d. r2 will stay the same, s will stay the same. 


e. r? will stay the same, s will decrease. 
Recycle and Review 


Fuel economy (2.2) In its recent Fuel Economy Guide, the Environmental Protection 
Agency (EPA) gives data on 1152 vehicles. There are a number of outliers, mainly 
vehicles with very poor gas mileage or hybrids with very good gas mileage. If we ignore 
the outliers, however, the combined city and highway gas mileage of the other 1120 or so 
vehicles is approximately Normal with mean 18.7 miles per gallon (mpg) and standard 
deviation 4.3 mpg. 


a. The Chevrolet Malibu with a four-cylinder engine has a combined gas mileage of 25 
mpg. What percent of the 1120 vehicles have worse gas mileage than the Malibu? 


b. How high must a vehicle’s gas mileage be in order to fall in the top 10% of the 1120 


vehicles? 


Marijuana and traffic accidents (1.1) Researchers in New Zealand interviewed 907 
drivers at age 21. They had data on traffic accidents and they asked the drivers about 
marijuana use. Here are data on the numbers of accidents caused by these drivers at age 
19, broken down by marijuana use at the same age:22 


Marijuana use per year 


Never 1210 times 11250 times 511 times 
Number of drivers 452 229 70 156 
Accidents caused 59 36 15 50 


a. Make a graph that displays the accident rate for each category of marijuana use. Is there 


evidence of an association between marijuana use and traffic accidents? Justify your 
answer. 


b. Explain why we can’t conclude that marijuana use causes accidents based on this study. 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Two Statistics students went to a flower shop and randomly selected 12 carnations. 
When they got home, the students prepared 12 identical vases with exactly the same 
amount of water in each vase. They put one tablespoon of sugar in 3 vases, two 
tablespoons of sugar in 3 vases, and three tablespoons of sugar in 3 vases. In the 
remaining 3 vases, they put no sugar. After the vases were prepared, the students 
randomly assigned 1 carnation to each vase and observed how many hours each flower 
continued to look fresh. A scatterplot of the data is shown below. 


Freshness (h) 


0 1 2 3 
Amount of sugar (tbsp) 


a. Briefly describe the association shown in the scatterplot. 


b. The equation of the least-squares regression line for these data is y\=180.8+15.8x 
j = 180.8 + 15.82 = 180.8 + 15.82, Interpret the slope of the line in the context of the 
study. 


c. Calculate and interpret the residual for the flower that had 2 tablespoons of sugar and 
looked fresh for 204 hours. 


d. Suppose that another group of students conducted a similar experiment using 12 
flowers, but included different varieties in addition to carnations. Would you expect the 
value of r2 for the second group’s data to be greater than, less than, or about the same 
as the value of r? for the first group’s data? Explain. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.comitps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 3 Review 


Section 3.1: Scatterplots and Correlation 


In this section, you learned how to explore the relationship between two quantitative variables. 
As with distributions of a single variable, the first step is always to make a graph. A scatterplot 
is the appropriate type of graph to investigate relationships between two quantitative variables. 
To describe a scatterplot, be sure to discuss four characteristics: direction, form, strength, and 
unusual features. The direction of a relationship might be positive, negative, or neither. The 
form of a relationship can be linear or nonlinear. A relationship is strong if it closely follows a 
specific form. Finally, unusual features include outliers that clearly fall outside the pattern of 
the rest of the data and distinct clusters of points. 


The correlation r is a numerical summary for linear relationships that describes the 
direction and strength of the association. When r > 0, the association is positive, and when r < 
0, the association is negative. The correlation will always take values between —1 and 1, withr 
= -1 andr = 1 indicating a perfectly linear relationship. Strong linear relationships have 
correlations near 1 or —1, while weak linear relationships have correlations near 0. It isn’t 
possible to determine the form of a relationship from only the correlation. Strong nonlinear 
relationships can have a correlation close to 1 or a correlation close to 0. You also learned that 
outliers can greatly affect the value of the correlation and that correlation does not imply 
causation. That is, we can’t assume that changes in one variable cause changes in the other 
variable, just because they have a correlation close to 1 or —1. 


Section 3.2: Least-Squares Regression 


In this section, you learned how to use least-squares regression lines as models for relationships 
between two quantitative variables that have a linear association. It is important to understand 
the difference between the actual data and the model used to describe the data. To emphasize 
that the model only provides predicted values, least-squares regression lines are always 
expressed in terms of y\i¥ instead of y. Likewise, when you are interpreting the slope of a 
least-squares regression line, describe the change in the predicted value of y. 


The difference between the actual value of y and the predicted value of y is called a 
residual. Residuals are the key to understanding almost everything in this section. To find the 
equation of the least-squares regression line, find the line that minimizes the sum of the squared 
residuals. To see if a linear model is appropriate, make a residual plot. If there is no leftover 
curved pattern in the residual plot, you know the model is appropriate. To assess how well a 
line fits the data, calculate the standard deviation of the residuals s to estimate the size of a 
typical prediction error. You can also calculate r2, which measures the percent of the variation 
in the y variable that is accounted for by the least-squares regression line. 


You also learned how to obtain the equation of a least-squares regression line from 


computer output and from summary Statistics (the means and standard deviations of two 
variables and their correlation). As with the correlation, the equation of the least-squares 
regression line and the values of s and r2 can be greatly influenced by outliers, so be sure to 
plot the data and note any unusual features before making any calculations. 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 

Learning Target Section Page(s) Exercise(s) 
Distinguish between explanatory and response re PI 154 R3.4 
variables for quantitative data. 
Make a scatterplot to display the relationship between Oud. 155 R3.4 
two quantitative variables. 
Describe the direction, form, and strength of a ool. 158 Ra 
relationship displayed in a scatterplot and identify R3.2 
unusual features. 
Interpret the correlation. Sal 162 Aas 
Understand the basic properties of correlation, 3.1 165,169 R31, 
including how the correlation is influenced by outliers. R32 
Distinguish correlation from causation. al 165 R3.6 
Make predictions using regression lines, Keeping in 3.2 178 R3.4, 
mind the dangers of extrapolation. R29 
Calculate and interpret a residual. Bue 180 RSs, 

R3.4 
Interpret the slope and y intercept of a regression line. 3.2 182 R3.4 
Determine the equation of a least-squares regression Bue 192 Rg, 
line using technology or computer output. R3.4 
Construct and interpret residual plots to assess 3:2 186 as, 
whether a regression model is appropriate. R3.4 
Interpret the standard deviation of the residuals and /2 ro 191 Pes, 
and use these values to assess how well a least- R3.9 


squares regression line models the relationship 
between two variables. 


Describe how the least-squares regression line, 3.2 200 neem 
standard deviation of the residuals, and ,2 are 


influenced by outliers. 
Find the slope and y intercept of the least-squares 3.2 195 


regression line from the means and standard 
deviations of x and y and their correlation. 


A 
on 


Chapter 3 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 


chapter. 


R3.1 |») Born to be old? Is there a relationship between the gestational period (time from 


conception to birth) of an animal and its average life span? The figure shows a scatterplot 
of the gestational period and average life span for 43 species of animals.“2 


Life span (years) 


0 100 200 300 400 500 600 700 
Gestation (days) 


Describe the relationship shown in the scatterplot. 


. Point A is the hippopotamus. What effect does this point have on the correlation, the 


equation of the least-squares regression line, and the standard deviation of the 
residuals? 


Point B is the Asian elephant. What effect does this point have on the correlation, the 
equation of the least-squares regression line, and the standard deviation of the 
residuals? 


R3.2 p) Penguins diving A study of king penguins looked for a relationship between how 


deep the penguins dive to seek food and how long they stay under water.* For all but the 
shallowest dives, there is an association between x = depth (in meters) and y = dive 
duration (in minutes) that is different for each penguin. The study gives a scatterplot for 
one penguin titled “The Relation of Dive Duration (y) to Depth (x).” The scatterplot 
shows an association that is positive, linear, and strong. 


a 9 


a. Explain the meaning of the term positive association in this context. 
b. 


Explain the meaning of the term linear association in this context. 
Explain the meaning of the term strong association in this context. 


Suppose the researchers reversed the variables, using x = dive duration and y = depth. 


Would this change the correlation? The equation of the least-squares regression line? 


R3.3 |p) Stats teachers’ cars A random sample of AP® Statistics teachers was asked to 


report the age (in years) and mileage of their primary vehicles. Here are a scatterplot, a 
residual plot, and other computer output: 


160,000 
e ” 
120,000 ye 
‘ sie 
= ‘ ° 
2 80,000 - BY 
_ a“ ° 
= -— 
40,000 - QE: 
xe * 
ss 
04¥¢ 
T T T T T T 
0 2 4 6 8 10 12 
Age (years) 
60,000 e 
40,000 
3 . 
& 20,000 
= ? 
2S e 
= 0 
-20,000 
Age (years) 
Predictor Coef SE Coef T P 
Constant 3704 8268 0.45 0.662 
Age 12188 1492 8.17 0.000 
S = 20870.5 R-Sq = 83.7% R-Sq(adj) = 82.4% 


a. Is a linear model appropriate for these data? Explain how you know this. 
b. What’s the correlation between car age and mileage? Interpret this value in context. 


c. Give the equation of the least-squares regression line for these data. Identify any 
variables you use. 


d. One teacher reported that her 6-year-old car had 65,000 miles on it. Find and interpret 
its residual. 


e. Interpret the values of s and r2. 


R3.4 |») Late bloomers? Japanese cherry trees tend to blossom early when spring weather is 


warm and later when spring weather is cool. Here are some data on the average March 
temperature (in degrees Celsius) and the day in April when the first cherry blossom 


appeared over a 24-year period:42 


Temperature (°C) 40 54 3.2 26 42 47 49 40 49 3.8 4.0 5.1 


Days in April to first blossom 14 8 11 19 14 14 14 21 9 14 13 11 
Temperature (°C) 43 15 3.7 38 45 41 61 62 51 50 46 4.0 
Days in April to first blossom 13 28 17 19 10 17 3 3 11 6 9 11 


a. Make a well-labeled scatterplot that’s suitable for predicting when the cherry trees 
will blossom from the temperature. Which variable did you choose as the explanatory 
variable? Explain your reasoning. 


b. Use technology to calculate the correlation and the equation of the least-squares 
regression line. Interpret the slope and y intercept of the line in this setting. 


c. Suppose that the average March temperature this year was 8.2°C. Would you be 
willing to use the equation in part (b) to predict the date of first blossom? Explain 
your reasoning. 


d. Calculate and interpret the residual for the year when the average March temperature 
was 4.5°C. 


e. Use technology to help construct a residual plot. Describe what you see. 


R3.5 jp) What’s my grade? In Professor Friedman’s economics course, the correlation 


between the students’ total scores prior to the final examination and their final exam 
scores is r = 0.6. The pre-exam totals for all students in the course have mean 280 and 
standard deviation 30. The final exam scores have mean 75 and standard deviation 8. 
Professor Friedman has lost Julie’s final exam but knows that her total before the exam 
was 300. He decides to predict her final exam score from her pre-exam total. 


a. Find the equation for the least-squares regression line Professor Friedman should use 
to make this prediction. 


b. Use the least-squares regression line to predict Julie’s final exam score. 
c. Explain the meaning of the phrase “least squares” in the context of this question. 
d. Julie doesn’t think this method accurately predicts how well she did on the final 


exam. Determine r2. Use this result to argue that her actual score could have been 
much higher (or much lower) than the predicted value. 


R3.6 ») Calculating achievement The principal of a high school read a study that reported a 


high correlation between the number of calculators owned by high school students and 
their math achievement. Based on this study, he decides to buy each student at his school 
two calculators, hoping to improve their math achievement. Explain the flaw in the 
principal’s reasoning. 


Chapter 3 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T3.1 A school guidance counselor examines how many extracurricular activities students 
participate in and their grade point average. The guidance counselor says, “The 
evidence indicates that the correlation between the number of extracurricular activities 
a student participates in and his or her grade point average is close to 0.” Which of the 
following is the most appropriate conclusion? 


a. Students involved in many extracurricular activities tend to be students with poor 
grades. 


b. Students with good grades tend to be students who are not involved in many 
extracurricular activities. 


c. Students involved in many extracurricular activities are just as likely to get good 
grades as bad grades. 


d. Students with good grades tend to be students who are involved in many 
extracurricular activities. 


e. No conclusion should be made based on the correlation without looking at a 
scatterplot of the data. 


T3.2 An AP® Statistics student designs an experiment to see whether today’s high school 
students are becoming too calculator-dependent. She prepares two quizzes, both of 
which contain 40 questions that are best done using paper-and-pencil methods. A 
random sample of 30 students participates in the experiment. Each student takes both 
quizzes—one with a calculator and one without—in a random order. To analyze the 
data, the student constructs a scatterplot that displays a linear association between the 
number of correct answers with and without a calculator for the 30 students. A least- 
squares regression yields the equation 


calculator’ = -1.2 + 0.865 (pencil) r= 0.79calculator 1.2 + 0.865 (pencil) r= 0.79 
calculator = — 1.2 + 0.865 (pencil) r= 0.79 


Which of the following statements is/are true? 


I. If the student had used Calculator as the explanatory variable, the correlation would 
remain the same. 


II. If the student had used Calculator as the explanatory variable, the slope of the least- 
squares line would remain the same. 


Il. The standard deviation of the number of correct answers on the paper-and-pencil 
quizzes was smaller than the standard deviation on the calculator quizzes. 


I only 


oT p 


IT only 

c. If only 

I and IT only 
e. I, IZ, and III 


aan 


Questions T3.3—T3.5 refer to the following setting. Scientists examined the activity 
level of 7 fish at different temperatures. Fish activity was rated on a scale of 0 (no 
activity) to 100 (maximal activity). The temperature was measured in degrees Celsius. 
A computer regression printout and a residual plot are provided. Notice that the 
horizontal axis on the residual plot is labeled “Fitted value,” which means the same 


thing as “predicted value.” 


Residual 


55 60 65 70 75 80 85 90 95 
Fitted value 


Predictor Coef SE Coef T 
Constant 148.62 10.71 13.88 
Temperature -3.2167 0.4533 -7.10 

S = 4.78505 R-Sq = 91.0% R-Sq (adj) 


T3.3 What is the correlation between temperature and fish activity? 
a. 0.95 
b. 0.91 
c. 0.45 
d. —0.91 
e. —0.95 


T3.4 What was the actual activity level rating for the fish at a temperature of 20°C? 


a. 87 
b. 84 
c. 81 


d. 


e. 


66 
3 


T3.5 Which of the following gives a correct interpretation of s in this setting? 


a. 


e. 


For every 1°C increase in temperature, fish activity is predicted to increase by 4.785 
units. 


. The typical distance of the temperature readings from their mean is about 4.785°C. 


The typical distance of the activity level ratings from the least-squares line is about 
4.785 units. 


. The typical distance of the activity level readings from their mean is about 4.785 


units. 


At a temperature of 0°C, this model predicts an activity level of 4.785 units. 


T3.6 Which of the following statements is not true of the correlation r between the lengths 
(in inches) and weights (in pounds) of a sample of brook trout? 


a. 
b. 


C. 


r must take a value between —1 and 1. 
r is measured in inches. 
If longer trout tend to also be heavier, then r > 0. 


r would not change if we measured the lengths of the trout in centimeters instead of 
inches. 


r would not change if we measured the weights of the trout in kilograms instead of 
pounds. 


T3.7 When we standardize the values of a variable, the distribution of standardized values 
has mean 0 and standard deviation 1. Suppose we measure two variables X and Y on 
each of several subjects. We standardize both variables and then compute the least- 
squares regression line. Suppose the slope of the least-squares regression line is 20.44. 
We may conclude that 


a. the intercept will also be —0.44. 
b. 

C. 
d. 


e. 


the intercept will be 1.0. 
the correlation will be 1/—0.44. 
the correlation will be 1.0. 


the correlation will also be —0.44. 


T3.8 There is a linear relationship between the number of chirps made by the striped ground 
cricket and the air temperature. A least-squares fit of some data collected by a biologist 
gives the model yA=25.2+3.3x = 25.2 +3.3e = 25.2 + 3.32, where x is the number of 
chirps per minute and y/iY is the estimated temperature in degrees Fahrenheit. What is 
the predicted increase in temperature for an increase of = chirps per minute? 


a. 


3.0°R 


b. 16.5°F 
C 25,2°F 
d. 28.5°F 
e. 41.7°F 


T3.9 The scatterplot shows the relationship between the number of people per television set 
and the number of people per physician for 40 countries, along with the least-squares 
regression line. In Ethiopia, there were 503 people per TV and 36,660 people per 


doctor. Which of the following is correct? 
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c. The correlation is greater than 1. 


jae 


Increasing the number of TVs in a country will attract more doctors. 


. The slope of the least-squares regression line is less than 1. 


. The point for Ethiopia is decreasing the slope of the least-squares regression line. 


e. Ethiopia has more people per doctor than expected, based on how many people it 


has per TV. 


T3.10 The scatterplot shows the lean body mass and metabolic rate for a sample of = adults. 
For each person, the lean body mass is the subject’s total weight in kilograms less any 
weight due to fat. The metabolic rate is the number of calories burned in a 24-hour 


period. 


Metabolic rate (cal/24 hours) 


1800 
1700 
1600 
1500 
1400 
1300 
1200 
1100 
1000 


35 40 45 50 55 
Lean body mass (kg) 


Because a person with no lean body mass should burn no calories, it makes sense to 
model the relationship with a direct variation function in the form y = kx. Models were 
tried using different values of k (k = 25, k = 26, etc.) and the sum of squared residuals 
(SSR) was calculated for each value of k. Here is a scatterplot showing the relationship 
between SSR and k: 


SSR (1000s) 


According to the scatterplot, what is the ideal value of k to use for predicting metabolic 
rate? 


a 7 


a. 24 
b. 


25 
29 
31 
36 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 


completeness of your results and explanations. 


T3.11 Sarah’s parents are concerned that she seems short for her age. Their doctor has kept the 
following record of Sarah’s height: 


Age (months) 36 48 51 54 5/ 60 
Height (cm) 86 90 91 93 94 95 


a. Make a scatterplot of these data using age as the explanatory variable. Describe what 
you see. 


b. Using your calculator, find the equation of the least-squares regression line. 
c. Calculate and interpret the residual for the point when Sarah was 48 months old. 


d. Would you be confident using the equation from part (b) to predict Sarah’s height 
when she is 40 years old? Explain. 


T3.12 Drilling down beneath a lake in Alaska yields chemical evidence of past changes in 
climate. Biological silicon, left by the skeletons of single-celled creatures called 
diatoms, is a measure of the abundance of life in the lake. A rather complex variable 
based on the ratio of certain isotopes relative to ocean water gives an indirect measure 
of moisture, mostly from snow. As we drill down, we look further into the past. Here is 
a scatterplot of data from 2300 to 12,000 years ago: 


Silicon 
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Isotope 


a. Identify the unusual point in the scatterplot and estimate its x and y coordinates. 
b. Describe the effect this point has on 

i. the correlation. 

ii. the slope and y intercept of the least-squares line. 


iii. the standard deviation of the residuals. 


T3.13 Long-term records from the Serengeti National Park in Tanzania show interesting 
ecological relationships. When wildebeest are more abundant, they graze the grass more 
heavily, so there are fewer fires and more trees grow. Lions feed more successfully 
when there are more trees, so the lion population increases. Researchers collected data 
on one part of this cycle, wildebeest abundance (in thousands of animals) and the 
percent of the grass area burned in the same year. The results of a least-squares 
regression on the data are shown here.“2 
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a. Is a linear model appropriate for describing the relationship between wildebeest 
abundance and percent of grass area burned? Explain. 


b. Give the equation of the least-squares regression line. Be sure to define any variables 


you use. 


. Interpret the slope. Does the value of the y intercept have meaning in this context? If 
so, interpret the y intercept. If not, explain why. 


. Interpret the standard deviation of the residuals and r2. 


Chapter 3 Project Investigating Relationships in 
Baseball 


What is a better predictor of the number of wins for a baseball team, the number of runs scored 
by the team or the number of runs they allow the other team to score? What variables can we 
use to predict the number of runs a team scores? To predict the number of runs it allows the 
other team to score? In this project, you will use technology to help answer these questions by 
exploring a large set of data from Major League Baseball. 


Part 1 


. Download the “MLB Team Data 2012-2016” Excel file from the book’s website, along with 
the “Glossary for MLB Team Data file,” which explains each of the variables included in the 
data set.“ Import the data into the statistical software package you prefer. 


’. Create a scatterplot to investigate the relationship between runs scored per game (R/G) and 
wins (W). Then calculate the equation of the least-squares regression line, the standard 
deviation of the residuals, and r2. Note: R/G is in the section for hitting statistics and W is in 
the section for pitching statistics. 


3. Create a scatterplot to investigate the relationship between runs allowed per game (RA/G) 
and wins (W). Then calculate the equation of the least-squares regression line, the standard 
deviation of the residuals, and r2. Note: Both of these variables may be found in the section 
for pitching statistics. 


|. Compare the two associations. Is runs scored or runs allowed a better predictor of wins? 
Explain your reasoning. 


». Because the number of wins a team has is dependent on both how many runs they score and 
how many runs they allow, we can use a combination of both variables to predict the number 
of wins. Add a column in your data table for a new variable, run differential. Fill in the 
values using the formula R/G — RA/G. 


}. Create a scatterplot to investigate the relationship between run differential and wins. Then 
calculate the equation of the least-squares regression line, the standard deviation of the 
residuals, and r2, 


’. Is run differential a better predictor than the variable you chose in Question 4? Explain your 
reasoning. 


Part 2 


It is fairly clear that the number of games a team wins is dependent on both runs scored and 
runs allowed. But what variables help predict runs scored? Runs allowed? 


. Choose either runs scored (R) or runs allowed (RA) as the response variable you will try to 
model. 

’. Choose at least three different explanatory variables (or combinations of explanatory 
variables) that might help predict the response variable you chose in Question 1. Create a 
scatterplot using each explanatory variable. Then calculate the equation of the least-squares 
regression line, the standard deviation of the residuals, and r2 for each relationship. 


3, Which explanatory variable from Question 2 is the best? Explain your reasoning. 


Chapter 4 Collecting Data 
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Introduction 


Y ou can hardly go a day without hearing the results of a statistical study. Here are some 
examples: 


e The National Highway Traffic Safety Administration (NHTSA) reports that seat belt use in 
passenger vehicles increased from 88.5% in 2015 to 90.1% in 2016.4 


e According to a survey, U.S. teens aged 13 to 18 use entertainment media (television, 
Internet, social media, listening to music, etc.) nearly 9 hours a day, on average.2 


e A study suggests that lack of sleep increases the risk of catching a cold.2 


¢ For their final project, two AP® Statistics students showed that listening to music while 
studying decreased subjects’ performance on a memory task.4 


Can we trust these results? As you’ll learn in this chapter, the answer depends on how the data 
were produced. Let’s take a closer look at where the data came from in each of these studies. 


Each year, the NHTSA conducts an observational study of seat belt use in vehicles. The 
NHTSA sends trained observers to record the behavior of people in vehicles at randomly 
selected locations across the country. The idea of an observational study is simple: you can 
learn a lot just by watching or by asking a few questions, as in the survey of teens’ media 
habits. Common Sense Media conducted this survey using a random sample of 1399 U.S. 13- 
to 18-year-olds. Both of these studies use information from a sample to draw conclusions about 
some larger population. Section 4.1 examines the issues involved in sampling and surveys. 


In the sleep and catching a cold study, 153 volunteers answered questions about their sleep 
habits over a two-week period. Then researchers gave them a virus and waited to see who 
developed a cold. This was a complicated observational study. Compare this with the 
experiment performed by the AP® Statistics students. They recruited 30 students and divided 
them into two groups of 15 by drawing names from a hat. Students in one group tried to 
memorize a list of words while listening to music. Students in the other group tried to 
memorize the same list of words while sitting in silence. Section 4.2 focuses on designing 
experiments. 


In Section 4.3, we revisit two key ideas from Sections 4.1 and 4.2: drawing conclusions 
about a population based on a random sample and drawing conclusions about cause and effect 
based on a randomized experiment. In both cases, we will focus on the role of randomization in 
our analysis. 


SECTION 4.1 Sampling and Surveys 


LEARNING TARGETS By the end of the section, you should be able to: 


Identify the population and sample in a statistical study. 


Identify voluntary response sampling and convenience sampling and explain 
how these sampling methods can lead to bias. 


Describe how to select a simple random sample with technology or a table of 
random digits. 


Describe how to select a sample using stratified random sampling and cluster 
sampling, distinguish stratified random sampling from cluster sampling, and give 
an advantage of each method. 


Explain how undercoverage, nonresponse, question wording, and other aspects 
of a sample survey can lead to bias. 


Suppose we want to find out what percent of young drivers in the United States text while 
driving. To answer the question, we will survey 16- to 20-year-olds who live in the United 
States and drive. Ideally, we would ask them all by conducting a census. But contacting every 
driver in this age group wouldn’t be practical: it would take too much time and cost too much 
money. Instead, we pose the question to a sample chosen to represent the entire population of 
young drivers. 


DEFINITION Population, Census, Sample 


The population in a statistical study is the entire group of individuals we want 
information about. A census collects data from every individual in the population. 


A sample is a subset of individuals in the population from which we collect data. 


The distinction between population and sample is basic to statistics. To make sense of any 
sample result, you must know what population the sample represents. 


EXAMPLE | Sampling monitors and voters (>) 
Populations and samples 
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PROBLEM: Identify the population and the sample in each of the following settings. 


a. The quality control manager at a factory that produces computer monitors selects 10 
monitors from the production line each hour. The manager inspects each monitor for 
defects in construction and performance. 


b. Prior to an election, a news organization surveys 1000 registered voters to predict which 
candidate will be elected as president. 

SOLUTION: 

a. The population is all the monitors produced in this factory that hour. The sample 1s 
the 10 monitors selected from the production line. 


b. The population is all registered voters. The sample is the 1000 registered voters 
surveyed. 


Because the sample came from 1 hour’s production at this factory, the population is 


the monitors produced that hour in this factory—not all monitors produced in the world 
or even all monitors produced by this factory. 


FOR PRACTICE, TRY EXERCISE 1 


The Idea of a Sample Survey 


We often draw conclusions about a population based on a sample. Have you ever tasted a 
sample of ice cream and ordered a cone because the sample tastes good? Because ice cream is 
fairly uniform, the single taste represents the whole. Choosing a representative sample from a 
large and varied population (like all young U.S. drivers) is not so easy. The first step in 
planning a sample survey is to decide what population we want to describe. The second step is 
to decide what we want to measure. 


DEFINITION Sample survey 


A sample survey is a study that collects data from a sample that is chosen to 
represent a specific population. 


By our definition, the population in a sample survey can consist of people, animals, or 
things. Some people use the terms survey or sample survey to refer only to studies in which 
people are asked questions, like the news organization survey in the preceding example. We’|l 
avoid this more restrictive terminology. 


The final step in planning a sample survey is to decide how to choose a sample from the 
population. Here is an activity that illustrates the process of conducting a sample survey. 


ACTIVITY Who wrote the Federalist Papers? 


The Federalist Papers are a series of 85 essays supporting the ratification of the U.S. 
Constitution. At the time they were published, the identity of the authors was a secret known 
to only a few people. Over time, however, the authors were identified as Alexander 
Hamilton, James Madison, and John Jay. The authorship of 73 of the essays is fairly certain, 
leaving 12 in dispute. However, thanks in some part to statistical analysis, most scholars now 


believe that the 12 disputed essays were written by Madison alone or in collaboration with 
Hamilton.2 
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There are several ways to use statistics to help determine the authorship of a disputed 


text. One method is to estimate the average word length in a disputed text and compare it to 
the average word lengths of works where the authorship is not in dispute. 


The following passage is the opening paragraph of Federalist Paper No. 51,© one of the 


disputed essays. The theme of this essay is the separation of powers between the three 
branches of government. 


To what expedient, then, shall we finally resort, for maintaining in practice the 
necessary partition of power among the several departments, as laid down in the 
Constitution? The only answer that can be given is, that as all these exterior provisions 
are found to be inadequate, the defect must be supplied, by so contriving the interior 


structure of the government as that its several constituent parts may, by their mutual 
relations, be the means of keeping each other in their proper places. Without presuming 
to undertake a full development of this important idea, I will hazard a few general 
observations, which may perhaps place it in a clearer light, and enable us to form a 
more correct judgment of the principles and structure of the government planned by the 
convention. 


1. Choose 5 words from this passage. Count the number of letters in each of the words you 
selected, and find the average word length. 


2. Your teacher will draw and label a horizontal axis for a class dotplot. Plot the average 
word length you obtained in Step 1 on the graph. 

3. Your teacher will show you how to use a random number generator to select a random 
sample of 5 words from the 130 words in the opening passage. Count the number of letters 
in each of the selected words, and find the average word length. 


4. Your teacher will draw and label another horizontal axis with the same scale for a 
comparative dotplot. Plot the average word length you obtained in Step 3 on the graph. 


5. How do the dotplots compare? Can you think of any reasons why they might be different? 
Discuss with your classmates. 


How to Sample Badly 


Suppose we want to know how long students at a large high school spent doing homework last 
week. We might go to the school library and ask the first 30 students we see about the amount 
of time they spend on their homework. This method is known as a convenience sampling. 


DEFINITION Convenience sampling 


Convenience sampling selects individuals from the population who are easy to 
reach. 


og Convenience sampling often produces unrepresentative data. Consider our sample 
of 30 students from the school library. It’s unlikely that this convenience sample accurately 
represents the homework habits of all students at the high school. In fact, if we were to repeat 
this sampling process day after day, we would almost always overestimate the average 
homework time in the population. Why? Because students who hang out in the library tend to 
be more studious. This is bias: using a method that favors some outcomes over others. 


DEFINITION Bias 


The design of a statistical study shows bias if it is very likely to underestimate or 


very likely to overestimate the value you want to know. 


AP® EXAM TIP 


If you’re asked to describe how the design of a sample survey leads to bias, you're 
expected to do two things: (1) describe how the members of the sample might 
respond differently from the rest of the population, and (2) explain how this 
difference would lead to an underestimate or overestimate. Suppose you were 


asked to explain how using your statistics class as a sample to estimate the 
proportion of all high school students who own a graphing calculator could result in 
bias. You might respond, “This is a convenience sample. It would probably include a 
much higher proportion of students with a graphing calculator than in the population 
at large because a graphing calculator is required for the statistics class. So this 
method would probably lead to an overestimate of the actual population proportion.” 


@ Bias is not just bad luck in one sample. It’s the result of a bad study design that will 


consistently miss the truth about the population in the same way. Convenience sampling will 
almost always result in bias. So will voluntary response sampling. 


DEFINITION Voluntary response sampling 


Voluntary response sampling allows people to choose to be in the sample by 
responding to a general invitation. 


The Internet brings voluntary response sampling to the computer nearest you. Visit 


www.misterpoll.com to become part of the sample in any of dozens of online polls. As the site 
says, “None of these polls are ‘scientific,’ but do represent the collective opinion of everyone who 
participates.” Unfortunately, such polls don’t tell you much about the views of any larger 
population. 


Most Internet polls, along with call-in, text-in, and write-in polls, rely on voluntary 
response sampling. People who self-select to participate in such surveys are usually not 
representative of some larger population of interest. Voluntary response sampling attracts 
people who feel strongly about an issue, and who often share the same opinion. That leads to 
bias. 


EXAMPLE |Boaty McBoatface (>) 
Biased sampling methods 


NERC/SplashNews/Newscom 


PROBLEM: In 2016, Britain’s Natural Environment Research Council invited the public to 
name its new $300 million polar research ship. To vote on the name, people simply needed 
to visit a website and record their choice. Ignoring names suggested by the council, over 
124,000 people voted for “Boaty McBoatface,” which ended up having more than 3 times as 
many votes as the second-place finisher.2 


What type of sampling did the council use in their poll? Explain how bias in this sampling 
method could have affected the poll results. 


SOLUTION: 
The council used voluntary response sampling: people chose to go online and respond. The 


people who chose to be in the sample were probably less serious about science than the 
British population as a whole—and more likely to prefer a funny name. The proportion of 
people in the sample who prefer the name Boaty McBoatface ts likely to be greater than 
the proportion of all British residents who would choose this name. 


Remember to describe how the responses from the members of the sample might 


differ from the responses from the rest of the population and how this difference will 
affect the estimate. 


FOR PRACTICE, TRY EXERCISE 5 


CHECK YOUR UNDERSTANDING 


A 


y For each of the following situations, identify the sampling method used. Then explain 
how bias in the sampling method could affect the results. 


1. A farmer brings a juice company several crates of oranges each week. A company 
inspector looks at 10 oranges from the top of each crate before deciding whether to buy all 
the oranges. 


2. The ABC program Nightline once asked if the United Nations should continue to have its 
headquarters in the United States. Viewers were invited to call one telephone number to 
respond “Yes” and another to respond “No.” There was a charge for calling either number. 
More than 186,000 callers responded, and 67% said “No.” 


How to Sample Well: Random Sampling 


In convenience sampling, the researcher chooses easy-to-reach members of the population. In 
voluntary response sampling, people decide whether to join the sample. Both sampling 
methods suffer from bias due to personal choice. As you discovered in The Federalist Papers 
activity, a good way to avoid bias is to let chance choose the sample. That’s the idea of 


random sampling. 


DEFINITION Random sampling 


Random sampling involves using a chance process to determine which 
members of a population are included in the sample. 


In everyday life, some people use the word random to mean “haphazard,” as in “That’s so 


random.” In statistics, random means “using chance.” Don’t say that a sample was chosen at 
random if a chance process wasn’t used to select the individuals. 


For example, to choose a random sample of 6 students from a class of 30, start by writing 
each of the 30 names on a separate slip of paper, making sure the slips are all the same size. 
Then put the slips in a hat, mix them well, and pull out slips one at a time until you have 
identified 6 different students. An alternative approach would be to give each member of the 
population a distinct number and to use the “hat method” with these numbers instead of 
people’s names. Note that this version would work just as well if the population consisted of 
animals or things. The resulting sample is called a simple random sample, or SRS for short. 


DEFINITION Simple random sample (SRS) 


A simple random sample (SRS) of size n is chosen in such a way that every 
group of rn individuals in the population has an equal chance to be selected as the 
sample. 


An SRS gives every possible sample of the desired size an equal chance to be chosen. 
Picture drawing 20 slips of paper (the sample) from a hat containing 200 identical slips (the 
population). Any set of 20 slips has the same chance as any other set of 20 to be chosen. This 
also means that each individual has the same chance to be chosen in an SRS. However, giving 


each individual the same chance to be selected is not enough to guarantee that a sample is an 
SRS. Some other random sampling methods give each member of the population an equal 
chance to be selected, but not each possible sample. We’Il look at some of these methods later. 


HOW TO CHOOSE A SIMPLE RANDOM SAMPLE The hat method won’t work well if 
the population is large. Imagine trying to take a simple random sample of 1000 registered 
voters in the United States using a hat! In practice, most people use random numbers generated 
by technology to choose samples. 


HOW TO CHOOSE AN SRS WITH TECHNOLOGY 


e Label. Give each individual in the population a distinct numerical label from 1 to N, 
where N is the number of individuals in the population. 


e Randomize. Use a random number generator to obtain n different integers from 1 to N, 
where n is the sample size. 


e Select. Choose the individuals that correspond to the randomly selected integers. 


When choosing an SRS, we make the selections without replacement. That is, once an 
individual is selected for a sample, that individual cannot be selected again. Many random 
number generators sample numbers with replacement, so it is important to explain that repeated 
numbers should be ignored when using technology to select an SRS. 


11. Technology Corner | CHOOSING AN SRS @& 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


Let’s use a graphing calculator to select an SRS of 10 students from a population of students 
numbered 1 to 1750. 


1. Check that your calculator’s random number generator is working properly. 


e Press |MATH|, then select PROB (PRB) and choose randInt(. 


Newer OS: In the dialogue box, enter these values: lower: 1, upper: 1750, n: 1, choose 


Paste, and press |ENTER|. 


Older OS: Complete the command randInt(1,1750) and press [ENTER]. 


e Compare your results with those of your classmates. If several students got the same 
number, you’!l need to seed your calculator’s random integer generator with different 
numbers before you proceed. Directions for doing this are given in the Teacher’s 
Edition. 


2. Randomly generate 10 distinct numbers from 1 to 1750 by pressing until you 
have chosen 10 different labels. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


randiInt(1,175@) 


Note: If you have a TI-84 Plus CE, use the command RandIntNoRep(1,1750,10) to get 10 
distinct integers from 1 to 1750. If you have a TI-84 with OS 2.55 or later, use the command 
RandIntNoRep(1,1750) to sort the numbers from 1 to 1750 in random order. The first 10 
numbers listed give the labels of the chosen students. 


There are many random number generators available on the Internet, including those at 
www.random.org. You can also use the random number generator on your calculator. 


If you don’t have technology handy, you can use a table of random digits to choose an SRS. 
We have provided a table of random digits at the back of the book (Table D). Here is an 
excerpt: 


Table D Random digits 
LINE 
101 19223 95034 05756 28713 96409 12531 42544 82853 
102 73676 47150 99400 01927 27754 42648 £82425 36290 
103 45467 71709 77558 00095 32863 29485 82226 90056 


You can think of this table as the result of someone putting the digits 0 to 9 in a hat, 
mixing, drawing one, replacing it, mixing again, drawing another, and so on. The digits have 
been arranged in groups of five within numbered rows to make the table easier to read. The 
groups and rows have no special meaning—Table D is just a long list of randomly chosen 
digits. As with technology, there are three steps in using Table D to choose a random sample. 


HOW TO CHOOSE AN SRS USING TABLE D 


e Label. Give each member of the population a distinct numerical label with the same 
number of digits. Use as few digits as possible. 


e Randomize. Read consecutive groups of digits of the appropriate length from left to right 
across a line in Table D. Ignore any group of digits that wasn’t used as a label or that 
duplicates a label already in the sample. Stop when you have chosen n different labels. 


| e Select. Choose the individuals that correspond to the randomly selected integers. | 


Always use the shortest labels that will cover your population. For instance, you can label 
up to 100 individuals with two digits: 01, 02, ... , 99, 00. As standard practice, we recommend 
that you begin with label 1 (or 01 or 001 or 0001, as needed). Reading groups of digits from the 
table gives all individuals the same chance to be chosen because all labels of the same length 
have the same chance to be found in the table. For example, any pair of digits in the table is 
equally likely to be any of the 100 possible labels 01, 02, ... , 99, 00. Here’s an example that 
shows how this process works. 


EXAMPLE | Attendance audit (>) 
Choosing an SRS with Table D 


WALKER ws) SCHOOL 
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PROBLEM: Each year, the state Department of Education randomly selects three schools 
from each district and conducts a detailed audit of their attendance records. 


a. Describe how to use a table of random digits to select an SRS of three schools from this 
list of 19 schools. 


Amphitheater High School Keeling Elementary School 
Amphitheater Middle School La Cima Middle School 

Canyon del Oro High School Mesa Verde Elementary School 
Copper Creek Elementary School Nash Elementary School 
Coronado K-8 School Painted Sky Elementary School 
Cross Middle School Prince Elementary School 
Donaldson Elementary School Rio Vista Elementary School 
Harelson Elementary School Walker Elementary School 
Holaway Elementary School Wilson K-8 School 


Ironwood Ridge High School 


b. Use the random digits here to choose the sample. 
62081 64816 87374 09517 84534 06489 87201 97245 


SOLUTION: 


a. Label the schools from 01 to19 in alphabetical order. Move along a line of random 
digits from left to right, reading two-digit numbers, until three different numbers from 
01 to 19 have been selected (ignoring repeated numbers and the numbers 20-99, 00). 
Audit the three schools that correspond with the numbers selected. 


Remember to include all three steps: 


e Label 
e Randomize 


e Select 


b. 62-skip, 08-select, 16-select, 48-skip, 16-repeat, 87-skip, 3 7-skip, 40-skip, 95-skip, 17- 
select. 
The three schools are 08: Harelson Elementary School, 16: Prince Elementary School, 
and 17: Rio Vista Elementary School. 


FOR PRACTICE, TRY EXERCISE 11 


CHECK YOUR UNDERSTANDING 


. 


y A furniture maker buys hardwood in batches that each contain 1000 pieces. The supplier 
is supposed to dry the wood before shipping (wood that isn’t dry won’t hold its size and 
shape). The furniture maker chooses five pieces of wood from each batch and tests their 
moisture content. If any piece exceeds 12% moisture content, the entire batch is sent back. 
Describe how to select a simple random sample of 5 pieces using each of the following. 


1. A random number generator 


2. A table of random digits 


Other Random Sampling Methods 


One of the most common alternatives to simple random sampling is called stratified random 


sampling. This method involves sampling from groups (strata) of similar individuals within 
the population separately. Then these separate “subsamples” are combined to form the sample. 


DEFINITION Strata, Stratified random sample 


Strata are groups of individuals in a population who share characteristics thought 
to be associated with the variables being measured in a study. Stratified random 
sampling selects a sample by choosing an SRS from each stratum and 
combining the SRSs into one overall sample. 


The singular form of strata is stratum. 


Stratified random sampling works best when the individuals within each stratum are similar 
with respect to what is being measured and when there are large differences between strata. For 
example, in a study of sleep habits on school nights, the population of students in a large high 
school might be divided into freshman, sophomore, junior, and senior strata. After all, it is 
reasonable to think that freshmen have different sleep habits than seniors. The following 
activity illustrates the benefit of choosing appropriate strata. 


ACTIVITY Sampling sunflowers 


A British farmer grows sunflowers for making sunflower oil. Her field is arranged in a grid 
pattern, with 10 rows and 10 columns as shown in the figure. Irrigation ditches run along the 
top and bottom of the field. The farmer would like to estimate the number of healthy plants 
in the field so she can project how much money she’!| make from selling them. It would take 
too much time to count the plants in all 100 squares, so she’|l accept an estimate based on a 
sample of 10 squares. 


Li Ding/Alamy 


1. Use Table D or technology to take a simple random sample of 10 grid squares. Record the 
location (e.g., B6) of each square you select. 


2. This time, select a stratified random sample using the rows as strata. Use Table D or 
technology to randomly select one square from each horizontal row. Record the location 


of each square—for example, Row 1: G, Row 2: B, and so on. 


3. Now, take a stratified random sample using the columns as strata. Use Table D or 
technology to randomly select one square from each vertical column. Record the location 
of each square—for example, Column A: 4, Column B: 1, and so on. 


4. Your teacher will provide the actual number of healthy sunflowers in each grid square. 
Use that information to calculate your estimate of the mean number of healthy sunflowers 
per square in the entire field for each of your samples in Steps 1, 2, and 3. 


5. Make comparative dotplots showing the mean number of healthy sunflowers obtained 
using the three different sampling methods for all members of the class. Describe any 
similarities and differences you see. 


The following dotplots show the mean number of healthy plants in 100 samples using each 
of the three sampling methods in the activity: simple random sampling, stratified random 
sampling with rows of the field as strata, and stratified random sampling with columns of the 
field as strata. Notice that all three distributions are centered at about 102.5, the true mean 
number of healthy plants in all squares of the field. That makes sense because random sampling 
tends to yield accurate estimates of unknown population means. 
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One other detail stands out in the graphs: there is much less variability in the estimates 


when we use the rows as strata. The table provided by your teacher shows the actual number of 
healthy sunflowers in each grid square. Notice that the squares within each row contain a 
similar number of healthy plants but that there are big differences between rows. When we can 
choose strata that have similar responses (e.g., number of healthy plants) within strata but 
different responses between strata, stratified random samples give more precise estimates than 
simple random samples of the same size. 


Why didn’t using the columns as strata reduce the variability of the estimates in a similar 
way? Because the numbers of healthy plants vary a lot within each column and aren’t very 
different from other columns. 


Both simple random sampling and stratified random sampling are hard to use when 
populations are large and spread out over a wide area. In that situation, we’d prefer a method 
that selects groups (clusters) of individuals that are “near” one another. That’s the idea of 


cluster sampling. 


DEFINITION Clusters, Cluster sampling 


A cluster is a group of individuals in the population that are located near each 
other. Cluster sampling selects a sample by randomly choosing clusters and 
including each member of the selected clusters in the sample. 


Cluster sampling is often used for practical reasons, like saving time and money. It works 
best when the clusters look just like the population but on a smaller scale. Imagine a large high 
school that assigns students to homerooms alphabetically by last name, in groups of 25. 
Administrators want to survey 200 randomly selected students about a proposed schedule 
change. It would be difficult to track down an SRS of 200 students, so the administration opts 
for a cluster sample of homerooms. The principal (who knows some statistics) selects an SRS 
of 8 homerooms and gives the survey to all 25 students in each homeroom. 


Be sure you understand the difference between strata and clusters. We want each stratum to 
contain similar individuals and for large differences to exist between strata. For a cluster 
sample, we’d like each cluster to look just like the population, but on a smaller scale. 
Unfortunately, cluster samples don’t offer the statistical advantage of better information about 
the population that stratified random samples do. Here’s an example that compares stratified 
random sampling and cluster sampling. 


EXAMPLE | Sampling at a school assembly (>) 
Other sampling methods 


H. Mark Weidman Photography/Alamy 


PROBLEM: The student council wants to conduct a survey about use of the school library 
during the first five minutes of an all-school assembly in the auditorium. There are 800 
students present at the assembly. Here is a map of the auditorium. Note that students are 
seated by grade level and that the seats are numbered from 1 to 800. 


9th grade: Seats 601—800 10th grade: Seats 401-600 
11th grade: Seats 201-400 12th grade: Seats 1-200 


a. Describe how to obtain a sample of 80 students using stratified random sampling. Explain 
your choice of strata and why this method might be preferred to simple random sampling. 


b. Describe how to obtain a sample of 80 students using cluster sampling. Explain your 
choice of clusters and why this method might be preferred to simple random sampling. 


SOLUTION: 


a. Because students’ library use might be similar within grade levels but different ACYOSS 
grade levels, use the grade-level seating areas as strata. For the oth grade, generate 20 
different random integers from 601 to 800 and give the survey to the students in those 
seats. Do the same for sophomores, juniors, and seniors using their corresponding seat 
numbers. Stratification by grade level should result in more precise estimates of student 


opinion than a simple random sample of the same size. 


b. Each column of seats from the stage to the back of the auditorium could be used as a 
cluster because it would be relatively easy to hand out the surveys to an entire column. 
Label the columns from 1 to 20 starting at the left side of the stage, generate 2 different 
integers from 1 to 20, and give the survey to the 80 students sitting in these two 
columns. Cluster sampling is much more efficient than finding 80 seats scattered about 
the auditorium, as required by simple random sampling. 


Note that each cluster contains students from all four grade levels, so each should 


represent the population fairly well. Randomly selecting 4 rows as clusters would also 
be easy, but this may over- or under-represent one grade level. 
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Most large-scale sample surveys use multistage sampling, which combines two or more 
sampling methods. For example, the U.S. Census Bureau carries out a monthly Current 
Population Survey (CPS) of about 60,000 households. Researchers start by choosing a stratified 
random sample of neighborhoods in 756 of the 2007 geographical areas in the United States. 
Then they divide each neighborhood into clusters of four nearby households and select a cluster 
sample to interview. 

Analyzing data from sampling methods other than simple random sampling takes us 
beyond basic statistics. But the SRS is the building block of more elaborate methods, and the 
principles of analysis remain much the same for these other methods. 


CHECK YOUR UNDERSTANDING 


Ny 


”, A factory runs 24 hours a day, producing wood pencils on three 8-hour shifts—day, 

evening, and overnight. In the last stage of manufacturing, the pencils are packaged in boxes 

of 10 pencils each. Each day a sample of 300 pencils is selected and inspected for quality. 

1. Describe how to select a stratified random sample of 300 pencils. Explain your choice of 
Strata. 

2. Describe how to select a cluster sample of 300 pencils. Explain your choice of clusters. 

3. Explain a benefit of using a stratified random sample and a benefit of using a cluster 
sample in this context. 


Sample Surveys: What Else Can Go Wrong? 


As we have learned, the use of bad sampling methods (convenience or voluntary response) 
often leads to bias. Researchers can avoid these methods by using random sampling to choose 
their samples. Other problems in conducting sample surveys are more difficult to avoid. 


Sampling is sometimes done using a list of individuals in the population, called a sampling 
frame. Such lists are seldom accurate or complete. The result is undercoverage. 


DEFINITION Undercoverage 


Undercoverage occurs when some members of the population are less likely to 
be chosen or cannot be chosen in a sample. 


Most samples suffer from some degree of undercoverage. A sample survey of households, 
for example, will miss not only homeless people but also prison inmates and students in 
dormitories. An opinion poll conducted by calling landline telephone numbers will miss 
households that have only cell phones as well as households without a phone. The results of 
sample surveys may not be accurate if the people who are undercovered differ from the rest of 
the population in ways that affect their responses. 


Even if every member of the population is equally likely to be selected for a sample, not all 
members of the population are equally likely to provide a response. Some people are never at 
home and cannot be reached by pollsters on the phone or in person. Other people see an 
unfamiliar phone number on their caller ID and never pick up the phone or quickly hang up 
when they don’t recognize the voice of the caller. These are examples of nonresponse, another 
major source of bias in surveys. 


DEFINITION Nonresponse 


Nonresponse occurs when an individual chosen for the sample can’t be 
contacted or refuses to participate. 


Nonresponse leads to bias when the individuals who can’t be contacted or refuse to 
participate would respond differently from those who do participate. Consider a telephone 
survey that asks people how many hours of television they watch per day. People who are 
selected but are out of the house won’t be able to respond. Because these people probably 
watch less television than the people who are at home when the phone call is made, the mean 
number of hours obtained in the sample is likely to be greater than the mean number of hours 
of TV watched in the population. 

How bad is nonresponse? According to polling guru Nate Silver, “Response rates to 
political polls are dismal. Even polls that make every effort to contact a representative sample 
of voters now get no more than 10 percent to complete their surveys—down from about 35 
percent in the 1990s.”8 In contrast, the Census Bureau’s American Community Survey (ACS) 
has the lowest nonresponse rate of any poll we know: only about 1% of the households in the 


sample refuse to respond. The overall nonresponse rate, including “never at home” and other 
causes, is just 2.5%2 
@ Some students misuse the term voluntary response to explain why certain 


individuals don’t respond in a sample survey. Their belief is that participation in the survey 
is optional (voluntary), so anyone can refuse to take part. What the students are describing is 
nonresponse. Think about it this way: nonresponse can occur only after a sample has been 
selected. In a voluntary response sample, every individual has opted to take part, so there won’t 
be any nonresponse. 


The wording of questions has an important influence on the answers given to a sample 
survey. Confusing or leading questions can introduce strong bias. Even a single word can make 
a difference. In a recent Quinnipiac University poll, half of the respondents were asked if they 
support “stronger gun laws” and the other half were asked if they support “stronger gun control 
laws.” In the first group, 52% of respondents supported stronger laws, but when the word 


control was added to the question, only 46% of respondents supported stronger laws.22 


The gender, age, ethnicity, or behavior of the interviewer can also affect people’s 
responses. People may lie about their age, income, or drug use. They may misremember how 
many hours they spent on the Internet last week. Or they might make up an answer to a 
question that they don’t understand. All these issues can lead to response bias. 


DEFINITION Response bias 


Response bias occurs when there is a systematic pattern of inaccurate answers 
to a survey question. 


EXAMPLE | Wash your hands! (>) 
Response bias 


PROBLEM: What percent of Americans wash their hands after using the bathroom? It 
depends on how you collect the data. In a telephone survey of 1006 U.S. adults, 96% said 
they always wash their hands after using a public restroom. An observational study of 6028 
adults in public restrooms told a different story: only 85% of those observed washed their 


hands after using the restroom. Explain why the results of the two studies are so different 11 


SOLUTION: 


When asked in person, many people may lie about always washing their hands because 
they want to appear to have good hygiene. When people are only observed and not asked 


directly, the percent who wash their hands will be smaller—and much closer to the truth. 
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Even the order in which questions are asked is important. For example, ask a sample of 
college students these two questions: 


e “How happy are you with your life in general?” (Answer on a scale of 1 to 5.) 


e “How many dates did you have last month?” 


There is almost no association between responses to the two questions when asked in this 
order. It appears that dating has little to do with happiness. Reverse the order of the questions, 
however, and a much stronger association appears: college students who say they had more 
dates tend to give higher ratings of happiness about life. The lesson is clear: the order in which 
questions are asked can influence the results. 


CHECK YOUR UNDERSTANDING 


‘ 
y 


1. Each of the following is a possible source of bias in a sample survey. Name the type of 
bias that could result. 
a. The sample is chosen at random from a telephone directory. 
b. Some people cannot be contacted in five calls. 
c. Interviewers choose people walking by on the sidewalk to interview. 


2. A survey paid for by makers of disposable diapers found that 84% of the sample opposed 
banning disposable diapers. Here is the actual question: 


It is estimated that disposable diapers account for less than 2% of the trash in today’s 
landfills. In contrast, beverage containers, third-class mail, and yard wastes are 
estimated to account for about 21% of the trash in landfills. Given this, in your 
opinion, would it be fair to ban disposable diapers?_2 


Do you think the estimate of 84% is less than, greater than, or about equal to the percent 
of all people in the population who would oppose banning disposable diapers? Explain 
your reasoning. 


Section 4.1 Summary 


e A census collects data from every individual in the population. 


e A sample survey selects a sample from the population of all individuals about which we 
desire information. The goal of a sample survey is to draw conclusions about the population 
based on data from the sample. 


e Convenience sampling chooses individuals who are easiest to reach. In voluntary response 
sampling, individuals choose to join the sample in response to an open invitation. Both these 
sampling methods usually lead to bias: they will be very likely to underestimate or very 
likely to overestimate the value you want to know. 


e Random sampling uses a chance process to select a sample. 


e A simple random sample (SRS) gives every possible sample of a given size the same 
chance to be chosen. Choose an SRS by labeling the members of the population and using a 
table of random digits or technology to select the sample. 


e To use stratified random sampling, divide the population into groups of individuals 
(strata) that are similar in some way that might affect their responses. Then choose a 
separate SRS from each stratum and combine these SRSs to form the sample. When strata 
are “similar within but different between,” stratified random samples tend to give more 
precise estimates of unknown population values than do simple random samples. 


e To use cluster sampling, divide the population into groups of individuals that are located 
near each other, called clusters. Randomly select some of these clusters. All the individuals 
in the chosen clusters are included in the sample. Ideally, clusters are “different within but 
similar between.” Cluster sampling saves time and money by collecting data from entire 
groups of individuals that are close together. 


e Random sampling helps avoid bias in choosing a sample. Bias can still occur in the sampling 
process due to undercoverage, which happens when some members of the population are 
less likely to be chosen or cannot be chosen for the sample. 


e Other serious problems in sample surveys can occur after the sample is chosen. The single 
biggest problem is nonresponse: when people can’t be contacted or refuse to answer. 
Untruthful answers by respondents, poorly worded questions, and other problems can lead to 
response bias. 


4.1 Technology Corners | @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


11. Choosing an SRS 


Section 4.1 Exercises 


1. pg 221 |») Sampling stuffed envelopes A large retailer prepares its customers’ 


monthly credit card bills using an automatic machine that folds the bills, stuffs them into 
envelopes, and seals the envelopes for mailing. Are the envelopes completely sealed? 
Inspectors choose 40 envelopes at random from the 1000 stuffed each hour for visual 


inspection. Identify the population and the sample. 


Student archaeologists An archaeological dig turns up large numbers of pottery shards, 
broken stone tools, and other artifacts. Students working on the project classify each 
artifact and assign a number to it. The counts in different categories are important for 
understanding the site, so the project director chooses 2% of the artifacts at random and 
checks the students’ work. Identify the population and the sample. 


Students as customers A high school’s student newspaper plans to survey local 
businesses about the importance of students as customers. From an alphabetical list of all 
local businesses, the newspaper staff chooses 150 businesses at random. Of these, 73 
return the questionnaire mailed by the staff. Identify the population and the sample. 


Customer satisfaction A department store mails a customer satisfaction survey to people 
who make credit card purchases at the store. This month, 45,000 people made credit card 
purchases. Surveys are mailed to 1000 of these people, chosen at random, and 137 people 
return the survey form. Identify the population and the sample. 


pg 224 |») Sleepless nights How much sleep do high school students get on a typical 


school night? A counselor designed a survey to find out. To make data collection easier, 
the counselor surveyed the first 100 students to arrive at school on a particular morning. 
These students reported an average of 7.2 hours of sleep on the previous night. 


a. What type of sample did the counselor obtain? 


b. Explain why this sampling method is biased. Is 7.2 hours probably greater than or less 
than the true average amount of sleep last night for all students at the school? Why? 


Online polls Parade magazine posed the following question: “Should drivers be banned 
from using all cell phones?” Readers were encouraged to vote online at parade.com. The 
subsequent issue of Parade reported the results: 2407 (85%) said “Yes” and 410 (15%) 
said “No.” 


a. What type of sample did the Parade survey obtain? 


b. Explain why this sampling method is biased. Is 85% probably greater than or less than 
the true percent of all adults who believe that all cell phone use while driving should be 
banned? Why? 


Online reviews Many websites include customer reviews of products, restaurants, hotels, 
and so on. The manager of a hotel was upset to see that 26% of reviewers on a travel 
website gave the hotel “1 star’—the lowest possible rating. Explain how bias in the 
sampling method could affect the estimate. 


Funding for fine arts The band director at a high school wants to estimate the percentage 
of parents who support a decrease in the budget for fine arts. Because many parents attend 
the school’s annual musical, the director surveys the first 30 parents who arrive at the 
show. Explain how bias in the sampling method could affect the estimate. 


9. 


10. 


il, 


12. 


Explain it to the congresswoman You are on the staff of a member of Congress who is 
considering a bill that would provide government-sponsored insurance for nursing-home 
care. You report that 1128 letters have been received on the issue, of which 871 oppose 
the legislation. “I’m surprised that most of my constituents oppose the bill. I thought it 
would be quite popular,” says the congresswoman. Are you convinced that a majority of 
the voters oppose the bill? How would you explain the statistical issue to the 
congresswoman? 


Sampling mall shoppers You may have seen the mall interviewer, clipboard in hand, 
approaching people passing by. Explain why even a large sample of mall shoppers would 
not provide a trustworthy estimate of the current unemployment rate in the city where the 
mall is located. 


pg 228 |») Do you trust the Internet? You want to ask a sample of high school 


students the question “How much do you trust information about health that you find on 
the Internet—a great deal, somewhat, not much, or not at all?” You try out this and other 
questions on a pilot group of 5 students chosen from your class. 


a. Explain how you would use a line of Table D to choose an SRS of 5 students from the 
following list. 


b. Use line 107 to select the sample. Show how you use each of the digits. 


Anderson Drasin Kim Rider 
Arroyo Eckstein Molina Rodriguez 
Batista Fernandez Morgan Samuels 
Bell Fullmer Murphy Shen 
Burke Gandhi Nguyen Tse 
Cabrera Garcia Palmiero Velasco 
Calloway Glaus Percival Wallace 
Delluci Helling Prince Washburn 
Deng Husain Puri Zabidi 

De Ramos Johnson Richards Zhao 


Apartment living You are planning a report on apartment living in a college town. You 
decide to select three apartment complexes at random for in-depth interviews with 
residents. 


a. Explain how you would use a line of Table D to choose an SRS of 3 complexes from 
the following list. 


b. Use line 117 to select the sample. Show how you use each of the digits. 


Ashley Oaks Country View Mayfair Village 
Bay Pointe Country Villa Nobb Hill 
Beau Jardin Crestview Pemberly Courts 


13. 


14. 


15. 


16. 


Bluffs Del-Lynn Peppermill 


Brandon Place Fairington Pheasant Run 
Briarwood Fairway Knolls Richfield 
Brownstone Fowler Sagamore Ridge 
Burberry Franklin Park Salem Courthouse 
Cambridge Georgetown Village Manor 
Chauncey Village Greenacres Waterford Court 
Country Squire Lahr House Williamsburg 


Sampling the forest To gather data on a 1200-acre pine forest in Louisiana, the U.S. 
Forest Service laid a grid of 1410 equally spaced circular plots over a map of the forest. A 
ground survey visited a sample of 10% of the plots.t2 


a. Explain how you would use a random number generator to choose an SRS of 141 plots. 
Your description should be clear enough for a classmate to carry out your plan. 


b. Use your method from part (a) to choose the first 3 plots. 


Sampling gravestones The local genealogical society in Coles County, Illinois, has 
compiled records on all 55,914 gravestones in cemeteries in the county for the years 1825 
to 1985. Historians plan to use these records to learn about African Americans in Coles 
County’s history. They first choose an SRS of 395 records to check their accuracy by 


visiting the actual gravestones./4 


a. Explain how you would use a random number generator to choose the SRS. Your 
description should be clear enough for a classmate to carry out your plan. 


b. Use your method from part (a) to choose the first 3 gravestones. 


Dead trees On the west side of Rocky Mountain National Park, many mature pine trees 
are dying due to infestation by pine beetles. Scientists would like to use sampling to 
estimate the proportion of all pine trees in this area that have been infested. 


a. Explain why it wouldn’t be practical for scientists to obtain an SRS in this setting. 


b. A possible alternative would be to use every pine tree along the park’s main road as a 
sample. Why is this sampling method biased? 


c. Suppose that a more complicated random sampling plan is carried out, and that 35% of 
the pine trees in the sample are infested by the pine beetle. Can scientists conclude that 
exactly 35% of all the pine trees on the west side of the park are infested? Why or why 
not? 


iPhones Suppose 1000 iPhones are produced at a factory today. Management would like 
to ensure that the phones’ display screens meet their quality standards before shipping 
them to retail stores. Because it takes about 10 minutes to inspect an individual phone’s 
display screen, managers decide to inspect a sample of 20 phones from the day’s 
production. 


17. 


18. 


19. 


20. 


21. 


a. Explain why it would be difficult for managers to inspect an SRS of 20 iPhones that are 
produced today. 


b. An eager employee suggests that it would be easy to inspect the last 20 iPhones that 
were produced today. Why isn’t this a good idea? 


c. Another employee recommends a different sampling method: Randomly choose one of 
the first 50 iPhones produced. Inspect that phone and every fiftieth iPhone produced 
afterward. (This method is known as systematic random sampling.) Explain carefully 
why this sampling method is not an SRS. 


No tipping The owner of a large restaurant is considering a new “no tipping” policy and 
wants to survey a sample of employees. The policy would add 20% to the cost of food and 
beverages and the additional revenue would be distributed equally among servers and 
kitchen staff. Describe how to select a stratified random sample of approximately 30 
employees. Explain your choice of strata and why stratified random sampling might be 
preferred in this context. 


Parking on campus The director of student life at a university wants to estimate the 
proportion of undergraduate students who regularly park a car on campus. Describe how 
to select a stratified random sample of approximately 100 students. Explain your choice of 
strata and why stratified random sampling might be preferred in this context. 


SRS of engineers? A corporation employs 2000 male and 500 female engineers. A 
stratified random sample of 200 male and 50 female engineers gives every individual in 
the population the same chance to be chosen for the sample. Is itan SRS? Explain your 
answer. 


SRS of students? At a party, there are 30 students over age 21 and 20 students under age 
21. You choose at random 3 of those over 21 and separately choose at random 2 of those 
under 21 to interview about their attitudes toward alcohol. You have given every student 
at the party the same chance to be interviewed. Is your sample an SRS? Explain your 
answer. 


pg 231 |») How is your room? A hotel has 30 floors with 40 rooms per floor. The 


rooms on one side of the hotel face the water, while rooms on the other side face a golf 
course. There is an extra charge for the rooms with a water view. The hotel manager wants 
to select 120 rooms and survey the registered guest in each of the selected rooms about his 
or her overall satisfaction with the property. 


a. Describe how to obtain a sample of 120 rooms using stratified random sampling. 
Explain your choice of strata and why this method might be preferred to simple random 
sampling. 


b. Describe how to obtain a sample of 120 rooms using cluster sampling. Explain your 
choice of clusters and why this method might be preferred to simple random sampling. 


22. 


de 


24. 


Go Blue! Michigan Stadium, also known as “The Big House,” seats over 100,000 fans for 
a football game. The University of Michigan Athletic Department wants to survey fans 
about concessions that are sold during games. Tickets are most expensive for seats on the 
sidelines. The cheapest seats are in the end zones (where one of the authors sat as a 
student). A map of the stadium is shown. 


Visitor Sideline 


ENDZONE 


1 


Michigan Sideline 


MS Sideline 
[1] Corner 
[3 Endzone 


a. Describe how to obtain a sample using stratified random sampling. Explain your choice 
of strata and why this method might be preferred to simple random sampling. 


b. Describe how to obtain a sample using cluster sampling. Explain your choice of 


clusters and why this method might be preferred to simple random sampling. 


High-speed Internet Laying fiber-optic cable is expensive. Cable companies want to 
make sure that if they extend their lines to less dense suburban or rural areas, there will be 
sufficient demand so the work will be cost-effective. They decide to conduct a survey to 
determine the proportion of households in a rural subdivision that would buy the service. 
They select a simple random sample of 5 blocks in the subdivision and survey each family 
that lives on one of those blocks. 


a. What is the name for this kind of sampling method? 


b. Give a possible reason why the cable company chose this method. 


Timber! A lumber company wants to estimate the proportion of trees in a large forest that 
are ready to be cut down. They use an aerial map to divide the forest into 200 equal-sized 
rectangles. Then they choose a random sample of 20 rectangles and examine every tree 
that’s in one of those rectangles. 


a. What is the name for this kind of sampling method? 


b. Give a possible reason why the lumber company chose this method. 


20. 


26. 


a7, 


28. 


29. 
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31. 


32. 


33. 


Eating on campus The director of student life at a small college wants to know what 
percent of students eat regularly in the cafeteria. To find out, the director selects an SRS of 
300 students who live in the dorms. Describe how undercoverage might lead to bias in this 
study. Explain the likely direction of the bias. 


Immigration reform A news organization wants to know what percent of U.S. residents 
support a “pathway to citizenship” for people who live in the United States illegally. The 
news organization randomly selects registered voters for the survey. Describe how 
undercoverage might lead to bias in this study. Explain the likely direction of the bias. 


Reporting weight loss A total of 300 people participated in a free 12-week weight-loss 
course at a community health clinic. After one year, administrators emailed each of the 
300 participants to see how much weight they had lost since the end of the course. Only 
56 participants responded to the survey. The mean weight loss for this sample was 13.6 
pounds. Describe how nonresponse might lead to bias in this study. Explain the likely 
direction of the bias. 


Nonresponse A survey of drivers began by randomly sampling from all listed residential 
telephone numbers in the United States. Of 45,956 calls to these numbers, 5029 were 
completed. The goal of the survey was to estimate how far people drive, on average, per 
day.42 Describe how nonresponse might lead to bias in this study. Explain the likely 
direction of the bias. 


pg 234 ») Running red lights An SRS of 880 drivers was asked: “Recalling the last 


ten traffic lights you drove through, how many of them were red when you entered the 
intersections?” Of the 880 respondents, 171 admitted that at least one light had been red. 
A practical problem with this survey is that people may not give truthful answers. Explain 
the likely direction of the bias. 


Seat belt use A study in El Paso, Texas, looked at seat belt use by drivers. Drivers were 
observed at randomly chosen convenience stores. After they left their cars, they were 
invited to answer questions that included questions about seat belt use. In all, 75% said 
they always used seat belts, yet only 61.5% were wearing seat belts when they pulled into 
the store parking lots Explain why the two percentages are so different. 


Boys don’t cry? Two female statistics students asked a random sample of 60 high school 
boys if they have ever cried during a movie. Thirty of the boys were asked directly and the 
other 30 were asked anonymously by means of a “secret ballot.” When the responses were 
anonymous, 63% of the boys said “Yes,” whereas only 23% of the other group said “Yes.” 
Explain why the two percentages are so different. 


Weight? Wait what? Marcos asked a random sample of 50 mall shoppers for their 
weight. Twenty-five of the shoppers were asked directly and the other 25 were asked 
anonymously by means of a “secret ballot.” The mean reported weight was 13 pounds 
heavier for the anonymous group. Explain why the two means are so different.1Z 


Wording bias Comment on each of the following as a potential sample survey question. 


Is the question clear? Is it slanted toward a desired response? 


a. “Some cell phone users have developed brain cancer. Should all cell phones come with 
a warning label explaining the danger of using cell phones?” 


b. “Do you agree that a national system of health insurance should be favored because it 
would provide health insurance for everyone and would reduce administrative costs?” 


c. “In view of escalating environmental degradation and incipient resource depletion, 
would you favor economic incentives for recycling of resource—intensive consumer 
goods?” 


34. Checking for bias Comment on each of the following as a potential sample survey 
question. Is the question clear? Is it slanted toward a desired response? 


a. Which of the following best represents your opinion on gun control? 
i. The government should confiscate our guns. 
ii. We have the right to keep and bear arms. 


b. A freeze in nuclear weapons should be favored because it would begin a much-needed 
process to stop everyone in the world from building nuclear weapons now and reduce 
the possibility of nuclear war in the future. Do you agree or disagree? 


Multiple Choice Select the best answer for Exercises 35—40. 


35. A popular website places opinion poll questions next to many of its news stories. Simply 
click your response to join the sample. One of the questions was “Do you plan to diet this 
year?” More than 30,000 people responded, with 68% saying “Yes.” Which of the 
following is true? 


a. About 68% of Americans planned to diet. 


b. The poll used a convenience sample, so the results tell us little about the population of 
all adults. 


c. The poll uses voluntary response, so the results tell us little about the population of all 
adults. 


d. The sample is too small to draw any conclusion. 


e. None of these. 


36. To gather information about the validity of a new standardized test for 10th-grade students 
in a particular state, a random sample of 15 high schools was selected from the state. The 
new test was administered to every 10th-grade student in the selected high schools. What 
kind of sample is this? 


a. A simple random sample 


b. A stratified random sample 
c. A cluster sample 
d. A systematic random sample 


e. A voluntary response sample 


37. Your statistics class has 30 students. You want to ask an SRS of 5 students from your 
class whether they use a mobile device for the online quizzes. You label the students 01, 
02, ... , 30. You enter the table of random digits at this line: 


14459 26056 31424 80371 65103 62253 22490 61181 
Your SRS contains the students labeled 


a. 14, 45, 92, 60, 56. 
b. 14, 31, 03, 10, 22. 
c. 14, 03, 10, 22, 22. 
d. 14, 03, 10, 22, 06. 
e. 14, 03, 10, 22, 11. 


38. Suppose that 35% of the voters in a state are registered as Republicans, 40% as 
Democrats, and 25% as Independents. A newspaper wants to select a sample of 1000 
registered voters to predict the outcome of the next election. If it randomly selects 350 
Republicans, randomly selects 400 Democrats, and randomly selects 250 Independents, 
did this sampling procedure result in a simple random sample of registered voters from 
this state? 


a. Yes, because each registered voter had the same chance of being chosen. 

b. Yes, because random chance was involved. 

c. No, because not all registered voters had the same chance of being chosen. 

d. No, because a different number of registered voters was selected from each party. 


e. No, because not all possible groups of 1000 registered voters had the same chance of 
being chosen. 


39. A local news agency conducted a survey about unemployment by randomly dialing phone 
numbers during the work day until it gathered responses from 1000 adults in its state. In 
the survey, 19% of those who responded said they were not currently employed. In reality, 
only 6% of the adults in the state were not currently employed at the time of the survey. 
Which of the following best explains the difference in the two percentages? 


a. The difference is due to sampling variability. We shouldn’t expect the results of a 
random sample to match the truth about the population every time. 


b. The difference is due to response bias. Adults who are employed are likely to lie and 
say that they are unemployed. 


c. The difference is due to undercoverage bias. The survey included only adults and did 
not include teenagers who are eligible to work. 


d. The difference is due to nonresponse bias. Adults who are employed are less likely to 
be available for the sample than adults who are unemployed. 


e. The difference is due to voluntary response. Adults are able to volunteer as a member 
of the sample. 


40. A simple random sample of 1200 adult Americans is selected, and each person is asked 
the following question: “In light of the huge national deficit, should the government at this 
time spend additional money to send humans to Mars?” Only 39% of those responding 
answered “Yes.” This survey 


a. is reasonably accurate because it used a large simple random sample. 
b. needs to be larger because only about 24 people were drawn from each state. 
c. probably understates the percent of people who favor sending humans to Mars. 


d. is very inaccurate but neither understates nor overstates the percent of people who favor 
sending humans to Mars. Because simple random sampling was used, it is unbiased. 


e. probably overstates the percent of people who favor sending humans to Mars. 


Recycle and Review 


41. Don’t turn it over (3.2) How many points do turnovers cost teams in the NFL? The 
scatterplot shows the relationship between x = number of turnovers and y = number of 
points scored by teams in the NFL during 2015, along with the least-squares regression 
line y\=460.2-4.084xY = 460.2 — 4.0842 


Points scored 


14 16 18 20 22 24 26 28 30 32 34 
Number of turnovers 


a. Interpret the slope of the regression line in context. 
b. For this regression line, s = 57.3. Interpret this value. 


c. Calculate and interpret the residual for the San Francisco 49ers, who had 17 turnovers 
and scored 238 points. 


d. How does the point for the 49ers affect the least-squares regression line and standard 
deviation of the residuals? Explain your answer. 


42. Internet charges (2.1) Some Internet service providers (ISPs) charge companies based on 
how much bandwidth they use in a month. One method that ISPs use to calculate 
bandwidth is to find the 95th percentile of a company’s usage based on samples of 
hundreds of 5-minute intervals during a month. 


a. Explain what “95th percentile” means in this setting. 


b. Is it possible to determine the z-score for a usage total that is at the 95th percentile? If 
so, find the z-score. If not, explain why not. 


SECTION 4.2 Experiments 


LEARNING TARGETS By the end of the section, you should be able to: 


e Explain the concept of confounding and how it limits the ability to make cause- 
and-effect conclusions. 


Distinguish between an observational study and an experiment, and identify the 
explanatory and response variables in each type of study. 


Identify the experimental units and treatments in an experiment. 
Describe the placebo effect and the purpose of blinding in an experiment. 


Describe how to randomly assign treatments in an experiment using slips of 
paper, technology, or a table of random digits. 


Explain the purpose of comparison, random assignment, control, and replication 
in an experiment. 


Describe a completely randomized design for an experiment. 


Describe a randomized block design and a matched pairs design for an 
experiment and explain the purpose of blocking in an experiment. 


A sample survey aims to gather information about a population without disturbing the 
population in the process. Sample surveys are one kind of observational study. Other 
observational studies record the behavior of animals in the wild or track the medical history of 
volunteers to look for associations between variables such as type of diet, amount of exercise, 
and blood pressure. 


DEFINITION Observational study 


An observational study observes individuals and measures variables of interest 
but does not attempt to influence the responses. 


Section 4.2 is about statistical designs for experiments, a very different way to produce data. 


Observational Studies Versus Experiments 


Is taking a vitamin D supplement good for you? Hundreds of observational studies have looked 
at the relationship between vitamin D concentration in a person’s blood and various health 
outcomes.48 In one observational study, researchers found that teenage girls with higher 
vitamin D intakes were less likely to suffer broken bones.12 Other observational studies have 


shown that people with higher vitamin D concentration have less cardiovascular disease, better 
cognitive function, and less risk of diabetes than people with lower concentrations of vitamin 
Ly 


Tatiana Popova/Shutterstock.com 
In the observational studies involving vitamin D and diabetes, the explanatory variable is 


vitamin D concentration in the blood and the response variable is diabetes status—whether or 
not the person developed diabetes. 


DEFINITION Response variable, Explanatory variable 


A response variable measures an outcome of a study. An explanatory variable 
may help explain or predict changes in a response variable. 


Unfortunately, it is very difficult to show that taking vitamin D causes a lower risk of 
diabetes using an observational study. As shown in the table, there are many possible 
differences between the group of people with high vitamin D concentration and the group of 
people with low vitamin D concentration. Any of these differences could be causing the 
difference in diabetes risk between the two groups of people. 


Variable Group 1 Group 2 
Vitamin D concentration High vitamin D Low vitamin D 
(explanatory) concentration concentration 
Quality of diet Better diet Worse diet 
Amount of exercise More exercise Less exercise 
Amount of vitamin supplementation More likely to take other Less likely to take other 
vitamins vitamins 
Diabetes status (response) Less likely to have diabetes More likely to have diabetes 


For example, it is possible that people who have healthier diets eat lots of foods that are 
high in vitamin D. Likewise, it is possible that people with healthier diets are less likely to 
develop diabetes. Vitamin D concentration may not have anything to do with diabetes status, 
even though there is an association between the two variables. In this case, we say there is 


confounding between vitamin D concentration and diet because we cannot tell which variable 
is causing the change in diabetes status. 


Some people call a variable that results in confounding, like diet in this case, a confounding 
variable. 


DEFINITION Confounding 


Confounding occurs when two variables are associated in such a way that their 
effects on a response variable cannot be distinguished from each other. 


AP® EXAM TIP 


If you are asked to identify a possible confounding variable in a given setting, you 
are expected to explain how the variable you choose (1) is associated with the 
explanatory variable and (2) is associated with the response variable. 


Likewise, because sun exposure increases vitamin D concentration, it is possible that 
people who exercise a lot outside have higher concentrations of vitamin D. If people who 
exercise a lot are also less likely to get diabetes, then amount of exercise and vitamin D 
concentration are confounded—we can’t say which variable is the cause of the smaller diabetes 
risk. 


EXAMPLE | Smoking and ADHD (>) 
Confounding 


Ai/Getty Images 


PROBLEM: In a study of more than 4700 children, researchers from Cincinnati Children’s 
Hospital Medical Center found that those children whose mothers smoked during pregnancy 


were more than twice as likely to develop ADHD as children whose mothers had not 
smoked.22 Explain how confounding makes it unreasonable to conclude that a mother’s 
smoking during pregnancy causes an increase in the risk of ADHD in her children based on 
this study. 


SOLUTION: 


It ts possible that the mothers who smoked during pregnancy were also more likely to have 
unhealthy diets. af ‘people with unhealthy diets are also more likely to have children with 
ADHD, then it could be that unhealthy diets caused the increase in ADHD risk, not 
smoking. 


Notice that the solution describes how diet might be associated with the explanatory 


variable (smoking status) and with the response variable (ADHD status). 
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Observational studies of the effect of an explanatory variable on a response variable often 
fail because of confounding between the explanatory variable and one or more other variables. 
In contrast to observational studies, experiments don’t just observe individuals or ask them 
questions. They actively impose some treatment to measure the response. Experiments can 
answer questions like “Does aspirin reduce the chance of a heart attack?” and “Can yoga help 
dogs live longer?” 


DEFINITION Experiment 


An experiment deliberately imposes some treatment on individuals to measure 
their responses. 


To determine if taking vitamin D actually causes a reduction in diabetes risk, researchers in 
Norway performed an experiment. The researchers randomly assigned 500 people with pre- 
diabetes to either take a high dose of vitamin D or to take a placebo—a pill that looked exactly 
like the vitamin D supplement but contained no active ingredient. After 5 years, about 40% of 
the people in each group were diagnosed with diabetes.2! In other words, the association 
between vitamin D concentration and diabetes status disappeared when comparing two groups 
that were roughly the same to begin with. 


DEFINITION Placebo 


A placebo is a treatment that has no active ingredient, but is otherwise like other 
treatments. 


The experiment in Norway avoided confounding by letting chance decide who took vitamin 
D and who didn’t. That way, people with healthier diets were split about evenly between the 
two groups. So were people who exercise a lot and people who take other vitamins. When our 
goal is to understand cause and effect, experiments are the only source of fully convincing 
data. For this reason, the distinction between observational study and experiment is one of the 
most important in statistics. 


EXAMPLE | Facebook and financial incentives (>) 
Observational studies and experiments 


Rostislav_Sedlacek/Shutterstock.com 


PROBLEM: In each of the following settings, identify the explanatory and response 
variables. Then determine if each is an experiment or an observational study. Explain your 
reasoning. 


a. Ina study conducted by researchers at the University of Texas, people were asked about 
their social media use and satisfaction with their marriage. Of the heavy social media 
users, 32% had thought seriously about leaving their spouse. Only 16% of non-social 


media users had thought seriously about leaving their spouse.22 


b. Ina diet study using 100 overweight volunteers, 50 volunteers were randomly assigned to 
receive weight-loss counseling, monthly weigh-ins, and a three-month gym pass. The 
other 50 volunteers were given financial incentives (earning $20 for losing 4 pounds in a 
month or paying $20 otherwise) along with the counseling, weigh-ins, and gym pass. The 
group with the financial incentives lost 6.7 more pounds, on average.22 


SOLUTION: 


a. Explanatory variable: Frequency of social media use. ‘Response variable: Marital 
satisfaction. This ts an observational study because people weren't assigned to use social 
media or not. 


b. Explanatory variable: Whether or not financial incentives were given. ‘Response 
variable: Amount of weight lost. This is an experiment because researchers gave some 


volunteers financial incentives and did not give financial incentives to the other 
volunteers. 


In part (a), the response variable is not the percent who thought about leaving their 
spouse. This percent is a Summary of all the responses. Likewise, in part (b), the 


response variable is not the average weight loss. This average is a Summary of all the 
responses. 
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In part (a) of the example, it would be incorrect to conclude that using social media causes 
marital dissatisfaction. It could be that other variables are confounded with social media use— 
or even that marital dissatisfaction is causing increased social media use. In part (b), it is 
reasonable to conclude that the financial incentives caused the increase in weight loss because 
this was a well-designed experiment. 


CHECK YOUR UNDERSTANDING 


. 
J 


1. Does reducing screen brightness increase battery life in laptop computers? To find out, 
researchers obtained 30 new laptops of the same brand. They chose 15 of the computers at 
random and adjusted their screens to the brightest setting. The other 15 laptop screens 
were left at the default setting—moderate brightness. Researchers then measured how 
long each machine’s battery lasted. Was this an observational study or an experiment? 
Justify your answer. 

Questions 2-4 refer to the following setting. Does eating dinner with their families 
improve students’ academic performance? According to an ABC News article, “Teenagers 
who eat with their families at least five times a week are more likely to get better grades in 
school.”24 This finding was based on a sample survey conducted by researchers at 
Columbia University. 


2. Was this an observational study or an experiment? Justify your answer. 
3. What are the explanatory and response variables? 


4. Explain clearly why such a study cannot establish a cause-and-effect relationship. Suggest 
a variable that may be confounded with whether families eat dinner together. 


The Language of Experiments 


An experiment is a statistical study in which we actually do something (a treatment) to people, 
animals, or objects (the experimental units or subjects) to observe the response. 


DEFINITION Treatment, Experimental unit, Subjects 


A specific condition applied to the individuals in an experiment is called a 
treatment. If an experiment has several explanatory variables, a treatment is a 
combination of specific values of these variables. An experimental unit is the 
object to which a treatment is randomly assigned. When the experimental units 
are human beings, they are often called subjects. 


The best way to learn the language of experiments is to practice using it. 


EXAMPLE | How can we prevent malaria? (>) 
Vocabulary of experiments 


PROBLEM: Malaria causes hundreds of thousands of deaths each year, with many of the 
victims being children. Will regularly screening children for the malaria parasite and treating 
those who test positive reduce the proportion of children who develop the disease? 
Researchers worked with children in 101 schools in Kenya, randomly assigning half of the 
schools to receive regular screenings and follow-up treatments and the remaining schools to 
receive no regular screening. Children at all 101 schools were tested for malaria at the end of 
the study.22 Identify the treatments and the experimental units in this experiment. 


SOLUTION: 


This experiment compares two treatments: (1) regular screenings and follow-up treatments 
and Gee regular screening. The experimental units are 101 schools in ‘Kenya. 


Note that the experimental units are the schools, not the students. The decision about 


who to screen was made school by school, not student by student. All students at the 
same school received the same treatment. 
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In the malaria experiment, there was one explanatory variable: screening status. In other 
experiments, there are multiple explanatory variables. Sometimes, these explanatory variables 
are called factors. In an experiment with multiple factors, the treatments are formed using the 
various levels of each of the factors. 


DEFINITION Factor, Levels 


In an experiment, a factor is a variable that is manipulated and may cause a 
change in the response variable. The different values of a factor are called levels. 


Here’s an example of a multifactor experiment. 


EXAMPLE |The five-second rule (>) 
Experiments with multiple explanatory variables 


Zaia Snively 


PROBLEM: Have you ever dropped a tasty piece of food on the ground, then quickly 
picked it up and eaten it? If so, you probably thought about the “five-second rule,” which 
states that a piece of food is safe to eat if it has been on the floor less than 5 seconds. The 
rule is based on the belief that bacteria need time to transfer from the floor to the food. But 
does it work? 

Researchers from Rutgers University put the five-second rule to the test. They used four 
different types of food: watermelon, bread, bread with butter, and gummy candy. They 
dropped the food onto four different surfaces: stainless steel, ceramic tile, wood, and carpet. 
And they waited for four different lengths of time: less than 1 second, 5 seconds, 30 seconds, 
and 300 seconds. Finally, they used bacteria prepared two different ways: in a tryptic soy 
broth or peptone buffer. Once the bacteria were ready, the researchers spread them out on the 
different surfaces and started dropping food.2® 


a. List the factors in this experiment and the number of levels for each factor. 


b. If the researchers used every possible combination to form the treatments, how many 
treatments were included in the experiment? 


c. List two of the treatments. 
SOLUTION: 
a. T ype of ‘food (4 levels), type of surface (4 levels), amount of time (4 levels), method of 


bacterial preparation (2 levels) 
6. 4X4X4X2=128 different treatments 


C. Watermelon/stainless steel/| less than 1 second/tryptic soy broth; gummy 
candy) wood /3 OO seconds/peptone buffer 
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What did the researchers discover? The wetter foods had greater bacterial transfer and food 
dropped on carpet had the least bacterial transfer. There was greater bacterial transfer the 
longer the food was on the surface, although there was some transfer that happened almost 
instantaneously. Overall, the researchers concluded that the type of food and type of surface 
were at least as important as the amount of time the food remained on the surface. 


This example shows how experiments allow us to study the combined effect of several 
factors. The interaction of several factors can produce effects that could not be predicted from 
looking at the effect of each factor alone. For example, although longer time was associated 
with more bacterial transfer in general, this relationship might not be true for very moist food. 


Designing Experiments: Comparison 


Experiments are the preferred method for examining the effect of one variable on another. By 
imposing the specific treatment of interest and controlling other influences, we can pin down 
cause and effect. Good designs are essential for effective experiments, just as they are for 
sampling. To see why, let’s start with an example of a bad experimental design. 


Zaia Snively 


Does caffeine affect pulse rate? Many students regularly consume caffeine to help them 
stay alert. So it seems plausible that taking caffeine might increase an individual’s pulse rate. Is 


this true? One way to investigate this claim is to ask volunteers to measure their pulse rates, 
drink some cola with caffeine, measure their pulse rates again after 10 minutes, and calculate 
the increase in pulse rate. 


This experiment has a very simple design. A group of subjects (the students) were exposed 
to a treatment (the cola with caffeine), and the outcome (change in pulse rate) was observed. 
Here is the design: 


Students = Cola with caffeine — Change in pulse rate 
Students — Cola with caffeine —» Change in pulse rate 


Unfortunately, even if the pulse rate of every student went up, we couldn’t attribute the 
increase to caffeine. Perhaps the excitement of being in an experiment made their pulse rates 
increase. Maybe it was the sugar in the cola and not the caffeine. Perhaps their teacher told 
them a funny joke during the 10-minute waiting period and made everyone laugh. In other 
words, there are many other variables that are potentially confounded with taking caffeine. 


Many laboratory experiments use a design like the one in the caffeine example: 


Experimental units . Treatment — Measure response 
Experimental units — Treatment — Measure response 


In the lab environment, simple designs often work well. Field experiments and experiments 
with animals or people deal with more varied conditions. Outside the lab, badly designed 
experiments often yield worthless results because of confounding. 


The remedy for the confounding in the caffeine example is to do a comparative experiment 
with two groups: one group that receives caffeine and a control group that does not receive 
caffeine. 


DEFINITION Control group 


In an experiment, a control group is used to provide a baseline for comparing the 
effects of other treatments. Depending on the purpose of the experiment, a control 
group may be given an inactive treatment (placebo), an active treatment, or no 
treatment at all. 


In all other aspects, these groups should be treated exactly the same so that the only 
difference is the caffeine. That way, if there is convincing evidence of a difference in the 
average increase in pulse rates, we can safely conclude it was caused by the caffeine. This 
means that one group could get regular cola with caffeine, while the control group gets 
caffeine-free cola. Both groups would get the same amount of sugar, so sugar consumption 
would no longer be confounded with caffeine intake. Likewise, both groups would experience 
the same events during the experiment, so what happens during the experiment won’t be 
confounded with caffeine intake either. 


EXAMPLE | Preventing malaria (>) 
Control groups 


Alexander Joe/Getty Images 


PROBLEM: In an earlier example, we described an experiment in which researchers 
randomly assigned 101 schools in Kenya to either receive regular screenings and follow-up 
treatments or to receive no regular screenings. Explain why it was necessary to include a 
control group of schools that didn’t receive regular screenings. 


SOLUTION: 
The purpose of the control group 15 tO provide a baseline for comparing the effect of the 


regular screenings and ‘follow-up treatments. Otherwise, researchers wouldn't be able to 
determine ie a decrease in malaria rates was due to the treatment or some other change 
that occurred during the experiment (like a drought that killed off mosquitos, slowing the 
spread of malaria). 
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A control group was essential in the malaria experiment to determine if screening was 
effective. However, not all experiments include a control group—as long as comparison takes 
place. In the experiment about the five-second rule, there were 128 different treatments being 
compared and no control group. A control group wasn’t essential in this experiment because 
researchers were interested in comparing different amounts of time on the floor, different types 
of food, and different types of surfaces. 


Designing Experiments: Blinding and the Placebo Effect 


In the caffeine experiment, we used comparison to help prevent confounding. But even when 
there is comparison, confounding is still possible. If the subjects in the experiment know what 
type of soda they are receiving, the expectations of the two groups will be different. The 
knowledge that a subject is receiving caffeine may increase his or her pulse rate, apart from the 
caffeine itself. This is an example of the placebo effect. 


DEFINITION Placebo effect 


The placebo effect describes the fact that some subjects in an experiment will 
respond favorably to any treatment, even an inactive treatment. 


In one study, researchers zapped the wrists of 24 test subjects with a painful jolt of 
electricity. Then they rubbed a cream with no active medicine on subjects’ wrists and told them 
the cream should help soothe the pain. When researchers shocked them again, 8 subjects said 
they experienced significantly less pain.22 When the ailment is psychological, like depression, 
some experts think that the placebo effect accounts for about three-quarters of the effect of the 
most widely used drugs.28 


Because of the placebo effect, it is important that subjects don’t know what treatment they 
are receiving. It is also better if the people interacting with the subjects and measuring the 
response variable don’t know which subjects are receiving which treatment. When neither 
group knows who is receiving which treatment, the experiment is double-blind. Other 
experiments are single-blind. 


DEFINITION Double-blind, Single-blind 


In a double-blind experiment, neither the subjects nor those who interact with 
them and measure the response variable know which treatment a subject 
received. 


In a single-blind experiment, either the subjects don’t know which treatment they 
are receiving or the people who interact with them and measure the response 
variable don’t know which subjects are receiving which treatment. 


The idea of a double-blind design is simple. Until the experiment ends and the results are 
in, only the study’s statistician knows for sure which treatment a subject is receiving. However, 
some experiments cannot be carried out in a double-blind manner. For example, if researchers 
are comparing the effects of exercise and dieting on weight loss, then subjects will know which 
treatment they are receiving. Such an experiment can still be single-blind if the individuals who 
are interacting with the subjects and measuring the response variable don’t know who is dieting 
and who is exercising. In other single-blind experiments, the subjects are unaware of which 
treatment they are receiving, but the people interacting with them and measuring the response 
variable do know. 


EXAMPLE |Do magnets repel pain? (>) 
Blinding and the placebo effect 


Eric O’Connell/Getty Images 


PROBLEM: Early research showed that magnetic fields affected living tissue in humans. 
Some doctors have begun to use magnets to treat patients with chronic pain. Scientists 
wondered if this type of therapy really worked. They designed a double-blind experiment to 
find out. A total of 50 patients with chronic pain were recruited for the study. A doctor 
identified a painful site on each patient and asked him or her to rate the pain on a scale from 
0 (mild pain) to 10 (severe pain). Then the doctor selected a sealed envelope containing a 
magnet at random from a box with a mixture of active and inactive magnets. The chosen 
magnet was applied to the site of the pain for 45 minutes. After being treated, each patient 
was again asked to rate the level of pain from 0 to 10.72 


a. Explain what it means for this experiment to be double-blind. 


b. Why was it important for this experiment to be double-blind? 
SOLUTION: 


a. Neither the subjects nor the doctors applying the magnets and recording the pain 
ratings knew which subjects had the active magnets and which had the inactive 
magnets. 


b. af subjects knew they were recetving an active treatment, researchers wouldn’t know if 
any improvement was due to the magnets or to the expectation of getting better (the 
placebo effect). af the doctors knew which subjects received which treatments, they might 
treat one group of subjects differently from the other group. This would make it 


difficult to know if the magnets were the cause of any improvement. 
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CHECK YOUR UNDERSTANDING 


. 


Y A new analysis is casting doubt on a claimed benefit of omega-3 fish oil. For years, 
doctors have been recommending eating fish and taking fish oil supplements to prevent heart 
disease. But the new analysis reviewed 20 previous studies and showed that the effects of 
omega-3 aren’t as great as once suspected. One reason is that an early trial of omega-3 
supplements was conducted as an open-label study.22 In this type of study, both patients and 
researchers know who is receiving which treatment. 


1. Describe a potential problem with an open-label study in this context. 
2. Describe how you can fix the problem identified in Question 1. 


Designing Experiments: Random Assignment 


Comparison alone isn’t enough to produce results we can trust. If the treatments are given to 
groups that differ greatly when the experiment begins, confounding will result. If we allow 
students to choose what type of cola they will drink in the caffeine experiment, students who 
consume caffeine on a regular basis might be more likely to choose the regular cola. Due to 
their caffeine tolerance, these students’ pulse rates might not increase as much as other 
students’ pulse rates. In this case, caffeine tolerance would be confounded with the amount of 
caffeine consumed, making it impossible to conclude cause and effect. 

To create roughly equivalent groups at the beginning of an experiment, we use random 
assignment to determine which experimental units get which treatment. 


DEFINITION Random assignment 


In an experiment, random assignment means that experimental units are 
assigned to treatments using a chance process. 


Let’s look at how random assignment can be used to improve the design of the caffeine 
experiment. 


EXAMPLE | Caffeine and pulse rates (>) 


How random assignment works 


Zaia Snively 


PROBLEM: A total of 20 students have agreed to participate in an experiment comparing 
the effects of caffeinated cola and caffeine-free cola on pulse rates. Describe how you would 
randomly assign 10 students to each of the two treatments: 


a. Using 20 identical slips of paper 
b. Using technology 
c. Using Table D 


SOLUTION: 


a. On10 slips of ‘paper, write the letter “A”: on the remaining 10 slips, write the letter “B.” 
S huffle the slips of ‘paper and hand out one slip of ‘paper to each volunteer. Students who 
get an “A” slip receive the cola with caffeine and students who get a Be slip receive the 
cola without caffeine. 


When describing a method of random assignment, don’t stop after creating the 


groups. Make sure to identify which group gets which treatment. 


b. Label each student with a different integer from 1 to 20. Then randomly generate 10 
different integers from 1 to 20. The students with these labels receive the cola with 
caffeine. The remaining 10 students receive the cola without caffeine. 


When using a random number generator or a table of random digits to assign 


treatments, make sure to account for the possibility of repeated numbers when 
describing your method. 


c. Label each student with a different integer from 01 to 20. Go toa line of Table D and 
read two-digit groups moving from left to right. The first 10 different labels between 01 
and 20 identify the 10 students who receive cola with caffeine. The remaining 10 
students receive the caffeine-free cola. Ignore groups of digits from 21 to Oo. 


FOR PRACTICE, TRY EXERCISE 63 


Random assignment should distribute the students who regularly consume caffeine in 
roughly equal numbers to each group. It should also balance out the students with high 
metabolism and those with larger body sizes in the caffeine and caffeine-free groups. Random 
assignment helps ensure that the effects of other variables (e.g., caffeine tolerance, metabolism, 
or body size) are spread evenly among the two groups. 


Designing Experiments: Control 


Although random assignment should create two groups of students that are roughly equivalent 
to begin with, we still have to ensure that the only consistent difference between the groups 
during the experiment is the type of cola they receive. We can control the effects of some 
variables by keeping them the same for both groups. For example, we should make both 
treatments contain the same amount of sugar. If one group got regular cola and the other group 
got caffeine-free diet cola, then the amount of sugar would be confounded with the amount of 
caffeine—we wouldn’t know if it was the sugar or the caffeine that was causing a change in 
pulse rates. 


DEFINITION Control 


In an experiment, control means keeping other variables constant for all 
experimental units. 


We also want to control other variables to reduce the variability in the response variable. 
Suppose we let volunteers in both groups choose how much cola they want to drink. In that 
case, the changes in pulse rate would be more variable than if we made sure each subject drank 
the same amount of soda. Letting the amount of cola vary will make it harder to determine if 
caffeine is really having an effect on pulse rates. 


The dotplots on the left show the results of an experiment in which the amount of cola 
consumed was the same for all participating students. Because there is so little overlap in these 
graphs, it seems clear that caffeine increases pulse rates. The dotplots on the right show the 
results of an experiment in which the students were permitted to choose how much or how little 
cola they consumed. Notice that the centers of the distributions haven’t changed, but the 
distributions are much more variable. The increased overlap in the graphs makes the evidence 
supporting the effect of caffeine less convincing. 
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Change in (keeping amount of soda constant) pulse rate Change in (letting amount of soda vary) pulse rate 


After randomly assigning treatments and controlling other variables, the two groups should 
be about the same, except for the treatments. Then a difference in the average change in pulse 
rate must be due either to the treatments themselves—or to the random assignment. We can’t 
say that any difference between the average pulse rate changes for students in the two groups 
must be caused by the difference in caffeine. There would be some difference, even if both 
groups received the same type of cola, because the random assignment is unlikely to produce 
two groups that are exactly equivalent with respect to every variable that might affect pulse 
rate. 


Designing Experiments: Replication 


Would you trust an experiment with just one student in each group? No, because the results 
would depend too much on which student was assigned to the caffeinated cola. However, if we 
randomly assign many subjects to each group, the effects of chance will balance out, and there 
will be little difference in the average responses in the two groups—unless the treatments 
themselves cause a difference. This is the idea of replication. 


DEFINITION Replication 


In an experiment, replication means using enough experimental units to 
distinguish a difference in the effects of the treatments from chance variation due 
to the random assignment. 


Oo In statistics, replication means “use enough subjects.” In other fields, the term 


replication has a different meaning. In these fields, replication means conducting an 
experiment in one setting and then having other investigators conduct a similar experiment in a 
different setting. That is, replication means repeatability. 


Experiments: Putting It All Together 


The following box summarizes the four key principles of experimental design: comparison, 
random assignment, control, and replication. 


PRINCIPLES OF EXPERIMENTAL DESIGN 


The basic principles for designing experiments are as follows: 
1. Comparison. Use a design that compares two or more treatments. 


2. Random assignment. Use chance to assign experimental units to treatments. Doing so 
helps create roughly equivalent groups of experimental units by balancing the effects of 
other variables among the treatment groups. 


3. Control. Keep other variables the same for all groups, especially variables that are likely 
to affect the response variable. Control helps avoid confounding and reduces variability in 
the response variable. 


4. Replication. Use enough experimental units in each group so that any differences in the 
effects of the treatments can be distinguished from chance differences between the 


groups. 


Let’s see how these principles were used in designing a famous medical experiment. 


EXAMPLE |The Physicians’ Health Study (>) 
Principles of experimental design 


ES | 
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PROBLEM: Does regularly taking aspirin help protect people against heart attacks? The 
Physicians’ Health Study was a medical experiment that helped answer this question. In fact, 
the Physicians’ Health Study looked at the effects of two drugs: aspirin and beta-carotene. 
Researchers wondered if beta-carotene would help prevent some forms of cancer. The 
subjects in this experiment were 21,996 male physicians. There were two explanatory 
variables (factors), each having two levels: aspirin (yes or no) and beta-carotene (yes or no). 
Combinations of the levels of these factors form the four treatments shown in the diagram. 
One-fourth of the subjects were assigned at random to each of these treatments. 


On odd-numbered days, the subjects took either a tablet that contained aspirin or a 
placebo that looked and tasted like the aspirin but had no active ingredient. On even- 
numbered days, they took either a capsule containing beta-carotene or a placebo. There were 
several response variables—the study looked for heart attacks, several kinds of cancer, and 
other medical outcomes. After several years, 239 of the placebo group but only 139 of the 
aspirin group had suffered heart attacks. This difference is large enough to give good 


evidence that taking aspirin does reduce heart attacks.2! It did not appear, however, that 
beta-carotene had any effect on preventing cancer. 
Factor 2: Beta-carotene 
Yes No 


Yes q , 1 q 3 


Aspirin Placebo 
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Placebo Beta-carotene Placebo Placebo 


a. Explain how this experiment used comparison. 
b. Explain the purpose of randomly assigning the physicians to the four treatments. 


c. Name two variables that were controlled in this experiment and why it was beneficial to 
control these variables. 


d. Explain how this experiment used replication. What is the purpose of replication in this 
context? 


SOLUTION: 


a. Researchers used a design that compared each of the active treatments to a placebo. 


6b. Random assignment helped ensure that the four groups of physicians were roughly 
equivalent at the beginning of the experiment. 


c. The experiment used subjects of the same gender and same occupation. ‘Using only male 
physicians helps to reduce the variability in the response variables. 


d. There were over 5000 subjects in each treatment group. This helped ensure that the 
difference in heart attacks was due to the aspirin and not to chance variation in the 
random assignment. 


If women and people with other occupations were included, the results might be more 
variable, making it harder to determine the effects of aspirin and beta-carotene. 


However, using only male physicians means we don’t know how females or other 
males would respond to these treatments. 
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The Physicians’ Health Study shows how well-designed experiments can yield good 


evidence that differences in the treatments cause the differences we observe in the response. 


Reports in medical journals regularly begin with words like these from a study of a flu vaccine 
given as a nose spray: “This study was a randomized, double-blind, placebo-controlled trial. 
Participants were enrolled from 13 sites across the continental United States between mid- 
September and mid-November.”22 Doctors are supposed to know what this means. Now you 


know, too. 


CHECK YOUR UNDERSTANDING 


\ 


y Many utility companies have introduced programs to encourage energy conservation 
among their customers. An electric company considers placing small digital displays in 
households to show current electricity use and what the cost would be if this use continued 
for a month. Will the displays reduce electricity use? One cheaper approach is to give 
customers a chart and information about monitoring their electricity use from their outside 
meter. Would this method work almost as well? The company decides to conduct an 
experiment using 60 households to compare these two approaches (display, chart) with a 
group of customers who receive information about energy consumption but no help in 
monitoring electricity use. 
1. Explain why it was important to have a control group that didn’t get the display or the 
chart. 


2. Describe how to randomly assign the treatments to the 60 households. 


3. What is the purpose of randomly assigning treatments in this context? 


Completely Randomized Designs 


The diagram in Figure 4.1 presents the details of the caffeine experiment: random assignment, 
the sizes of the groups and which treatment they receive, and the response variable. This type 


of design is called a completely randomized design. 
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FIGURE 4.1 Outline of a completely randomized design to compare caffeine and no caffeine. 


DEFINITION Completely randomized design 


In a completely randomized design, the experimental units are assigned to the 
treatments completely by chance. 


Although there are good statistical reasons for using treatment groups that are about equal 
in size, the definition of a completely randomized design does not require that each treatment 
be assigned to an equal number of experimental units. It does specify that the assignment of 
treatments must occur completely at random. 


EXAMPLE | Chocolate milk and concussions (>) 
Completely randomized design 


Ann Heath 


PROBLEM: “Concussion-Related Measures Improved in High School Football Players 
Who Drank New Chocolate Milk” announced a recent headline.22 In the study, researchers 
compared a group of concussed football players given a new type of chocolate milk with a 
group of concussed football players who received no treatment. 


a. Explain why it isn’t reasonable to conclude that the new type of chocolate milk is 
effective for treating high school football players with concussions based on this study. 


b. To test the effectiveness of the new type of chocolate milk, you recruit 50 high school 
football players who suffered a concussion in the previous 24 hours to participate in an 
experiment. Write a few sentences describing a completely randomized design for this 
experiment. 


SOLUTION: 


a. It ts possible that the group who received the new type of chocolate milk improved 
because they knew they were being treated and expected to get better, not because of the 


new chocolate milk. 


6. Number the players from 1 to 50. Use a random number generator to produce 25 
different integers from 1 to So and give the new type of chocolate milk to the players 
with these numbers. Give regular chocolate milk to the remaining 25 players. Compare 
the concussion-related measures for the two groups. 
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AP® EXAM TIP 


If you are asked to describe a completely randomized design, stay away from 
flipping coins. For example, suppose we ask each student in the caffeine 
experiment to toss a coin. If it’s heads, then the student will drink the cola with 
caffeine. If it’s tails, then the student will drink the caffeine-free cola. As long as all 
20 students toss a coin, this is still a completely randomized design. Of course, the 
two groups are unlikely to contain exactly 10 students because it is unlikely that 20 
coin tosses will result in a perfect 50-50 split between heads and tails. 

The problem arises if we try to force the two groups to have equal sizes. 
Suppose we continue to have students toss coins until one of the groups has 10 
students and then place the remaining students in the other group. In this case, the 
last two students in line are very likely to end up in the same group. However, ina 
completely randomized design, the last two subjects should only have a 50% 
chance of ending up in the same group. 


Randomized Block Designs 


Completely randomized designs are the simplest statistical designs for experiments. They 
illustrate clearly the principles of comparison, random assignment, control, and replication. But 
just as with sampling, there are times when the simplest method doesn’t yield the most precise 
results. When a population consists of groups of individuals that are “similar within but 
different between,” a stratified random sample gives a better estimate than a simple random 
sample. This same logic applies in experiments. 


Suppose that a mobile phone company is considering two different keyboard designs (A 
and B) for its new smartphone. The company decides to perform an experiment to compare the 
two keyboards using a group of 10 volunteers. The response variable is typing speed, measured 
in words per minute. 


How should the company address the fact that four of the volunteers already use a 
smartphone, whereas the remaining six volunteers do not? They could use a completely 
randomized design and hope that the random assignment distributes the smartphone users and 
non-smartphone users about evenly between the group using keyboard A and the group using 


keyboard B. Even so, there might be a lot of variability in typing speed within both treatment 
groups because some members of each treatment group are more familiar with smartphones 
than the others. This additional variability might make it difficult to detect a difference in the 
effectiveness of the two keyboards. What should the researchers do? 


Zaia Snively 


Because the company knows that experience with smartphones will affect typing speed, 
they could start by separating the volunteers into two groups—one with experienced 
smartphone users and one with inexperienced smartphone users. Each of these groups of 
similar subjects is known as a block. Within each block, the company could then randomly 
assign half of the subjects to use keyboard A and the other half to use keyboard B. To control 
other variables, each subject should be given the same passage to type while in a quiet room 
with no distractions. This randomized block design helps account for the variation in typing 
speed that is due to experience with smartphones. 


DEFINITION Block, Randomized block design 


A block is a group of experimental units that are known before the experiment to 
be similar in some way that is expected to affect the response to the treatments. 


In a randomized block design, the random assignment of experimental units to 
treatments is carried out separately within each block. 


Figure 4.2 outlines the randomized block design for the smartphone experiment. The 
subjects are first separated into blocks based on their experience with smartphones. Then the 
two treatments are randomly assigned within each block. 
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FIGURE 4.2 Outline of a randomized block design for the smartphone experiment. The blocks 
consist of volunteers who have used smartphones and volunteers who have not used 
smartphones. The treatments are keyboard A and keyboard B. 


Using a randomized block design allows us to account for the variation in the response that 
is due to the blocking variable of smartphone experience. This makes it easier to determine if 
one treatment is really more effective than the other. 


To see how blocking helps, let’s look at the results of the smartphone experiment. In the 
block of 4 smartphone users, 2 were randomly assigned to use keyboard A and the other 2 were 
assigned to use keyboard B. Likewise, in the block of 6 non-smartphone users, 3 were 
randomly assigned to use keyboard A and the other 3 were assigned to use keyboard B. Each of 


the 10 volunteers typed the same passage and the typing speed was recorded. Here are the 
results: 
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There is some evidence that keyboard A results in higher typing speeds, but the evidence 
isn’t that convincing. Enough overlap occurs in the two distributions that the differences might 
simply be due to the chance variation in the random assignment. 

If we compare the results for the two keyboards within each block, however, a different 
story emerges. Among the 4 smartphone users (indicated by the red squares), keyboard A was 
the clear winner. Likewise, among the 6 non- smartphone users (indicated by the black dots), 
keyboard A was also the clear winner. 


= Smartphone user ¢ Non-smartphone user 


A e cy © a 5 
—) 
= 
J 
co 
s B ee ° 2 a 
15 20 25 30 35 40 45 50 


Typing speed (wpm) 


The overlap in the first set of dotplots was due almost entirely to the variation in 
smartphone experience—smartphone users were generally faster than non-smartphone users, 
regardless of which keyboard they used. In fact, the average typing speed for the smartphone 
users was 40, while the average typing speed for non-smartphone users was only 26, a 
difference of 14 words per minute. To account for the variation created by the difference in 
smartphone experience, let’s subtract 14 from each of the typing speeds in the block of 
smartphone users to “even the playing field.” Here are the results: 
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Because we accounted for the variation due to the difference in smartphone experience, the 
variation in each of the distributions has been reduced. There is now much less overlap 
between the two distributions, meaning that the evidence in favor of keyboard A is much more 
convincing. When blocks are formed wisely, it is easier to find convincing evidence that one 
treatment is more effective than another. 


AP® EXAM TIP 


Don’t mix the language of experiments and the language of sample surveys or other 
observational studies. You will lose credit for saying things like “use a randomized 
block design to select the sample for this survey” or “this experiment suffers from 
nonresponse because some subjects dropped out during the study.” 


The idea of blocking is an important additional principle of experimental design. A wise 
experimenter will form blocks based on the most important unavoidable sources of variation 
among the experimental units. In other words, the experimenter will form blocks using the 
variables that are the best predictors of the response variable. Random assignment will then 
average out the effects of the remaining other variables and allow a fair comparison of the 
treatments. The moral of the story is: control what you can, block on what you can’t control, 
and randomize to create comparable groups. 


EXAMPLE | Should | use the popcorn button? (>) 


Blocking in an experiment 


Ann Health 


PROBLEM: A popcorn lover wants to determine if it is better to use the “popcorn button” 
on her microwave oven or use the amount of time recommended on the bag of popcorn. To 
measure how well each method works, she will count the number of unpopped kernels 
remaining after popping. To obtain the experimental units, she goes to the store and buys 10 
bags each of 4 different varieties of microwave popcorn (butter, cheese, natural, and kettle 
corn), for a total of 40 bags. 


a. Describe a randomized block design for this experiment. Justify your choice of blocks. 


b. Explain why a randomized block design might be preferable to a completely randomized 
design for this experiment. 


SOLUTION: 


a. Form blocks based on variety, because the number of unpopped kernels is likely to differ 
by variety. Randomly assign 5 bags of each variety to the popcorn button treatment 
and 5 to the timed treatment by placing all 10 bags of a particular variety ina large 
box. Shake the box, pick 5 bags without looking, and assign them to be popped using the 
popcorn button. The remaining 5 bags will be popped using the instructions on the 
bags. ‘Repeat this process for the remaining 3 varieties. After popping each of the 40 
bags in random order, count the number of unpopped kernels in each bag and compare 
the results within each variety. Then combine the results from the 4 varieties after 
accounting for the difference in average response for each variety. 


It is important to pop the bags in random order so that changes over time (e.g., 
temperature, humidity) aren’t confounded with the explanatory variable. For 
example, if the 20 “popcorn button” bags are popped last when the room 


temperature is greater, we wouldn't know if using the popcorn button or the 
warmer temperature was the cause of a difference in the number of unpopped 
kernels. 


b. A randomized block design accounts for the variability in the number of unpopped 


kernels created by the different varieties of ‘popcorn (butter, cheese, natural, kettle). This 
makes it easier to determine if using the microwave button is more effective for reducing 


the number of unpopped kernels. 
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Another way to address the variability in unpopped kernels created by the different varieties 
is to use only one variety of popcorn in the experiment. Because variety of popcorn is no longer 
a variable, it will not be a source of variability. Of course, this means that the results of the 
experiment only apply to that one variety of popcorn—not ideal for the popcorn lover in the 
example! 


MATCHED PAIRS DESIGN A common type of randomized block design for comparing two 
treatments is a matched pairs design. The idea is to create blocks by matching pairs of similar 
experimental units. The random assignment of subjects to treatments is done within each 
matched pair. Just as with other forms of blocking, matching helps account for the variation 
due to the variable(s) used to form the pairs. 


DEFINITION Matched pairs design 


A matched pairs design is a common experimental design for comparing two 
treatments that uses blocks of size 2. In some matched pairs designs, two very 
similar experimental units are paired and the two treatments are randomly 
assigned within each pair. In others, each experimental unit receives both 
treatments in a random order. 


Suppose we want to investigate if listening to classical music while taking a math test 
affects performance. A total of 30 students in a math class volunteer to take part in the 
experiment. The difference in mathematical ability among the volunteers is likely to create 
additional variation in the test scores, making it harder to see the effect of classical music. To 
account for this variation, we could pair the students by their grade in the class—the two 
students with the highest grades are paired together, the two students with the next highest 
grades are paired together, and so on. Within each pair, one student is randomly assigned to 
take a math test while listening to classical music and the other member of the pair is assigned 
to take the math test in silence. 


J-Elgaard/Getty Images 


Sometimes, each “pair” in a matched pairs design consists of just one experimental unit that 
gets both treatments in random order. In the experiment about the effect of listening to classical 
music, we could have each student take a math test in both conditions. To decide the order, we 
might flip a coin for each student. If the coin lands on heads, the student takes a math test with 
classical music playing today and a similar math test without music playing tomorrow. If it 
lands on tails, the student does the opposite—no music today and classical music tomorrow. 


Randomizing the order of treatments is important to avoid confounding. Suppose everyone 
did the classical music treatment on the first day and the no-music treatment on the second day, 
but the air conditioner wasn’t working on the second day. We wouldn’t know if any difference 
in mean test score was due to the difference in treatment or the difference in room temperature. 


EXAMPLE | Will an additive improve my mileage? (>) 
Matched pairs design 


PROBLEM: A consumer organization wants to know if using a certain fuel additive 
increases the fuel efficiency (in miles per gallon, or mpg) of cars. A total of 20 cars of 
different types are available for testing. Design an experiment that uses a matched pairs 
design to investigate this question. Explain your method of pairing. 


SOLUTION: 


Give each car both treatments. It is reasonable to think that some cars are more fuel 
efficient than others, so using each car as its own “pair” accounts for the variation in fuel 
efficiency in the experimental units. For each car, randomly assign the order in which the 
treatments are assigned by flipping a coin. Heads indicates using the additive first and no 
additive second. Tails indicates using no additive first and then the additive second. For 
each car, record the fuel efficiency (mpg) after using each treatment. 
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In the preceding example, it is also possible to form pairs of two similar cars. For instance, 
we could pair together the two most fuel-efficient cars, the next two most fuel-efficient cars, 
and so on. This is less ideal, however, because there will still be some differences between the 
members of each pair that may cause additional variation in the results. Using the same car 
twice creates perfectly matched “pairs,” and it also doubles the number of pairs used in the 
experiment. Both these features make it easier to find convincing evidence that the gas additive 
is effective, if it really is effective. 


CHECK YOUR UNDERSTANDING 


q 


y Researchers would like to design an experiment to compare the effectiveness of three 
different advertisements for a new television series featuring the works of Jane Austen. 
There are 300 volunteers available for the experiment. 


1. Describe a completely randomized design to compare the effectiveness of the three 
advertisements. 


2. Describe a randomized block design for this experiment. Justify your choice of blocks. 


3. Why might a randomized block design be preferable in this context? 


Section 4.2 Summary 


e Statistical studies often try to show that changing one variable (the explanatory variable) 
causes changes in another variable (the response variable). Variables are confounded when 
their effects on a response variable can’t be distinguished from each other. 


e We can produce data to answer specific questions using observational studies or 
experiments. An observational study gathers data on individuals as they are. Experiments 
actively do something to people, animals, or objects in order to measure their response. 
Experiments are the best way to show cause and effect. 

e In an experiment, we impose one or more treatments on a group of experimental units 
(sometimes called subjects if they are human). Each treatment is a combination of the levels 
of the explanatory variables (also called factors). 

e Some experiments give a placebo (fake treatment) to a control group. That helps prevent 
confounding due to the placebo effect, whereby some patients get better because they expect 
the treatment to work. 


e Many behavioral and medical experiments are double-blind. That is, neither the subjects nor 
those interacting with them and measuring their responses know who is receiving which 


treatment. If one group knows and the other doesn’t, then the experiment is single-blind. 
e The basic principles of experimental design are: 
« Comparison: Use a design that compares two or more treatments. 


: Random assignment: Use chance (slips of paper, a random number generator, a table of 
random digits) to assign experimental units to treatments. This helps create roughly 
equivalent groups before treatments are imposed. 


: Control: Keep other variables the same for all groups. Control helps avoid confounding 
and reduces the variation in responses, making it easier to decide if a treatment is effective. 


« Replication: Impose each treatment on enough experimental units so that the effects of the 
treatments can be distinguished from chance differences between the groups. 


e Ina completely randomized design, the experimental units are assigned to the treatments 
completely by chance. 


e A randomized block design forms groups (blocks) of experimental units that are similar 
with respect to a variable that is expected to affect the response. Treatments are assigned at 
random within each block. Responses are then compared within each block and combined 
with the responses of other blocks after accounting for the differences between the blocks. 
When blocks are chosen wisely, it is easier to determine if one treatment is more effective 
than another. 


e A matched pairs design is a common form of randomized block design for comparing two 
treatments. In some matched pairs designs, each subject receives both treatments in a random 
order. In others, two very similar subjects are paired, and the two treatments are randomly 
assigned within each pair. 


Section 4.2 Exercises 


43. pg 243 ®) Good for the gut? Is fish good for the gut? Researchers tracked 22,000 
Ne] 


male physicians for 22 years. Those who reported eating seafood of any kind at least 5 
times per week had a 40% lower risk of colon cancer than those who said they ate seafood 
less than once a week. Explain how confounding makes it unreasonable to conclude that 
eating seafood causes a reduction in the risk of colon cancer, based on this study.24 


44. Straight A’s now, healthy later A study by Pamela Herd of the University of Wisconsin— 
Madison found a link between high school grades and health. Analyzing data from the 
Wisconsin Longitudinal Study, which has tracked the lives of thousands of Wisconsin 
high school graduates from the class of 1957, Herd found that students with higher grade- 
point averages were more likely to say they were in excellent or very good health in their 
early 60s. Explain how confounding makes it unreasonable to conclude that people will 
live healthier lives if they increase their GPA, based on this study.22 


45. pg 244 |») Snacking and TV Does the type of program people watch influence how 


much they eat? A total of 94 college students were randomly assigned to one of three 
treatments: watching 20 minutes of a Hollywood action movie (The Island), watching the 
same 20-minute excerpt of the movie with no sound, and watching 20 minutes of an 
interview program (Charlie Rose). While watching, participants were given snacks 
(M&M’S®, cookies, carrots, and grapes) and allowed to eat as much as they wanted. 
Subjects who watched the highly stimulating excerpt from The Island ate 65% more 
calories than subjects who watched Charlie Rose. Participants who watched the silent 
version of The Island ate 46% more calories than those who watched Charlie Rose.2® 
Identify the explanatory and response variables in this study. Then determine if it is an 
experiment or an observational study. Explain your reasoning. 


46. Learning biology with computers An educator wants to compare the effectiveness of 
computer software for teaching biology with that of a textbook presentation. She gives a 
biology pretest to each student in a group of high school juniors, then randomly divides 
them into two groups. One group uses the computer, and the other studies the text. At the 
end of the year, she tests all the students again and compares the increase in biology test 
scores in the two groups. Identify the explanatory and response variables in this study. 
Then determine if it is an experiment or an observational study. Explain your reasoning. 


47. Child care and aggression A study of child care enrolled 1364 infants and followed them 
through their sixth year in school. Later, the researchers published an article in which they 
stated that “the more time children spent in child care from birth to age 4, the more 
adults tended to rate them, both at age 4% and at kindergarten, as less likely to get along 


with others, as more assertive, as disobedient, and as aggressive.”22 


a. What are the explanatory and response variables? 
b. Is this an observational study or an experiment? Justify your answer. 


c. Does this study show that child care makes children more aggressive? Explain your 
reasoning. 


48. Chocolate and happy babies A University of Helsinki (Finland) study wanted to 
determine if chocolate consumption during pregnancy had an effect on infant temperament 
at age 6 months. Researchers began by asking 305 healthy pregnant women to report their 
chocolate consumption. Six months after birth, the researchers asked mothers to rate their 
infants’ temperament using the traits of smiling, laughter, and fear. The babies born to 
women who had been eating chocolate daily during pregnancy were found to be more 
active and “positively reactive”—a measure that the investigators said encompasses traits 
like smiling and laughter.22 


a. What are the explanatory and response variables? 


b. Was this an observational study or an experiment? Justify your answer. 


49. 


50. 


ol. 


52. 


53. 


c. Does this study show that eating chocolate regularly during pregnancy helps produce 
infants with good temperament? Explain your reasoning. 


pg 246 ») Growing in the shade The ability to grow in shade may help pine trees 


found in the dry forests of Arizona to resist drought. How well do these pines grow in 
shade? Investigators planted pine seedlings in a greenhouse in either full light, light 
reduced to 25% of normal by shade cloth, or light reduced to 5% of normal. At the end of 
the study, they dried the young trees and weighed them. Identify the experimental units 
and the treatments. 


Sealing your teeth Many children have their molars sealed to help prevent cavities. In an 
experiment, 120 children aged 6—8 were randomly assigned to a control group, a group in 
which sealant was applied and reapplied periodically for 36 months, and a group in which 
fluoride varnish was applied and reapplied periodically for 42 months. After 9 years, the 
percent of initially healthy molars with cavities was calculated for each group.22 Identify 
the experimental units and the treatments. 


pg 246 ») Improving response rate How can we reduce the rate of refusals in 


telephone surveys? Most people who answer at all listen to the interviewer’s introductory 
remarks and then decide whether to continue. One study made telephone calls to randomly 
selected households to ask opinions about the next election. In some calls, the interviewer 
gave her name; in others, she identified the university she was representing; and in still 
others, she identified both herself and the university. For each type of call, the interviewer 
either did or did not offer to send a copy of the final survey results to the person 
interviewed. 


a. List the factors in this experiment and state how many levels each factor has. 


b. If the researchers used every possible combination to form the treatments, how many 
treatments were included in the experiment? 


c. List two of the treatments. 


Fabric science A maker of fabric for clothing is setting up a new line to “finish” the raw 
fabric. The line will use either metal rollers or natural-bristle rollers to raise the surface of 
the fabric; a dyeing-cycle time of either 30 or 40 minutes; and a temperature of either 
150°C or 175°C. Three specimens of fabric will be subjected to each treatment and scored 
for quality. 


a. List the factors in this experiment and state how many levels each factor has. 


b. If the researchers used every possible combination to form the treatments, how many 
treatments were included in the experiment? 


c. List two of the treatments. 


Want a snack? Can snacking on fruit rather than candy reduce later food consumption? 


54. 


50. 


56. 


57. 


58. 


59. 


Researchers randomly assigned 12 women to eat either 65 calories of berries or 65 
calories of candy. Two hours later, all 12 women were given an unlimited amount of pasta 
to eat. The researchers recorded the amount of pasta consumed by each subject. The 
women who ate the berries consumed 133 fewer calories, on average. Identify the 
explanatory and response variables, the experimental units, and the treatments. 


Pricey pizza? The cost of a meal might affect how customers evaluate and appreciate 
food. To investigate, researchers worked with an Italian all-you-can-eat buffet to perform 
an experiment. A total of 139 subjects were randomly assigned to pay either $4 or $8 for 
the buffet and then asked to rate the quality of the pizza on a 9-point scale. Subjects who 
paid $8 rated the pizza 11% higher than those who paid only $4.22 Identify the 
explanatory and response variables, the experimental units, and the treatments. 


pg 248 |») Oils and inflammation The extracts of avocado and soybean oils have been 


shown to slow cell inflammation in test tubes. Will taking avocado and soybean 
unsaponifiables (called ASU) help relieve pain for subjects with joint stiffness due to 
arthritis? In an experiment, 345 men and women were randomly assigned to receive either 
300 milligrams of ASU daily for three years or a placebo daily for three years.*! Explain 
why it was necessary to include a control group in this experiment. 


Supplements for testosterone As men age, their testosterone levels gradually decrease. 
This may cause a reduction in energy, an increase in fat, and other undesirable changes. 
Do testosterone supplements reverse some of these effects? A study in the Netherlands 
assigned 237 men aged 60 to 80 with low or low-normal testosterone levels to either a 
testosterone supplement or a placebo.42 Explain why it was necessary to include a control 
group in this experiment. 


Cocoa and blood flow A study of blood flow involved 27 healthy people aged 18 to 72. 
Each subject consumed a cocoa beverage containing 900 milligrams of flavonols daily for 
5 days. Using a finger cuff, blood flow was measured on the first and fifth days of the 
study. After 5 days, researchers measured what they called “significant improvement” in 
blood flow and the function of the cells that line the blood vessels.42 What flaw in the 
design of this experiment makes it impossible to say if the cocoa really caused the 
improved blood flow? Explain your answer. 


Reducing unemployment Will cash bonuses speed the return to work of unemployed 
people? A state department of labor notes that last year 68% of people who filed claims 
for unemployment insurance found a new job within 15 weeks. As an experiment, this 
year the state offers $500 to people filing unemployment claims if they find a job within 
15 weeks. The percent who do so increases to 77%. What flaw in the design of this 
experiment makes it impossible to say if the bonus really caused the increase? Explain 
your answer. 


pg 250 |») More oil and inflammation Refer to Exercise 55. Could blinding be used 


in this experiment? Explain your reasoning. Why is blinding an important consideration in 


60. 


61. 


62. 


63. 


this context? 


More testosterone Refer to Exercise 56. Could blinding be used in this experiment? 
Explain your reasoning. Why is blinding an important consideration in this context? 


Meditation for anxiety An experiment that claimed to show that meditation lowers 
anxiety proceeded as follows. The experimenter interviewed the subjects and rated their 
level of anxiety. Then the subjects were randomly assigned to two groups. The 
experimenter taught one group how to meditate and they meditated daily for a month. The 
other group was simply told to relax more. At the end of the month, the experimenter 
interviewed all the subjects again and rated their anxiety level. The meditation group now 
had less anxiety. Psychologists said that the results were suspect because the ratings were 
not blind. Explain what this means and how lack of blindness could affect the reported 
results. 


Side effects Even if an experiment is double-blind, the blinding might be compromised if 
side effects of the treatments differ. For example, suppose researchers at a skin-care 
company are comparing their new acne treatment against that of the leading competitor. 
Fifty subjects are assigned at random to each treatment, and the company’s researchers 
will rate the improvement for each of the 100 subjects. The researchers aren’t told which 
subjects received which treatments, but they know that their new acne treatment causes a 
slight reddening of the skin. How might this knowledge compromise the blinding? Explain 
why this is an important consideration in the experiment. 


pg 251 |») Layoffs and “survivor guilt” Workers who survive a layoff of other 


employees at their location may suffer from “survivor guilt.” A study of survivor guilt and 
its effects used as subjects 120 students who were offered an opportunity to earn extra 
course credit by doing proofreading. Each subject worked in the same cubicle as another 
student, who was an accomplice of the experimenters. At a break midway through the 
work, one of three things happened: 


Treatment 1: The accomplice was told to leave; it was explained that this was because she 
performed poorly. 


Treatment 2: It was explained that unforeseen circumstances meant there was only enough 
work for one person. By “chance,” the accomplice was chosen to be laid off. 


Treatment 3: Both students continued to work after the break. 


The subjects’ work performance after the break was compared with their performance 
before the break. Overall, subjects worked harder when told the other student’s dismissal 
was random.“4 Describe how you would randomly assign the subjects to the treatments 


a. using slips of paper. 
b. using technology. 


c. using Table D. 


64. Precise offers People often use round prices as first offers in a negotiation. But would a 


65. 


66. 


67. 


more precise number suggest that the offer was more reasoned and informed? In an 
experiment, 238 adults played the role of a person selling a used car. Each adult received 
one of three initial offers: $2000, $1865 (a precise under-offer), and $2135 (a precise 
over-offer). After hearing the initial offer, each subject made a counter-offer. The 
difference in the initial offer and counter-offer was the largest in the group that received 
the $2000 offer.“2 Describe how the researchers could have randomly assigned the 
subjects to the treatments 


a. using slips of paper. 
b. using technology. 


c. using Table D. 


Stronger players A football coach hears that a new exercise program will increase upper- 
body strength better than lifting weights. He is eager to test this new program in the off- 
season with the players on his high school team. The coach decides to let his players 
choose which of the two treatments they will undergo for 3 weeks—exercise or weight 
lifting. He will use the number of push-ups a player can do at the end of the experiment as 
the response variable. Which principle of experimental design does the coach’s plan 
violate? Explain how this violation could lead to confounding. 


Killing weeds A biologist would like to determine which of two brands of weed killer, X 
or Y, is less likely to harm the plants in a garden at the university. Before spraying near 
the plants, the biologist decides to conduct an experiment using 24 individual plants. 
Which of the following two plans for randomly assigning the treatments should the 
biologist use? Why? 

Plan A: Choose the 12 healthiest-looking plants. Then flip a coin. If it lands heads, apply 
Brand X weed killer to these plants and Brand Y weed killer to the remaining 12 plants. If 
it lands tails, do the opposite. 

Plan B: Choose 12 of the 24 plants at random. Apply Brand X weed killer to those 12 
plants and Brand Y weed killer to the remaining 12 plants. 


pg 254 |») Boosting preemies Do blood-building drugs help brain development in 


babies born prematurely? Researchers randomly assigned 53 babies, born more than a 
month premature and weighing less than 3 pounds, to one of three groups. Babies either 
received injections of erythropoietin (EPO) three times a week, darbepoetin once a week 
for several weeks, or no treatment. Results? Babies who got the medicines scored much 
better by age 4 on measures of intelligence, language, and memory than the babies who 
received no treatment.4® 


a. Explain how this experiment used comparison. 
b. Explain the purpose of randomly assigning the babies to the three treatments. 


c. Name two variables that were controlled in this experiment and why it was beneficial to 


68. 


69. 


70. 


control these variables. 


d. Explain how this experiment used replication. What is the purpose of replication in this 
context? 


The effects of day care Does day care help low-income children stay in school and hold 
good jobs later in life? The Carolina Abecedarian Project (the name suggests the ABCs) 
has followed a group of 111 children for over 40 years. Back then, these individuals were 
all healthy but low-income black infants in Chapel Hill, North Carolina. All the infants 
received nutritional supplements and help from social workers. Half were also assigned at 
random to an intensive preschool program. Results? Children who were assigned to the 
preschool program had higher IQ’s, higher standardized test scores, and were less likely to 
repeat a grade in school.44 


a. Explain how this experiment used comparison. 
b. Explain the purpose of randomly assigning the infants to the two treatments. 


c. Name two variables that were controlled in this experiment and why it was beneficial to 
control these variables. 


d. Explain how this experiment used replication. What is the purpose of replication in this 
context? 


pg 256 |») Treating prostate disease A large study used records from Canada’s 


national health care system to compare the effectiveness of two ways to treat prostate 
disease. The two treatments are traditional surgery and a new method that does not require 
surgery. The records described many patients whose doctors had chosen what method to 
use. The study found that patients treated by the new method were more likely to die 


within 8 years.48 


a. Further study of the data showed that this conclusion was wrong. The extra deaths 
among patients who were treated with the new method could be explained by other 
variables. What other variables might be confounded with a doctor’s choice of surgical 
or nonsurgical treatment? 


b. You have 300 prostate patients who are willing to serve as subjects in an experiment to 
compare the two methods. Write a few sentences describing a completely randomized 
design for this experiment. 


Diet soda and pregnancy A large study of 3000 Canadian children and their mothers 
found that the children of mothers who drank diet soda daily during pregnancy were twice 
as likely to be overweight at age 1 than children of mothers who avoided diet soda during 
pregnancy.*2 


a. A newspaper article about this study had the headline “Diet soda, pregnancy: Mix may 
fuel childhood obesity.” This headline suggests that there is a cause- and-effect 


relationship between diet soda consumption during pregnancy and the weight of the 
children 1 year after birth. However, this relationship could be explained by other 
variables. What other variables might be confounded with a mother’s consumption of 
diet soda during pregnancy? 


b. You have 300 pregnant mothers who are willing to serve as subjects in an experiment 
that compares three treatments during pregnancy: no diet soda, one diet soda per day, 
and two diet sodas per day. Write a few sentences describing a completely randomized 
design for this experiment. 


71. pg 259 |») A fruitful experiment A citrus farmer wants to know which of three 


72. 


73. 


fertilizers (A, B, and C) is most effective for increasing the number of oranges on his 
trees. He is willing to use 30 mature trees of various sizes from his orchard in an 
experiment with a randomized block design. 


a. Describe a randomized block design for this experiment. Justify your choice of blocks. 


b. Explain why a randomized block design might be preferable to a completely 
randomized design for this experiment. 


In the cornfield An agriculture researcher wants to compare the yield of 5 corn varieties: 
A, B, C, D, and E. The field in which the experiment will be carried out increases in 
fertility from north to south. The researcher therefore divides the field into 25 plots of 
equal size, arranged in 5 east—west rows of 5 plots each, as shown in the diagram. 


North 


a. Describe a randomized block design for this experiment. Justify your choice of blocks. 


b. Explain why a randomized block design might be preferable to a completely 
randomized design for this experiment. 


Doctors and nurses Nurse-practitioners are nurses with advanced qualifications who 
often act much like primary-care physicians. Are they as effective as doctors at treating 
patients with chronic conditions? An experiment was conducted with 1316 patients who 
had been diagnosed with asthma, diabetes, or high blood pressure. Within each condition, 
patients were randomly assigned to either a doctor or a nurse-practitioner. The response 
variables included measures of the patients’ health and of their satisfaction with their 
medical care after 6 months.22 


a. Which are the blocks in this experiment: the different diagnoses (asthma, diabetes, or 
high blood pressure) or the type of care (nurse or doctor)? Why? 


b. Explain why a randomized block design is preferable to a completely randomized 


design in this context. 


c. Suppose the experiment used only diabetes patients, but there were still 1316 subjects 


willing to participate. What advantage would this offer? What disadvantage? 


74, Comparing cancer treatments The progress of a type of cancer differs in women and 
men. Researchers want to design an experiment to compare three therapies for this cancer. 
They recruit 500 male and 300 female patients who are willing to serve as subjects. 


75. 


76. 


a. 


Which are the blocks in this experiment: the three cancer therapies or the two sexes? 
Why? 


. What are the advantages of a randomized block design over a completely randomized 


design using these 800 subjects? 


Suppose the researchers had 800 male and no female subjects available for the study. 
What advantage would this offer? What disadvantage? 


Aw, rats! A nutrition experimenter intends to compare the weight gain of newly weaned 
male rats fed Diet A with that of rats fed Diet B. To do this, she will feed each diet to 10 
rats. She has available 10 rats from one litter and 10 rats from a second litter. Rats in the 
first litter appear to be slightly healthier. 


a. 


C. 


If the 10 rats from Litter 1 were fed Diet A, then genetics and type of diet would be 
confounded. Explain this statement carefully. 


Describe how to randomly assign the rats to treatments using a randomized block 
design with litters as blocks. 


Use technology or Table D to carry out the random assignment. 


Comparing weight-loss treatments A total of 20 overweight females have agreed to 
participate in a study of the effectiveness of four weight-loss treatments: A, B, C, and D. 
The researcher first calculates how overweight each subject is by comparing the subject’s 
actual weight with her “ideal” weight. The subjects and their excess weights in pounds are 


as follows: 

Birnbaum 35 Hernandez 25 Moses 25 Smith 29 
Brown 34 Jackson 33 Nevesky 39 Stall 33 
Brunk 30 =Kendall 28 Obrach 30 ~=Tran Re 
Cruz 34 Loren 32 Rodriguez 30 =©Wilansky 42 


Deng 24 Mann 28 Santiago 27 Williams 22 


The response variable is the weight lost after 8 weeks of treatment. 


a. If the 5 most overweight women were assigned Treatment A, the next 5 most 


overweight women were assigned Treatment B, and so on, then the amount overweight 


and type of treatment would be confounded. Explain this statement carefully. 


b. Describe how to randomly assign the women to treatments using a randomized block 
design. Use blocks of size 4 formed by the amount overweight. 


c. Use technology or Table D to carry out the random assignment. 


7d: 260 _  ») SAT preparation A school counselor wants to compare the effectiveness of 
pg. ey prep 


78. 


rie 


an online SAT preparation program with an in-person SAT preparation class. For an 
experiment, the counselor recruits 30 students who have already taken the SAT once. The 
response variable will be the improvement in SAT score. 


a. Design an experiment that uses a completely randomized design to investigate this 
question. 


b. Design an experiment that uses a matched pairs design to investigate this question. 
Explain your method of pairing. 


c. Which design do you prefer? Explain your answer. 


Valve surgery Medical researchers want to compare the success rate of a new non- 
invasive method for replacing heart valves using a cardiac catheter with traditional open- 
heart surgery. They have 40 male patients, ranging in age from 55 to 75, who need valve 
replacement. One of several response variables will be the percentage of blood that flows 
backward—in the wrong direction—through the valve on each heartbeat. 


a. Design an experiment that uses a completely randomized design to investigate this 
question. 


b. Design an experiment that uses a matched pairs design to investigate this question. 
Explain your method of pairing. 


c. Which design do you prefer? Explain your answer. 


Look, Ma, no hands! Does talking on a hands-free cell phone distract drivers? 
Researchers recruit 40 student subjects for an experiment to investigate this question. 
They have a driving simulator equipped with a hands-free phone for use in the study. Each 
subject will complete two sessions in the simulator: one while talking on the hands-free 
phone and the other while just driving. The order of the two sessions for each subject will 
be determined at random. The route, driving conditions, and traffic flow will be the same 
in both sessions. 


a. What type of design did the researchers use in their study? 


b. Explain why the researchers chose this design instead of a completely randomized 
design. 


c. Why is it important to randomly assign the order of the treatments? 


d. Explain how and why researchers controlled for other variables in this experiment. 


80. Chocolate gets my heart pumping Cardiologists at Athens Medical School in Greece 
wanted to test if chocolate affects blood vessel function. The researchers recruited 17 
healthy young volunteers, who were each given a 3.5-ounce bar of dark chocolate, either 
bittersweet or fake chocolate. On another day, the volunteers received the other treatment. 
The order in which subjects received the bittersweet and fake chocolate was determined at 
random. The subjects had no chocolate outside the study, and investigators didn’t know if 
a subject had eaten the real or the fake chocolate. An ultrasound was taken of each 
volunteer’s upper arm to observe the functioning of the cells in the walls of the main 
artery. The researchers found that blood vessel function was improved when the subjects 
ate bittersweet chocolate, and that there were no such changes when they ate the placebo 
(fake chocolate).24 


a. What type of design did the researchers use in their study? 


b. Explain why the researchers chose this design instead of a completely randomized 
design. 


c. Why is it important to randomly assign the order of the treatments for the subjects? 


d. Explain how and why researchers controlled for other variables in this experiment. 


81. Got deodorant? A group of students wants to perform an experiment to determine 
whether Brand A or Brand B deodorant lasts longer. One group member suggests the 
following design: Recruit 40 student volunteers—20 male and 20 female. Separate by 
gender, because male and female bodies might respond differently to deodorant. Give all 
the males Brand A deodorant and all the females Brand B. Have the principal judge how 
well the deodorant is still working at the end of the school day on a 0 to 10 scale. Then 
compare ratings for the two treatments. 


a. Identify any flaws you see in the proposed design for this experiment. 


b. Describe how you would design the experiment. Explain how your design addresses 
each of the problems you identified in part (a). 


82. Close shave Which of two brands (X or Y) of electric razor shaves closer? Researchers 
want to design and carry out an experiment to answer this question using 50 adult male 
volunteers. Here’s one idea: Have all 50 subjects shave the left sides of their faces with the 
Brand X razor and shave the right sides of their faces with the Brand Y razor. Then have 
each man decide which razor gave the closer shave and compile the results. 


a. Identify any flaws you see in the proposed design for this experiment. 


b. Describe how you would design the experiment. Explain how your design addresses 
each of the problems you identified in part (a). 


Multiple Choice Select the best answer for Exercises 83-90. 


83. 


84. 


85. 


86. 


Can a vegetarian or low-salt diet reduce blood pressure? Men with high blood pressure are 
assigned at random to one of four diets: (1) normal diet with unrestricted salt; (2) 
vegetarian with unrestricted salt; (3) normal with restricted salt; and (4) vegetarian with 
restricted salt. This experiment has 


a. one factor, the type of diet. 

b. two factors, high blood pressure and type of diet. 

c. two factors, normal/vegetarian diet and unrestricted/restricted salt. 
d. three factors, men, high blood pressure, and type of diet. 


e. four factors, the four diets being compared. 


In the experiment of the preceding exercise, the subjects were randomly assigned to the 
different treatments. What is the most important reason for this random assignment? 


a. Random assignment eliminates the effects of other variables such as stress and body 
weight. 


b. Random assignment is a good way to create groups of subjects that are roughly 
equivalent at the beginning of the experiment. 


c. Random assignment makes it possible to make a conclusion about all men. 
d. Random assignment reduces the amount of variation in blood pressure. 


e. Random assignment prevents the placebo effect from ruining the results of the study. 


To investigate if standing up while studying affects performance in an algebra class, a 
teacher assigns half of the 30 students in his class to stand up while studying and assigns 
the other half to not stand up while studying. To determine who receives which treatment, 
the teacher identifies the two students who did best on the last exam and randomly assigns 
one to stand and one to not stand. The teacher does the same for the next two highest- 
scoring students and continues in this manner until each student is assigned a treatment. 
Which of the following best describes this plan? 


a. This is an observational study. 

b. This is an experiment with blocking. 

c. This is a completely randomized experiment. 
d. This is a stratified random sample. 


e. This is a cluster sample. 


A gardener wants to try different combinations of fertilizer (none, 1 cup, 2 cups) and 
mulch (none, wood chips, pine needles, plastic) to determine which combination produces 
the highest yield for a variety of green beans. He has 60 green-bean plants to use in the 


experiment. If he wants an equal number of plants to be assigned to each treatment, how 
many plants will be assigned to each treatment? 


a 1 
b. 3 
c. 4 
d. 5 


e. 12 


. Corn variety 1 yielded 140 bushels per acre last year at a research farm. This year, corn 
variety 2, planted in the same location, yielded only 110 bushels per acre. Based on these 
results, is it reasonable to conclude that corn variety 1 is more productive than corn variety 
2? 


a. Yes, because 140 bushels per acre is greater than 110 bushels per acre. 
b. Yes, because the study was done at a research farm. 


c. No, because there may be other differences between the two years besides the corn 
variety. 


d. No, because there was no use of a placebo in the experiment. 


e. No, because the experiment wasn’t double-blind. 


. A report in a medical journal notes that the risk of developing Alzheimer’s disease among 
subjects who regularly opted to take the drug ibuprofen was about half the risk of those 
who did not. Is this good evidence that ibuprofen is effective in preventing Alzheimer’s 
disease? 


a. Yes, because the study was a randomized, comparative experiment. 
b. No, because the effect of ibuprofen is confounded with the placebo effect. 
c. Yes, because the results were published in a reputable professional journal. 


d. No, because this is an observational study. An experiment would be needed to confirm 
(or not confirm) the observed effect. 


e. Yes, because a 50% reduction can’t happen just by chance. 


. A farmer is conducting an experiment to determine which variety of apple tree, Fuji or 
Gala, will produce more fruit in his orchard. The orchard is divided into 20 equally sized 
square plots. He has 10 trees of each variety and randomly assigns each tree to a separate 
plot in the orchard. What are the experimental unit(s) in this study? 


a. The trees 


b. The farmer 
c. The plots 
d. The orchard 


e. The apples 


90. Two essential features of all statistically designed experiments are 
a. comparing several treatments; using the double-blind method. 
b. comparing several treatments; using chance to assign subjects to treatments. 
c. always having a placebo group; using the double-blind method. 
d. using a block design; using chance to assign subjects to treatments. 


e. using enough subjects; always having a control group. 
Recycle and Review 


91. Seed weights (2.2) Biological measurements on the same species often follow a Normal 
distribution quite closely. The weights of seeds of a variety of winged bean are 
approximately Normal with mean 525 milligrams (mg) and standard deviation 110 mg. 


a. What percent of seeds weigh more than 500 mg? 
b. If we discard the lightest 10% of these seeds, what is the smallest weight among the 
remaining seeds? 


92. Comparing rainfall (1.3) The boxplots summarize the distributions of average monthly 
rainfall (in inches) for Tucson, Arizona, and Princeton, New Jersey.22 Compare these 
distributions. 


= Princeton 
| 
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Average monthly rainfall (in.) 


SECTION 4.3 Using Studies Wisely 


LEARNING TARGETS By the end of the section, you should be able to: 


e Explain the concept of sampling variability when making an inference about a 
population and how sample size affects sampling variability. 


e Explain the meaning of statistically significant in the context of an experiment 
and use simulation to determine if the results of an experiment are statistically 
significant. 


e Identify when it is appropriate to make an inference about a population and 
when it is appropriate to make an inference about cause and effect. 


e Evaluate if a statistical study has been carried out in an ethical manner! 


Researchers who conduct statistical studies often want to draw conclusions that go beyond the 
data they produce. Here are two examples. 


e The U.S. Census Bureau carries out a monthly Current Population Survey of about 60,000 
households. Their goal is to use data from these randomly selected households to estimate 
the percent of unemployed individuals in the population. 


e Scientists performed an experiment that randomly assigned 21 volunteer subjects to one of 
two treatments: sleep deprivation for one night or unrestricted sleep. The scientists hoped to 


show that sleep deprivation causes a decrease in performance two days later.22 


What conclusions can be drawn from a particular study? The answer depends on how the data 
were collected. 


Inference for Sampling 


When the members of a sample are selected at random from a population, we can use the 
sample results to infer things about the population. That is, we can make inferences about the 
population from which the sample was randomly selected. Inference from convenience samples 
or voluntary response samples would be misleading because these methods of choosing a 
sample are biased. In these cases, we are almost certain that the sample does not fairly 
represent the population. 


Even when making an inference from a random sample, it would be surprising if the 
estimate from the sample was exactly equal to the truth about the population. For example, in a 
random sample of 1399 U.S. teens aged 13-18, 26% reported more than 8 hours of 
entertainment media use per day. Because of sampling variability, it would be surprising if 
exactly 26% of all U.S. teens aged 13-18 reported more than 8 hours of entertainment media 
use per day. Why? Because different samples of 1399 U.S. teens aged 13-18 will include 


different sets of people and produce different estimates. 


DEFINITION Sampling variability 


Sampling variability refers to the fact that different random samples of the same 
size from the same population produce different estimates. 


The following activity explores the idea of sampling variability. 


ACTIVITY Exploring sampling variability 


When making an inference about a population from a random sample, we shouldn’t expect 
the estimate to be exactly correct. But how much do sample results vary? Your teacher has 
prepared a large population of beads, where 30% have a certain color (e.g., red) so you can 
explore this question. 


G. Curt Fiedler/Getty Images 


1. Ina moment, you will select a random sample of 20 beads. Do you expect that your 
sample will contain exactly 30% red beads? Explain your reasoning. 


2. Mix the beads thoroughly, select a random sample of 20 beads from the population, 
calculate the percent of red beads in the sample, and replace the beads in the population. 


3. After all students have selected a sample, make a class dotplot showing each student’s 
estimate for the percent of red beads. Where is the graph centered? How much does the 
percent of red beads vary? 


4. Imagine that you repeated Steps 2 and 3 with random samples of size 100. How do you 
expect the dotplots would compare? If there’s time, select random samples of size 100 to 
confirm your answer. 


When Mrs. Storrs’s class of 40 students did the red bead activity, they produced the dotplot 


shown at left (top) for samples of size 20. The dotplot is centered around 30%, the true percent 
of red beads. This shouldn’t be surprising because random sampling helps avoid bias. Notice 
also that the estimates varied from 10% to 50% and that only 11 of the 40 estimates were equal 
to exactly 30%. 


Percent of red beads ( = 20) 


10 2) 30 4) 50) 
Percent of red beads (n = 100) 


To see the effect of increasing the sample size, we simulated 40 random samples of size 
100 from the same population and recorded the percent of red beads in each sample. Notice that 
the graph is still centered at 30%, but there is much less variability. 


SAMPLING VARIABILITY AND SAMPLE SIZE 


Larger random samples tend to produce estimates that are closer to the true population value 
than smaller random samples. In other words, estimates from larger samples are more 
precise. 


EXAMPLE | Weighing football players (>) 


Inference for sampling 


Diamond Images/Getty Images 


PROBLEM: How much do National Football League (NFL) players weigh, on average? In 
a random sample of 50 NFL players, the average weight is 244.4 pounds. 


a. Do you think that 244.4 pounds is the true average weight of all NFL players? Explain 
your answer. 


b. Which would be more likely to give an estimate close to the true average weight of all 
NFL players: a random sample of 50 players or a random sample of 100 players? Explain 
your answer. 


SOLUTION: 


a. No. Different samples of size 50 would produce different average weights. So it would be 
surprising if this estimate is equal to the true average weight of all NFL players. 


b. A random sample of 100 players, because estimates tend to be closer to the truth when 
the sample size 1s larger. 


FOR PRACTICE, TRY EXERCISE 93 


Estimates from random samples often come with a margin of error that allows us to create 
an interval of plausible values for the true population value. In the preceding example about 
NFL players, the margin of error for the estimate of 244.4 pounds is 14.2 pounds. Based on this 
margin of error, it wouldn’t be surprising if the true average weight for all NFL players was as 
small as 244.4 — 14.2 = 230.2 pounds or as large as 244.4 + 14.2 = 258.6 pounds. 

You will learn how to calculate the margin of error in Chapter 8. For now, make sure to 
remember the effect of sampling variability when using data from a random sample to make an 
inference about a population. 


Inference for Experiments 


Well-designed experiments allow for inferences about cause and effect. But we should only 
conclude that changes in the explanatory variable cause changes in the response variable if the 
results of an experiment are statistically significant. 


DEFINITION Statistically significant 


When the observed results of a study are too unusual to be explained by chance 
alone, the results are called statistically significant. 


Mr. Wilcox and his students decided to perform the caffeine experiment from the preceding 
section. In their experiment, 10 student volunteers were randomly assigned to drink cola with 
caffeine and the remaining 10 students were assigned to drink caffeine-free cola. The table and 
graph show the change in pulse rate for each student (Final pulse rate — Initial pulse rate), along 
with the mean change for each group. 
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(Final — Initial) 

The dotplots provide some evidence that caffeine has an effect on pulse rates. The mean 
change for the 10 students who drank cola with caffeine was 3.2, which is 1.2 greater than for 
the group who drank caffeine-free cola. But are the results statistically significant? 

Recall that the purpose of random assignment in this experiment was to create two groups 
that were roughly equivalent at the beginning of the experiment. Subjects with high caffeine 
tolerance should be split up in about equal numbers, subjects with high metabolism should be 
split up in about equal numbers, and so on. 

Of course, the random assignment is unlikely to produce groups that are exactly equivalent. 
One group might get more “favorable” subjects just by chance. That is, the caffeine group 
might end up with a few extra subjects who were likely to have a pulse rate increase, just due to 
chance variation in the random assignment. 

There are two ways to explain why the mean change in pulse rate was 1.2 greater for the 
caffeine group: 


.. Caffeine does not have an effect on pulse rates, and the difference of 1.2 happened because 
of chance variation in the random assignment. 


’, Caffeine increases pulse rates. 


If it is plausible to get a difference of 1.2 or more simply due to the chance variation in 
random assignment, the results of the experiment are not statistically significant. But if it is 


very unlikely to get a difference of 1.2 or more by chance alone, we rule out Explanation 1 and 
say the results are statistically significant—and that caffeine increases pulse rates. 

How can we determine if a difference of 1.2 is statistically significant? You’ ll find out in 
the following activity. 


ACTIVITY Analyzing the caffeine experiment 


In the experiment performed by Mr. Wilcox’s class, the mean change in pulse rate for the 
caffeine group was 1.2 greater than the mean change for the no-caffeine group. This provides 
some evidence that caffeine increases pulse rates. Is this evidence convincing? Or is it 
plausible that a difference of 1.2 would arise just due to chance variation in the random 
assignment? In this activity, we’ll investigate by seeing what differences typically occur just 
by chance, assuming caffeine doesn’t affect pulse rates. That is, we’ll assume that the change 
in pulse rate for a particular student would be the same regardless of what treatment he or 
she was assigned. 
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Change in pulse rate Mean 

(Final pulse rate — Initial pulse rate) change 

Caffeine 8 3 5 14061 40 £322 
No Caffeine 3 2 4 -1 5 5 12 -1 4.) 2.0 


1. Gather 20 index cards to represent the 20 students in this experiment. On each card, write 
one of the 20 outcomes listed in the table. For example, write “8” on the first card, “3” on 
the second card, and so on. 


2. Shuffle the cards and deal two piles of 10 cards each. This represents randomly assigning 
the 20 students to the two treatments, assuming that the treatment received doesn’t affect 
the change in pulse rate. The first pile of 10 cards represents the caffeine group, and the 
second pile of 10 cards represents the no-caffeine group. 


3. Find the mean change for each group and subtract the means (Caffeine — No caffeine). 
Note: It is possible to get a negative difference. 


4. Your teacher will draw and label an axis for a class dotplot. Plot the difference you got in 
Step 3 on the graph. 


5. In Mr. Wilcox’s class, the observed difference in means was 1.2. Is a difference of 1.2 
Statistically significant? Discuss with your classmates. 


We used technology to perform 100 trials of the simulation described in the activity. The 
dotplot in Figure 4.3 shows that getting a difference of 1.2 isn’t that unusual. In 19 of the 100 
trials, we obtained a difference of 1.2 or more simply due to chance variation in the random 
assignment. 


In the 100 trials, 19 times 
the difference was 1.2 or 


4 =, z 
Simulated difference (Caffeine — No caffeine) 
in mean change in pulse rate 


FIGURE 4.3 Dotplot showing the differences in means that occurred in 100 simulated random 
assignments, assuming that caffeine has no effect on pulse rates. 


Because the difference of 1.2 or greater is somewhat likely to occur by chance alone, the 
results of Mr. Wilcox’s class experiment aren’t statistically significant. Based on this 
experiment, there isn’t convincing evidence that caffeine increases pulse rates. 


EXAMPLE | Distracted driving (>) 


Inference for experiments 


Sean Locke Photography/Shutterstock.com 


PROBLEM: Is talking on a cell phone while driving more distracting than talking to a 
passenger? David Strayer and his colleagues at the University of Utah designed an 


experiment to help answer this question. They used 48 undergraduate students as subjects. 
The researchers randomly assigned half of the subjects to drive in a simulator while talking 
on a cell phone, and the other half to drive in the simulator while talking to a passenger. One 
response variable was whether or not the driver stopped at a rest area that was specified by 
researchers before the simulation started. The table shows the results.™4 


a. Calculate the difference (Passenger — Cell phone) in the proportion of students who 
stopped at the rest area in the two groups. 
One hundred trials of a simulation were performed to see what differences in proportions 
would occur due only to chance variation in the random assignment, assuming that the 
type of distraction did not affect whether a subject stopped at the rest area. That is, 33 
“stoppers” and 15 “non-stoppers” were randomly assigned to two groups of 24. 


b. There are three dots at 0.29. Explain what these dots mean in this context. 


c. Use the results of the simulation to determine if the difference in proportions from part (a) 
is statistically significant. Explain your reasoning. 


Treatment 
Cell phone Passenger Total 
Stopped at rest area 33 
Response Didn't stop 15 
Total 48 
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Simulated difference (Passenger — Cell phone) 
in proportion of students who stopped 


SOLUTION: 


a. Difference in proportions = SNA = 1O/ DA = ORS = ORO =O 275 


b. When we assumed that the type of distraction doesn’t matter, there were three 
simulated random assignments where the difference in the proportion of students who 
stopped at the rest area was 0.2 oC 


c. Because a difference of 0.375 or greater never occurred in the simulation, the difference 


1S statistically significant. It ts extremely unlikely to get a difference this big simply 
due to chance variation in the random assignment. 


Because the difference is statistically significant, we can make a cause-and-effect 


conclusion: talking on a cell phone is more distracting than talking with a passenger— 
at least for subjects like those in the experiment. 
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In the caffeine example, we said that a difference in means of 1.2 was not unusual because 
a difference that big or bigger occurred 19% of the time by chance alone. In the distracted 
drivers example, we said that a difference in proportions of 0.375 was unusual because a 
difference this big or bigger occurred 0% of the time by chance alone. So the boundary 
between “not unusual” and “unusual” must be somewhere between 0% and 19%. For now, we 
recommend using a boundary of 5% so that differences that would occur less than 5% of the 
time by chance alone are considered statistically significant. 


The Scope of Inference: Putting it All Together 


The type of conclusion that can be drawn from a study depends on how the data in the study 
were collected. 

In the example about average weight in the NFL, the players were randomly selected from 
all NFL players. As you learned in Section 4.1, random sampling helps to avoid bias and 
produces reliable estimates of the truth about the population. Because the mean weight in the 
sample of players was 244.4 pounds, our best guess for the mean weight in the population of all 
NFL players is 244.4 pounds. Even though our estimates are rarely exactly correct, when 
samples are selected at random, we can make an inference about the population. 


In the distracted driver experiment, subjects were randomly assigned to talk on a cell phone 
or talk to a passenger. As you learned in Section 4.2, random assignment helps ensure that the 
two groups of subjects are as alike as possible before the treatments are imposed. If the group 
assigned to talk with a passenger remembers to stop at the rest area more often than the group 
assigned to talk on a cell phone, and the difference is too large to be explained by chance 
variation in the random assignment, it must be due to the treatments. In that case, the 
researchers could safely conclude that talking on a cell phone is more distracting than talking to 
a passenger. That is, they can make an inference about cause and effect. However, because the 
experiment used volunteer subjects, the scientists can only apply this conclusion to subjects 
like the ones in their experiment. 


Both random sampling and random assignment introduce chance variation into a statistical 


study. When performing inference, statisticians use the laws of probability to describe this 


chance variation. You'll learn how this works later in the book. 
Let’s recap what we’ve learned about the scope of inference in a statistical study. 


THE SCOPE OF INFERENCE 


e Random selection of individuals allows inference about the population from which the 
individuals were chosen. 


e Random assignment of individuals to groups allows inference about cause and effect. 


The following chart summarizes the possibilities.22 


Were individuals randomly assigned to groups? 


Yes No 
Inference about the population: Inference about the population: 
Yes YES Inference about cause and YES Inference about cause and 
Were individuals effect: YES effect: NO 
randomly selected? Inference about the population: Inference about the population: 
No NO Inference about cause and NO Inference about cause and 
effect: YES effect: NO 


Well-designed experiments randomly assign individuals to treatment groups. However, 
most experiments don’t select experimental units at random from the larger population. That 
limits such experiments to inference about cause and effect for individuals like those who 
received the treatments. Observational studies don’t randomly assign individuals to groups, 
which makes it challenging to make an inference about cause and effect. But an observational 
study that uses random sampling can make an inference about the population. 


EXAMPLE | When will | ever use this stuff? (>) 
The scope of inference 


PROBLEM: Researchers at the University of North Carolina were concerned about the 
increasing dropout rate in the state’s high schools, especially for low-income students. 
Surveys of recent dropouts revealed that many of these students had started to lose interest 
during middle school. They said they saw little connection between what they were studying 
in school and their future plans. To change this perception, researchers developed a program 
called CareerStart. The central idea of the program is that teachers show students how the 
topics they’re learning about can be applied to specific careers. 


To test the effectiveness of CareerStart, the researchers recruited 14 middle schools in 
Forsyth County to participate in an experiment. Seven of the schools, determined at random, 
used CareerStart along with the district’s standard curriculum. The other 7 schools just 


followed the standard curriculum. Researchers followed both groups of students for several 
years, collecting data on students’ attendance, behavior, standardized test scores, level of 
engagement in school, and whether or not the students graduated from high school. 


Results: Students at schools that used CareerStart generally had significantly better 
attendance and fewer discipline problems, earned higher test scores, reported greater 


engagement in their classes, and were more likely to graduate. 


What conclusion can we draw from this study? Explain your reasoning. 


SOLUTION: 


Because treatments were randomly assigned and the results were significant, we can 
conclude that using the CareerStart curriculum caused better attendance, fewer discipline 
problems, higher test scores, greater engagement, and increased graduation rates. However, 
these results only apply to schools like those in the study because the schools were not 
randomly selected from any population. 


With no random selection, the results of the study should be applied only to schools 


like those in the study. With random assignment, it is possible to make an inference 
about cause and effect. 


FOR PRACTICE, TRY EXERCISE 103 


CHECK YOUR UNDERSTANDING 


Ny 


y When an athlete suffers a sports-related concussion, does it help to remove the athlete 
from play immediately? Researchers recruited 95 athletes seeking care for a sports-related 
concussion at a medical clinic and followed their progress during recovery. Researchers 
found statistically significant evidence that athletes who were removed from play 
immediately recovered more quickly, on average, than athletes who continued to play.22 
What conclusion can we draw from this study? Explain your answer. 


The Challenges of Establishing Causation 


A well-designed experiment can tell us that changes in the explanatory variable cause changes 
in the response variable. More precisely, it tells us that this happened for specific individuals in 
the specific environment of this specific experiment. In some cases, it isn’t practical or even 
ethical to do an experiment. Consider these important questions: 


e Does going to church regularly help people live longer? 


e Does smoking cause lung cancer? 


To answer these cause-and-effect questions, we just need to perform a randomized 
comparative experiment. Unfortunately, we can’t randomly assign people to attend church or to 
smoke cigarettes. The best data we have about these and many other cause-and-effect questions 
come from observational studies. 


Doctors had long observed that most lung cancer patients were smokers. Comparison of 
smokers and similar nonsmokers showed a very strong association between smoking and death 
from lung cancer. Could the association be due to some other variable? Is there some genetic 
factor that makes people both more likely to become addicted to nicotine and to develop lung 
cancer? If so, then smoking and lung cancer would be strongly associated even if smoking had 
no direct effect on the lungs. Or maybe confounding is to blame. It might be that smokers live 
unhealthy lives in other ways (diet, alcohol, lack of exercise) and that some other habit 
confounded with smoking is a cause of lung cancer. Still, it is sometimes possible to build a 
strong case for causation in the absence of experiments. The evidence that smoking causes lung 
cancer is about as strong as nonexperimental evidence can be. 
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There are several criteria for establishing causation when we can’t do an experiment: 
e The association is strong. The association between smoking and lung cancer is very strong. 


e The association is consistent. Many studies of different kinds of people in many countries 
link smoking to lung cancer. That reduces the chance that some other variable specific to one 
group or one study explains the association. 


e Larger values of the explanatory variable are associated with stronger responses. People 
who smoke more cigarettes per day or who smoke over a longer period get lung cancer more 
often. People who stop smoking reduce their risk. 


e The alleged cause precedes the effect in time. Lung cancer develops after years of smoking. 
The number of men dying of lung cancer rose as smoking became more common, with a lag 
of about 30 years. Lung cancer kills more men than any other form of cancer. Lung cancer 


was rare among women until women began to smoke. Lung cancer in women rose along 
with smoking, again with a lag of about 30 years, and has passed breast cancer as the leading 
cause of cancer death among women. 


e The alleged cause is plausible. Experiments with animals show that tars from cigarette 
smoke do cause cancer. 


Medical authorities do not hesitate to say that smoking causes lung cancer. The U.S. 
Surgeon General states that cigarette smoking is “the largest avoidable cause of death and 
disability in the United States.”28 The evidence for causation is overwhelming—but it is not as 
strong as the evidence provided by well-designed experiments. Conducting an experiment in 
which some subjects were forced to smoke and others were not allowed to would be unethical. 
In cases like this, observational studies are our best source of reliable information. 


Data Ethics! 


There are some potential experiments that are clearly unethical. In other cases, the boundary 
between “ethical” and “unethical” isn’t as clear. Decide if you think each of the following 
studies is ethical or unethical: 


e A promising new drug has been developed for treating cancer in humans. Before giving the 
drug to human subjects, researchers want to administer the drug to animals to see if there are 
any potentially serious side effects. 


e Are companies discriminating against some individuals in the hiring process? To find out, 
researchers prepare several equivalent résumés for fictitious job applicants, with the only 
difference being the gender of the applicant. They send the fake résumés to companies 
advertising positions and keep track of the number of males and females who are contacted 
for interviews. 


e Will people try to stop someone from driving drunk? A television news program hires an 
actor to play a drunk driver and uses a hidden camera to record the behavior of individuals 
who encounter the driver. 


The most complex issues of data ethics arise when we collect data from people. The ethical 
difficulties are more severe for experiments that impose some treatment on people than for 
sample surveys that simply gather information. Trials of new medical treatments, for example, 
can do harm as well as good to their subjects. 

Here are some basic standards of data ethics that must be obeyed by all studies that gather 
data from human subjects, both observational studies and experiments. The law requires that 
studies carried out or funded by the federal government obey these principles.22 But neither the 
law nor the consensus of experts is completely clear about the details of their application. 


BASIC DATA ETHICS 


e All planned studies must be reviewed in advance by an institutional review board charged 
with protecting the safety and well-being of the subjects. 


e All individuals who are subjects in a study must give their informed consent before data 
are collected. 


e All individual data must be kept confidential. Only statistical summaries for groups of 
subjects may be made public. 


Institutional Review Boards The purpose of an institutional review board is not to decide 
whether a proposed study will produce valuable information or if it is statistically sound. The 
board’s purpose is, in the words of one university’s board, “to protect the rights and welfare of 
human subjects (including patients) recruited to participate in research activities.” The board 
reviews the plan of the study and can require changes. It reviews the consent form to be sure 
that subjects are informed about the nature of the study and about any potential risks. Once 
research begins, the board monitors its progress at least once a year. 


Informed Consent Both words in the phrase informed consent are important, and both can be 
controversial. Subjects must be informed in advance about the nature of a study and any risk of 
harm it may bring. In the case of a questionnaire, physical harm is not possible. But a survey on 
sensitive issues could result in emotional harm. The participants should be told what kinds of 
questions the survey will ask and roughly how much of their time it will take. Experimenters 
must tell subjects the nature and purpose of the study and outline possible risks. Subjects must 
then consent in writing. 


Confidentiality It is important to protect individuals’ privacy by keeping all data about them 
confidential. The report of an opinion poll may say what percent of the 1200 respondents 
believed that legal immigration should be reduced. It may not report what you said about this or 
any other issue. Confidentiality is not the same as anonymity. Anonymity means that 
individuals are anonymous—their names are not known even to the director of the study. 
Anonymity is rare in statistical studies. Even where anonymity is possible (mainly in surveys 
conducted by mail), it prevents any follow-up to improve nonresponse or inform individuals of 
results. 


Section 4.3 Summary 


e Sampling variability refers to the idea that different random samples of the same size from 
the same population produce different estimates. Reduce sampling variability by increasing 
the sample size. 


e When the observed results of a study are too unusual to be explained by chance alone, we 
say that the results are statistically significant. 


e Most studies aim to make inferences that go beyond the data produced. 


: Inference about a population requires that the individuals taking part in a study be 
randomly selected from the population. 


. A well-designed experiment that randomly assigns experimental units to treatments allows 
inference about cause and effect. 


e Inthe absence of an experiment, good evidence of causation requires a strong association 
that appears consistently in many studies, a clear explanation for the alleged causal link, and 
careful examination of other variables. 


e Studies involving humans must be screened in advance by an institutional review board. 
All participants must give their informed consent before taking part. Any information about 
the individuals in the study must be kept confidential. 


Section 4.3 Exercises 


93. PQ 271 |p) Tweet, tweet! What proportion of students at your school use Twitter? To 


find out, you survey a simple random sample of students from the school roster. 


a. Will your sample result be exactly the same as the true population proportion? Explain 
your answer. 


b. Which would be more likely to produce a sample result closer to the true population 
value: an SRS of 50 students or an SRS of 100 students? Explain your answer. 


94. Far from home? A researcher wants to estimate the average distance that students at a 
large community college live from campus. To find out, she surveys a simple random 
sample of students from the registrar’s database. 


a. Will the researcher’s sample result be exactly the same as the true population mean? 
Explain your answer. 


b. Which would be more likely to produce a sample result closer to the true population 
value: an SRS of 100 students or an SRS of 50 students? Explain your answer. 


95. Football on TV A Gallup poll conducted telephone interviews with a random sample of 
1000 adults aged 18 and older. Of these, 37% said that football is their favorite sport to 
watch on television. The margin of error for this estimate is 3.1 percentage points. 


a. Would you be surprised if a census revealed that 50% of adults in the population would 
say their favorite sport to watch on TV is football? Explain your answer. 


b. Explain how Gallup could decrease the margin of error. 


96. Car colors in Miami Using a webcam, a traffic analyst selected a random sample of 800 
cars traveling on I-195 in Miami on a weekday morning. Among the 800 cars in the 
sample, 24% were white. The margin of error for this estimate is 3.0 percentage points. 


a. Would you be surprised if a census revealed that 26% of cars on I-195 in Miami on a 
weekday morning were white? Explain your answer. 


b. Explain how the traffic analyst could decrease the margin of error. 


97. Kissing the right way According to a newspaper article, “Most people are kissing the 
‘right way.’” That is, according to a study, the majority of couples prefer to tilt their heads 
to the right when kissing. In the study, a researcher observed a random sample of 124 
kissing couples and found that 83/124 (66.9%) of the couples tilted to the right.2 To 
determine if these data provide convincing evidence that couples are more likely to tilt 
their heads to the right, 100 simulated SRSs were selected. 
Each dot in the graph shows the number of couples that tilt to the right in a simulated SRS 
of 124 couples, assuming that each couple has a 50% chance of tilting to the right. 
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a. Explain how the graph illustrates the concept of sampling variability. 


b. Based on the data from the study and the results of the simulation, is there convincing 
evidence that couples prefer to kiss the “right way”? Explain your answer. 


98. Weekend birthdays Over the years, the percentage of births that are planned caesarean 
sections has been rising. Because doctors can schedule these deliveries, there might be 
more children born during the week and fewer born on the weekend than if births were 
uniformly distributed throughout the week. To investigate, Mrs. McDonald and her class 
selected an SRS of 73 people born since 1993. Of these people, 24 were born on Friday, 
Saturday, or Sunday. 


To determine if these data provide convincing evidence that fewer than 43% (3/7) of 
people born since 1993 were born on Friday, Saturday, or Sunday, 100 simulated SRSs 
were selected. Each dot in the graph shows the number of people that were born on 
Friday, Saturday, or Sunday in a simulated SRS of 73 people, assuming that each person 
had a 43% chance of being born on one of these three days. 
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Simulated number who were born on 
Friday, Saturday, or Sunday 


a. Explain how the graph illustrates the concept of sampling variability. 


b. Based on the data from the study and the results of the simulation, is there convincing 
evidence that fewer than 43% of people born since 1993 were born on Friday, Saturday, 
or Sunday? Explain your answer. 


99. pg 274 |») I work out a lot Are people influenced by what others say? Michael 


conducted an experiment in front of a popular gym. As people entered, he asked them how 
many days they typically work out per week. As he asked the question, he showed the 
subjects one of two clipboards, determined at random. Clipboard A had the question and 
many responses written down, where the majority of responses were 6 or 7 days per week. 
Clipboard B was the same, except most of the responses were 1 or 2 days per week. The 
mean response for the Clipboard A group was 4.68 and the mean response for the 


Clipboard B group was 4.21.54 


a. Calculate the difference (Clipboard A — Clipboard B) in the mean number of days for 
the two groups. 
One hundred trials of a simulation were performed to see what differences in means 
would occur due only to chance variation in the random assignment, assuming that the 
responses on the clipboard don’t matter. The results are shown in the dotplot. 
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b. There is one dot at 0.72. Explain what this dot means in this context. 


c. Use the results of the simulation to determine if the difference in means from part (a) is 
Statistically significant. Explain your reasoning. 


100. A louse-y situation A study published in the New England Journal of Medicine compared 
two medicines to treat head lice: an oral medication called ivermectin and a topical lotion 


containing malathion. Researchers studied 812 people in 376 households in seven areas 
around the world. Of the 185 households randomly assigned to ivermectin, 171 were free 
from head lice after 2 weeks, compared with only 151 of the 191 households randomly 


assigned to malathion.©2 


a. Calculate the difference (Ivermectin — Malathion) in the proportion of households that 
were free from head lice in the two groups. 


One hundred trials of a simulation were performed to see what differences in 
proportions would occur due only to chance variation in the random assignment, 
assuming that the type of medication doesn’t matter. The results are shown in the 
dotplot. 
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b. There is one dot at 0.09. Explain what this dot means in this context. 


c. Use the results of the simulation to determine if the difference in proportions from 
part (a) is statistically significant. Explain your reasoning. 


101. Acupuncture and pregnancy A study sought to determine if the ancient Chinese art of 
acupuncture could help infertile women become pregnant.®2 A total of 160 healthy 
women undergoing assisted reproductive therapy were recruited for the study. Half of 
the subjects were randomly assigned to receive acupuncture treatment 25 minutes before 
embryo transfer and again 25 minutes after the transfer. The remaining 80 subjects were 
instructed to lie still for 25 minutes after the embryo transfer. Results: In the acupuncture 
group, 34 women became pregnant. In the control group, 21 women became pregnant. 


a. Why did researchers randomly assign the subjects to the two treatments? 


b. The difference in the percent of women who became pregnant in the two groups is 
Statistically significant. Explain what this means to someone who knows little 
statistics. 


c. Explain why the design of the study prevents us from concluding that acupuncture 
caused the difference in pregnancy rates. 


102. Do diets work? Dr. Linda Stern and her colleagues recruited 132 obese adults at the 
Philadelphia Veterans Affairs Medical Center in Pennsylvania. Half the participants 


were randomly assigned to a low-carbohydrate diet and the other half to a low-fat diet. 
Researchers measured each participant’s change in weight and cholesterol level after six 
months and again after one year. Subjects in the low-carb diet group lost significantly 
more weight than subjects in the low-fat diet group during the first six months. At the 
end of a year, however, the average weight loss for subjects in the two groups was not 
significantly different.©4 


a. Why did researchers randomly assign the subjects to the diet treatments? 


b. Explain to someone who knows little statistics what “lost significantly more weight” 
means. 


c. The subjects in the low-carb diet group lost an average of 5.1 kg in a year. The 
subjects in the low-fat diet group lost an average of 3.1 kg in a year. Explain how this 
information could be consistent with the fact that weight loss in the two groups was 
not significantly different. 


103. pg 276 |) Foster care versus orphanages Do abandoned children placed in foster 


homes do better than similar children placed in an institution? The Bucharest Early 
Intervention Project found statistically significant evidence that they do. The subjects 
were 136 young children abandoned at birth and living in orphanages in Bucharest, 
Romania. Half of the children, chosen at random, were placed in foster homes. The other 
half remained in the orphanages.®2 (Foster care was not easily available in Romania at 
the time and so was paid for by the study.) What conclusion can we draw from this 
study? Explain your reasoning. 


104. Frozen batteries Will storing batteries in a freezer make them last longer? To find out, a 
company that produces batteries takes a random sample of 100 AA batteries from its 
warehouse. The company statistician randomly assigns 50 batteries to be stored in the 
freezer and the other 50 to be stored at room temperature for 3 years. At the end of that 
time period, each battery’s charge is tested. Result: Batteries stored in the freezer had a 
significantly higher average charge. What conclusion can we draw from this study? 
Explain your reasoning. 


105. Attend church, live longer? One of the better studies of the effect of regular attendance 
at religious services gathered data from a random sample of 3617 adults. The researchers 
then measured lots of variables, not just the explanatory variable (religious activities) 
and the response variable (length of life). A news article said: “Churchgoers were more 
likely to be nonsmokers, physically active, and at their right weight. But even after 
health behaviors were taken into account, those not attending religious services regularly 
still were significantly more likely to have died.”©° What conclusion can we draw from 
this study? Explain your reasoning. 


106. Rude surgeons Is a friendly surgeon a better surgeon? In a study of more than 32,000 
surgical patients from 7 different medical centers, researchers classified surgeons by the 
number of unsolicited complaints that had been recorded about their behavior. The 


researchers found that surgical complications were significantly more common in 
patients whose surgeons had received the most complaints, compared with patients 
whose surgeons had received the fewest complaints.£4 What conclusion can we draw 
from this study? Explain your reasoning. 


107. Berry good Eating blueberries and strawberries might improve heart health, according to 
a long-term study of 93,600 women who volunteered to take part. These berries are high 
in anthocyanins due to their pigment. Women who reported consuming the most 
anthocyanins had a significantly smaller risk of heart attack compared to the women who 
reported consuming the least. What conclusion can we draw from this study? Explain 


your reasoning ©8 


108. Exercise and memory A study of strength training and memory randomly assigned 46 
young adults to two groups. After both groups were shown 90 pictures, one group had to 
bend and extend one leg against heavy resistance 60 times. The other group stayed 
relaxed, while the researchers used the same exercise machine to bend and extend their 
legs with no resistance. Two days later, each subject was shown 180 pictures—the 
original 90 pictures plus 90 new pictures and asked to identify which pictures were 
shown two days earlier. The resistance group was significantly more successful in 
identifying these pictures than was the relax group. What conclusions can we draw from 


this study? Explain your reasoning.© 

109. * Minimal risk? You have been invited to serve on a college’s institutional review board. 
You must decide whether several research proposals qualify for lighter review because 
they involve only minimal risk to subjects. Federal regulations say that “minimal risk” 
means the risks are no greater than “those ordinarily encountered in daily life or during 
the performance of routine physical or psychological examinations or tests.” That’s 
vague. Which of these do you think qualifies as “minimal risk”? 


a. Draw a drop of blood by pricking a finger to measure blood sugar. 
b. Draw blood from the arm for a full set of blood tests. 


c. Insert a tube that remains in the arm so that blood can be drawn regularly. 


110. * Who reviews? Government regulations require that institutional review boards consist 
of at least five people, including at least one scientist, one nonscientist, and one person 
from outside the institution. Most boards are larger, but many contain just one outsider. 


a. Why should review boards contain people who are not scientists? 


b. Do you think that one outside member is enough? How would you choose that 
member? (For example, would you prefer a medical doctor? A religious leader? An 


activist for patients’ rights?) 
111. * Facebook emotions In cooperation with researchers from Cornell University, Facebook 
randomly selected almost 700,000 users for an experiment in “emotional contagion.” 


Users’ news feeds were manipulated (without their knowledge) to selectively show 
postings from their friends that were either more positive or more negative in tone, and 
the emotional tone of their own subsequent postings was measured. The researchers 
found evidence that people who read emotionally negative postings were more likely to 
post messages with a negative tone, whereas those who read positive messages were 
more likely to post messages with a positive tone. The research was widely criticized for 


being unethical. Explain why.22 


112. * No consent needed? In which of the circumstances listed here would you allow 
collecting personal information without the subjects’ consent? 


a. A government agency takes a random sample of income tax returns to obtain 
information on the average income of people in different occupations. Only the 
incomes and occupations are recorded from the returns, not the names. 


b. A social psychologist attends public meetings of a religious group to study the 
behavior patterns of its members. 


c. A social psychologist pretends to be converted to membership in a religious group and 
attends private meetings to study the behavior patterns of its members. 


113. * Anonymous? Confidential? One of the most important nongovernment surveys in the 
United States is the National Opinion Research Center’s General Social Survey (GSS). 
The GSS regularly monitors public opinion on a wide variety of political and social 
issues. Interviews are conducted in person in the subject’s home. Are a subject’s 
responses to GSS questions anonymous, confidential, or both? Explain your answer. 


114, * Anonymous? Confidential? Texas A&M, like many universities, offers screening for 
HIV, the virus that causes AIDS. Students may choose either anonymous or confidential 
screening. An announcement says, “Persons who sign up for screening will be assigned a 
number so that they do not have to give their name.” They can learn the results of the test 
by telephone, still without giving their name. Does this describe anonymous or 
confidential screening? Why? 


115. * The Willowbrook hepatitis studies In the 1960s, children entering the Willowbrook 
State School, an institution for the intellectually disabled on Staten Island in New York, 
were deliberately infected with hepatitis. The researchers argued that almost all children 
in the institution quickly became infected anyway. The studies showed for the first time 
that two strains of hepatitis existed. This finding contributed to the development of 
effective vaccines. Despite these valuable results, the Willowbrook studies are now 
considered an example of unethical research. Explain why, according to current ethical 
standards, useful results are not enough to allow a study. 


116. * Unequal benefits Researchers on aging proposed to investigate the effect of 
supplemental health services on the quality of life of older people. Eligible patients on 
the rolls of a large medical clinic were to be randomly assigned to treatment and control 
groups. The treatment group would be offered hearing aids, dentures, transportation, and 


other services not available without charge to the control group. The review board 
believed that providing these services to some but not other persons in the same 
institution raised ethical questions. Do you agree? 


Multiple Choice Select the best answer for Exercises 117 and 118. 


117. Do product labels influence customer perceptions? To find out, researchers recruited more 
than 500 adults and asked them to estimate the number of calories, amount of added 
sugar, and amount of fat in a variety of food products. Half of the subjects were 
randomly assigned to evaluate products with the word “Natural” on the label, while the 
other half were assigned to evaluate the same products without the “Natural” label. On 
average, the products with the “Natural” label were judged to have significantly fewer 
calories. Based on this study, is it reasonable to conclude that including the word 
“Natural” on the label causes a reduction in estimated calories? 


a. No, because the adults weren’t randomly selected from the population of all adults. 
b. No, because there wasn’t a control group for comparison. 

c. No, because association doesn’t imply causation. 

d. Yes, because the adults were randomly assigned to the treatments. 


e. Yes, because there were a large number of adults involved in the study. 


118. Some news organizations maintain a database of customers who have volunteered to share 
their opinions on a variety of issues. Suppose that one of these databases includes 9000 
registered voters in California. To measure the amount of support for a controversial 
ballot issue, 1000 registered voters in California are randomly selected from the database 
and asked their opinion. Which of the following is the largest population to which the 
results of this survey should be generalized? 


a. The 1000 people in the sample 

b. The 9000 registered voters from California in the database 
c. All registered voters in California 

d. All California residents 


e. All registered voters in the United States 


Review and Recycle 


19. Animal testing (1.1) “It is right to use animals for medical testing if it might save human 
lives.” The General Social Survey asked 1152 adults to react to this statement. Here is 
the two-way table of their responses: 


Gender 


Male Female Total 


Strongly agree — 
Agree al 
Opinion about Neither agree nor 
using animals disagree a 
for medical 
testing Disagree ace 
Strongly agree 20 
Total 516 636 1152 


a. Construct segmented bar graphs to display the distribution of opinion for males and 
for females. 
b. Is there an association between gender and opinion for the members of this sample? 


Explain your answer. 
20. Initial public offerings (1.3) The business magazine Forbes reports that 4567 companies 


sold their first stock to the public between 1990 and 2000. The mean change in the stock 
price of these companies since the first stock was issued was +111%. The median change 


was —31%.2! Explain how this difference could happen. 


*Exercises 109-116: This is an important topic, but it is not required for the AP® Statistics 


exam. 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free-response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


In a recent study, 166 adults from the St. Louis area were recruited and randomly 
assigned to receive one of two treatments for a sinus infection. Half of the subjects 
received an antibiotic (amoxicillin) and the other half received a placebo. 


a. Describe how the researchers could have assigned treatments to subjects if they 
wanted to use a completely randomized design. 


All the subjects in the experiment had moderate, severe, or very severe Symptoms 
at the beginning of the study. Describe one statistical benefit and one statistical 
drawback for using subjects with moderate, severe, or very severe symptoms 
instead of just using subjects with very severe symptoms. 


At different stages during the next month, all subjects took the sino-nasal outcome 
test. After 10 days, the difference in average test scores was not statistically 
significant. In this context, explain what it means for the difference to be not 
Statistically significant. 


One possible way that researchers could have improved the study is to use a 
randomized block design. Explain how the researchers could have incorporated 
blocking in their design. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score 


Chapter 4 Review 


Section 4.1: Sampling and Surveys 


In this section, you learned that a population is the group of all individuals that we want 
information about. A sample is the subset of the population that we use to gather this 
information. The goal of most sample surveys is to use information from the sample to draw 
conclusions about the population. Choosing people for a sample because they are located 
nearby or letting people choose whether or not to be in the sample are poor ways to select a 
sample. Because convenience sampling and voluntary response sampling will produce 
estimates that are likely to underestimate or likely to overestimate the value you want to know, 
these methods of choosing a sample are biased. 


To avoid bias in the way the sample is formed, the members of the sample should be 
chosen at random. One way to do this is with a simple random sample (SRS), which is 
equivalent to selecting well-mixed slips of paper from a hat. It is often more convenient to 
select an SRS using technology or a table of random digits. 


Two other random sampling methods are stratified sampling and cluster sampling. To 
obtain a stratified random sample, divide the population into groups (strata) of individuals that 
are likely to have similar responses, take an SRS from each stratum, and combine the chosen 
individuals to form the sample. Stratified random samples can produce estimates with much 
greater precision than simple random samples. To obtain a cluster sample, divide the 
population into groups (clusters) of individuals that are in similar locations, randomly select 
clusters, and use every individual in the chosen clusters. Cluster samples are easier to obtain 
than simple random samples or stratified random samples, but they may not produce very 
precise estimates. 


Finally, you learned about other issues in sample surveys that can lead to bias: 
undercoverage occurs when the sampling method systematically underrepresents one part of 
the population. Nonresponse describes when answers cannot be obtained from some people that 
were chosen to be in the sample. Bias can also result when some people in the sample don’t 
give accurate responses due to question wording, interviewer characteristics, or other factors. 


Section 4.2: Experiments 


In this section, you learned about the difference between observational studies and 
experiments. Experiments deliberately impose a treatment to see if there is a cause-and-effect 
relationship between two variables. Observational studies look at relationships between two 
variables, but make it difficult to show cause and effect because other variables may be 
confounded with the explanatory variable. Variables are confounded when it is impossible to 
determine which of the variables is causing a change in the response variable. 


A common type of comparative experiment uses a completely randomized design. In this 


type of design, the experimental units are assigned to the treatments at random. With random 
assignment, the treatment groups will be roughly equivalent at the beginning of the experiment. 
Replication means giving each treatment to as many experimental units as possible. This makes 
it easier to see the effects of the treatments because the effects of other variables are more 
likely to be balanced among the treatment groups. 


During an experiment, it is important that other variables be controlled (kept the same) for 
each experimental unit. Doing so helps avoid confounding and removes a possible source of 
variation in the response variable. Also, beware of the placebo effect—the tendency for people 
to improve because they expect to, not because of the treatment they are receiving. One way to 
make sure that all experimental units have the same expectations is to make them blind— 
unaware of which treatment they are receiving. When the people interacting with the subjects 
and measuring the response variable are also blind, the experiment is called double-blind. 


Blocking in experiments is similar to stratifying in sampling. To form blocks, group 
together experimental units that are similar with respect to a variable that is associated with the 
response. Then randomly assign the treatments within each block. A randomized block design 
that uses blocks with two experimental units is called a matched pairs design. Blocking helps 
us estimate the effects of the treatments more precisely because we can account for the 
variability introduced by the variables used to form the blocks. 


Section 4.3: Using Studies Wisely 


In this section, you learned that the types of conclusions we can draw depend on how the data 
are produced. When samples are selected at random, we can make inferences about the 
population from which the sample was drawn. However, the estimates we calculate from 
sample data rarely equal the true population value because of sampling variability. We can 
reduce sampling variability by increasing the sample size. 


When treatments are applied to groups formed at random in an experiment, we can make an 
inference about cause and effect. Making a cause-and-effect conclusion is often difficult 
because it is impossible or unethical to perform certain types of experiments. Good data ethics 
requires that studies should be approved by an institutional review board, subjects should give 
informed consent, and individual data must be kept confidential. 


Finally, the results of a study are statistically significant if they are too unusual to occur by 
chance alone. 


What Did You Learn? 
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Example on Relevant Chapter Review 
Learning Target Section Page(s) Exercise(s) 
Identify the population and 4.1 221 R4.1 


sample in a Statistical 


study. 


Identify voluntary response 
sampling and convenience 
sampling and explain how 
these sampling methods 
can lead to bias. 


Describe how to select a 
simple random sample 
with technology or a table 
of random digits. 


Describe how to select a 
sample using stratified 
random sampling and 
cluster sampling, 
distinguish stratified 
random sampling from 
cluster sampling, and give 
an advantage of each 
method. 


Explain how 
undercoverage, 
nonresponse, question 
wording, and other aspects 
of a sample survey can 
lead to bias. 
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confounding and how it 
limits the ability to make 
cause-and-effect 
conclusions. 


Distinguish between an 
observational study and an 
experiment, and identify 
the explanatory and 
response variables in each 
type of study. 


Identify the experimental 
units and treatments in an 
experiment. 


Describe the placebo 
effect and the purpose of 
blinding in an experiment. 
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assign treatments in an 
experiment using slips of 
paper, technology, ora 
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Explain the purpose of 4.2 254 R4.6, R4.8 
comparison, random 

assignment, control, and 

replication in an 


experiment. 

Describe a completely 4.2 256 R4.6, R4.9 
randomized design for an 

experiment. 

Describe a randomized 4.2 259, 260 R4.6, R4.9 


block design anda 
matched pairs design for 
an experiment and explain 
the purpose of blocking in 
an experiment. 


Explain the concept of 4.3 271 
sampling variability when 

making an inference about 

a population and how 

sample size affects 

sampling variability. 

Explain the meaning of 4.3 274 
statistically significant in 

the context of an 

experiment and use 

simulation to determine if 

the results of an 

experiment are statistically 

significant. 


Identify when it is 4.3 276 
appropriate to make an 

inference about a 

population and when it is 

appropriate to make an 

inference about cause and 

effect. 


Evaluate if a statistical 4.3 278 
study has been carried out 
in an ethical manner.! 
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iThis is an important topic, but it is not required for the AP® Statistics exam. 


Chapter 4 Review Exercises 


R4.1 \») Nurses are the best A recent random sample of n = 805 adult U.S. residents found 


that the proportion who rated the honesty and ethical standards of nurses as high or very 


high is 0.85. This is 0.15 higher than the proportion recorded for doctors, the next 


highest-ranked profession.22 


a. Identify the sample and the population in this setting. 


b. Do you think that the proportion of all U.S. residents who would rate the honesty and 
ethical standards of nurses as high or very high is exactly 0.85? Explain your answer. 


c. What is the benefit of increasing the sample size in this context? 


R4.2 \_») Parking problems The administration at a high school with 1800 students wants to 


gather student opinion about parking for students on campus. It isn’t practical to contact 
all students. 


a. Give an example of a way to choose a voluntary response sample of students. Explain 
how this method could lead to bias. 


b. Give an example of a way to choose a convenience sample of students. Explain how 
this method could lead to bias. 


c. Describe how to select an SRS of 50 students from the school. 
d. Explain how the method you described in part (c) avoids the biases you described in 
parts (a) and (b). 
R4.3 \») Surveying NBA fans The manager of a sports arena wants to learn more about the 


financial status of the people who are attending an NBA basketball game. He would like 
to give a survey to a representative sample of about 10% of the fans in attendance. Ticket 
prices for the game vary a great deal: seats near the court cost over $200 each, while 
seats in the top rows of the arena cost $50 each. The arena is divided into 50 numbered 
sections, from 101 to 150. Each section has rows of seats labeled with letters from A 
(nearest the court) to ZZ (top row of the arena). 


a. Explain why it might be difficult to give the survey to an SRS of fans. 


b. Explain why it would be better to select a stratified random sample using the lettered 
rows rather than the numbered sections as strata. What is the benefit of using a 
stratified sample in this context? 


c. Explain how to select a cluster sample of fans. What is the benefit of using a cluster 
sample in this context? 


R4.4 |») Been to the movies? An opinion poll calls 2000 randomly chosen residential 


telephone numbers, then asks to speak with an adult member of the household. The 


interviewer asks, “Box office revenues are at an all-time high. How many movies have 
you watched in a movie theater in the past 12 months?” In all, 1131 people responded. 
The researchers used the responses to estimate the mean number of movies adults had 

watched in a movie theater over the past 12 months. 


a. Describe a potential source of bias related to the wording of the question. Suggest a 
change that would help fix this problem. 


b. Describe how using only residential phone numbers might lead to bias and how this 
will affect the estimate. 


c. Describe how nonresponse might lead to bias and how this will affect the estimate. 


R4.5 p) Are anesthetics safe? The National Halothane Study was a major investigation of 


the safety of anesthetics used in surgery. Records of over 850,000 operations performed 


in 34 major hospitals showed the following death rates for four common anesthetics: 


Anesthetic A B Cc D 
Death rate 1.7% 1.7% 3.4% 1.9% 


There seems to be a clear association between the anesthetic used and the death rate of 
patients. Anesthetic C appears to be more dangerous. 


a. Explain why we call the National Halothane Study an observational study rather than 
an experiment, even though it compared the results of using different anesthetics in 
actual surgery. 


b. Identify the explanatory and response variables in this study. 


c. When the study looked at other variables that are related to a doctor’s choice of 
anesthetic, it found that Anesthetic C was not causing extra deaths. Explain the 
concept of confounding in this context and identify a variable that might be 
confounded with the doctor’s choice of anesthetic. 


R4.6 ») Ugly fries Few people want to eat discolored french fries. To prevent spoiling and to 


preserve flavor, potatoes are kept refrigerated before being cut for french fries. But 
immediate processing of cold potatoes causes discoloring due to complex chemical 
reactions. The potatoes must therefore be brought to room temperature before processing. 
Researchers want to design an experiment in which tasters will rate the color and flavor 
of french fries prepared from several groups of potatoes. The potatoes will be freshly 
picked or stored for a month at room temperature or stored for a month refrigerated. 
They will then be sliced and cooked either immediately or after an hour at room 
temperature. 


a. Identify the experimental units, the factors, the number of levels for each factor, and 
the treatments. 


b. Describe a completely randomized design for this experiment using 300 potatoes. 


c. A single supplier has made 300 potatoes available to the researchers. Describe a 


Statistical benefit and a statistical drawback of using potatoes from only one supplier. 


d. The researchers decided to do a follow-up experiment using potatoes from several 
different suppliers. Describe how they should change the design of the experiment to 
account for the addition of other suppliers. 

R4.7 \») Don’t catch a cold! A recent study of 1000 students at the University of Michigan 


investigated how to prevent catching the common cold. The students were randomly 
assigned to three different cold prevention methods for 6 weeks. Some wore masks, some 
wore masks and used hand sanitizer, and others took no precautions. The two groups 
who used masks reported 10—50% fewer cold symptoms than those who did not wear a 
mask.22 


a. Does this study allow for inference about a population? Explain your answer. 
b. Does this study allow for inference about cause and effect? Explain your answer. 
R4.8 _») An herb for depression? Does the herb St. John’s wort relieve major depression? 


Here is an excerpt from the report of one study of this issue: “Design: Randomized, 
Double-Blind, Placebo-Controlled Clinical Trial.”2® The study concluded that the 
difference in effectiveness of St. John’s wort and a placebo was not statistically 
significant. 


a. Describe the placebo effect in this context. How did the design of this experiment 
account for the placebo effect? 
b. Explain the purpose of the random assignment. 
c. Why is a double-blind design a good idea in this setting? 
d. Explain what “not statistically significant” means in this context. 
R4.9 \_») How long did I work? A psychologist wants to know if the difficulty of a task 


influences our estimate of how long we spend working at it. She designs two sets of 
mazes that subjects can work through on a computer. One set has easy mazes and the 
other has difficult mazes. Subjects work until told to stop (after 6 minutes, but subjects 
do not know this). They are then asked to estimate how long they worked. The 
psychologist has 30 students available to serve as subjects. 


a. Describe an experiment using a completely randomized design to learn the effect of 
difficulty on estimated time. Make sure to carefully explain your method of assigning 
treatments. 


b. Describe a matched pairs experimental design using the same 30 subjects. 


c. Which design would be more likely to detect a difference in the effects of the 
treatments? Explain your answer. 
R4.10l! .») Deceiving subjects Students sign up to be subjects in a psychology experiment. 


When they arrive, they are told that interviews are running late and are taken to a waiting 
room. The experimenters then stage the theft of a valuable object left in the waiting 


room. Some subjects are alone with the thief, and others are present in pairs—these are 
the treatments being compared. Will the subject report the theft? 


a. 


The students had agreed to take part in an unspecified study, and the true nature of the 
experiment is explained to them afterward. Does this meet the requirement of 
informed consent? Explain your answer. 


What two other ethical principles should be followed in this study? 


Chapter 4 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T4.1 When we take a census, we attempt to collect data from 
a. a Stratified random sample. 
b. every individual chosen in a simple random sample. 
c. every individual in the population. 
d. a voluntary response sample. 
e. a convenience sample. 


T4.2 You want to take a simple random sample (SRS) of 50 of the 816 students who live in 
a dormitory on campus. You label the students 001 to 816 in alphabetical order. In the 
table of random digits, you read the entries 


95592 94007 69769 33547 72450 16632 81194 14873 


The first three students in your sample have labels 
a. 955, 929, 400. 
b. 400, 769, 769. 
c. 559, 294, 007. 
d. 929, 400, 769. 
e. 400, 769, 335. 


T4.3 A study of treatments for angina (pain due to low blood supply to the heart) compared 
bypass surgery, angioplasty, and use of drugs. The study looked at the medical records 
of thousands of angina patients whose doctors had chosen one of these treatments. It 
found that the average survival time of patients given drugs was the highest. What do 
you conclude? 


a. This study proves that drugs prolong life and should be the treatment of choice. 


b. We can conclude that drugs prolong life because the study was a comparative 
experiment. 


c. We can’t conclude that drugs prolong life because the patients were volunteers. 


d. We can’t conclude that drugs prolong life because the groups might differ in ways 
besides the treatment. 


e. We can’t conclude that drugs prolong life because no placebo was used. 


T4.4 Tonya wanted to estimate the average amount of time that students at her school spend 


on Facebook each day. She gets an alphabetical roster of students in the school from 
the registrar’s office and numbers the students from 1 to 1137. Then Tonya uses a 
random number generator to pick 30 distinct labels from 1 to 1137. She surveys those 
30 students about their Facebook use. Tonya’s sample is a simple random sample 
because 


a. it was selected using a chance process. 

b. it gave every individual the same chance to be selected. 

c. it gave every possible sample of size 30 an equal chance to be selected. 
d. it doesn’t involve strata or clusters. 

e. it is guaranteed to be representative of the population. 


T4.5 Consider an experiment to investigate the effectiveness of different insecticides in 
controlling pests and their impact on the productivity of tomato plants. What is the best 
reason for randomly assigning treatment levels (spraying or not spraying) to the 
experimental units (farms)? 


a. Random assignment eliminates the effects of other variables, like soil fertility. 
b. Random assignment eliminates chance variation in the responses. 


c. Random assignment allows researchers to generalize conclusions about the 
effectiveness of the insecticides to all farms. 


d. Random assignment will tend to average out all other uncontrolled factors such as 
soil fertility so that they are not confounded with the treatment effects. 


e. Random assignment helps avoid bias due to the placebo effect. 


T4.6 Researchers randomly selected 1700 people from Canada who had never suffered a 
heart attack and rated the happiness of each person. Ten years later, the researchers 
followed up with each person and found that people who were initially rated as happy 
were less likely to have a heart problem.“ Which of the following is the most 
appropriate conclusion based on this study? 


a. Happiness causes better heart health for all people. 

b. Happiness causes better heart health for Canadians. 

c. Happiness causes better heart health for the 1700 people in the study. 
d. Happier people in Canada are less likely to have heart problems. 

e. Happier people in the study were less likely to have heart problems. 


T4.7 A TV station wishes to obtain information on the TV viewing habits in its market area. 
The market area contains one city of population 170,000, another city of 70,000, and 
four towns of about 5000 residents each. The station suspects that the viewing habits 
may be different in larger and smaller cities and in the rural areas. Which of the 
following sampling designs would yield the type of information the station requires? 
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A stratified sample from the cities and towns in the market area 


. A cluster sample using the cities and towns as clusters 


A convenience sample from the market area 


. A simple random sample from the market area 


An online poll that invites all people from the cities and towns in the market area to 
participate 


T4.8 Bias in a sampling method is 


- 


e. 


any difference between the sample result and the truth about the population. 


. the difference between the sample result and the truth about the population due to 


using chance to select a sample. 


any difference between the sample result and the truth about the population due to 
practical difficulties such as contacting the subjects selected. 


. any difference between the sample result and the truth about the population that 


tends to occur in the same direction whenever you use this sampling method. 


racism or sexism on the part of those who take the sample. 


T4.9 You wonder if TV ads are more effective when they are longer or repeated more often 
or both. So you design an experiment. You prepare 30-second and 60-second ads for a 
camera. Your subjects all watch the same TV program, but you assign them at random 
to four groups. One group sees the 30-second ad once during the program; another sees 
it three times; the third group sees the 60-second ad once; and the last group sees the 
60-second ad three times. You ask all subjects how likely they are to buy the camera. 
Which of the following best describes the design of this experiment? 


a. This is a randomized block design, but not a matched pairs design. 
b. 

C. 
d. 


e. 


This is a matched pairs design. 
This is a completely randomized design with one explanatory variable (factor). 
This is a completely randomized design with two explanatory variables (factors). 


This is a completely randomized design with four explanatory variables (factors). 


T4.10 Can texting make you healthier? Researchers randomly assigned 700 Australian adults 
to either receive usual health care or usual heath care plus automated text messages with 
positive messages, such as “Walking is cheap. It can be done almost anywhere. All you 
need is comfortable shoes and clothing.” The group that received the text messages 
showed a statistically significant increase in physical activity.22 What is the meaning of 
“statistically significant” in this context? 


a. The results of this study are very important. 


b. The results of this study should be generalized to all people. 


c. The difference in physical activity for the two groups is greater than 0. 


d. The difference in physical activity for the two groups is very large. 


e. The difference in physical activity for the two groups is larger than the difference 
that could be expected to happen by chance alone. 


T4.11 You want to know the opinions of American high school teachers on the issue of 
establishing a national proficiency test as a prerequisite for graduation from high 
school. You obtain a list of all high school teachers belonging to the National Education 
Association (the country’s largest teachers’ union) and mail a survey to a random 
sample of 2500 teachers. In all, 1347 of the teachers return the survey. Of those who 
responded, 32% say that they favor some kind of national proficiency test. Which of the 
following statements about this situation is true? 


a. Because random sampling was used, we can feel confident that the percent of all 
American high school teachers who would say they favor a national proficiency test 
is close to 32%. 


b. We cannot trust these results, because the survey was mailed. Only survey results 
from face-to-face interviews are considered valid. 


c. Because over half of those who were mailed the survey actually responded, we can 
feel fairly confident that the actual percent of all American high school teachers who 
would say they favor a national proficiency test is close to 32%. 


d. The results of this survey may be affected by undercoverage and nonresponse. 


e. The results of this survey cannot be trusted due to voluntary response bias. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T4.12 Elephants sometimes damage trees in Africa. It turns out that elephants dislike bees. 
They recognize beehives in areas where they are common and avoid them. Can this 
information be used to keep elephants away from trees? Researchers want to design an 
experiment to answer these questions using 72 acacia trees and three treatments: active 


hives, empty hives, and no hives. 


a. Identify the experimental units in this experiment. 
b. Explain why it is beneficial to include some trees that have no hives. 


c. Describe how the researchers could carry out a completely randomized design for 
this experiment. Include a description of how the treatments should be assigned. 


T4.13 Google and Gallup teamed up to survey a random sample of 1673 U.S. students in 
grades 7-12. One of the questions was “How confident are you that you could learn 
computer science if you wanted to?” Overall, 54% of students said they were very 


confident.22 


a. Identify the population and the sample. 


b. Explain why it was better to randomly select the students rather than putting the 
survey question on a website and inviting students to answer the question. 


c. Do you expect that the percent of all U.S. students in grades 7-12 who would say 
“very confident” is exactly 54%? Explain your answer. 


d. The report also broke the results down by gender. For this question, 62% of males 
and 48% of females said they were very confident. Which of the three estimates 
(54%, 62%, 48%) do you expect is closest to the value it is trying to estimate? 
Explain your answer. 


T4.14 Many people start their day with a jolt of caffeine from coffee or a soft drink. Most 
experts agree that people who consume large amounts of caffeine each day may suffer 
from physical withdrawal symptoms if they stop ingesting their usual amounts of 
caffeine. Researchers recruited 11 volunteers who were caffeine dependent and who 
were willing to take part in a caffeine withdrawal experiment. The experiment was 
conducted on two 2-day periods that occurred one week apart. During one of the 2-day 
periods, each subject was given a capsule containing the amount of caffeine normally 
ingested by that subject in one day. During the other study period, the subjects were 
given placebos. The order in which each subject received the two types of capsules was 
randomized. The subjects’ diets were restricted during each of the study periods. At the 
end of each 2-day study period, subjects were evaluated using a tapping task in which 


they were instructed to press a button 200 times as fast as they could.24 


Identify the explanatory and response variables in this experiment. 


> 


b. How was blocking used in the design of this experiment? What is the benefit of 
blocking in this context? 


c. Researchers randomized the order of the treatments to avoid confounding. Explain 
how confounding might occur if the researchers gave all subjects the placebo first 
and the caffeine second. In this context, what problem does confounding cause? 


d. Could this experiment have been carried out in a double-blind manner? Explain your 
answer. 


Chapter 4 Project Response Bias 


In this project, your team will design and conduct an experiment to investigate the effects of 
response bias in surveys.22 You may choose the topic for your surveys, but you must design 
your experiment so that it can answer at least one of the following questions. 


e Does the wording of a question affect the response? 


Do the characteristics of the interviewer affect the response? 
e Does anonymity change the responses to sensitive questions? 


e Does manipulating the answer choices affect the response? 


Can revealing other peoples’ answers to a question change the response? 


. Write a proposal describing the design of your experiment. Be sure to include the following 
items: 


a. Your chosen topic and which of the above questions you’|I try to answer. 


b. A detailed description of how you will obtain your subjects (minimum of 50). Your plan 
must be practical! 


c. An explanation of the treatments in your experiment and how you will determine which 
subjects get which treatment. 


d. A clear explanation of how you will implement your design. 


e. Precautions you will take to collect data ethically. 
Here are two examples of successful student experiments. 


“Make-Up,” by Caryn S. and Trisha T. (all questions asked to males): 


i. “Do you find females who wear makeup attractive?” (Questioner wearing makeup: 
75% answered “Yes.”’) 


ii. “Do you find females who wear makeup attractive?” (Questioner not wearing makeup: 
30% answered “Yes.”’) 


“Cartoons” by Sean W. and Brian H.: 
i. “Do you watch cartoons?” (90% answered “Yes.”) 
ii. “Do you still watch cartoons?” (60% answered “Yes.’’) 


’, Once your teacher has approved your design, carry out the experiment. Record your data in a 
table. 


3. Analyze your data. What conclusion do you draw? Provide appropriate graphical and 
numerical evidence to support your answer. 


|, Prepare a report that includes the data you collected, your analysis from Step 3, anda 
discussion of any problems you encountered and how you dealt with them. 


Cumulative AP® Practice Test 1 


Section I: Multiple Choice Choose the best answer for Questions AP1.1—AP1.14. 


AP1.1 You look at real estate ads for houses in Sarasota, Florida. Many houses have prices 
from $200,000 to $400,000. The few houses on the water, however, have prices up to 
$15 million. Which of the following statements best describes the distribution of home 
prices in Sarasota? 


a. The distribution is most likely skewed to the left, and the mean is greater than the 
median. 


b. The distribution is most likely skewed to the left, and the mean is less than the 
median. 


c. The distribution is roughly symmetric with a few high outliers, and the mean is 
approximately equal to the median. 


d. The distribution is most likely skewed to the right, and the mean is greater than the 
median. 


e. The distribution is most likely skewed to the right, and the mean is less than the 
median. 


AP1.2 A child is 40 inches tall, which places her at the 90th percentile of all children of 
similar age. The heights for children of this age form an approximately Normal 
distribution with a mean of 38 inches. Based on this information, what is the standard 
deviation of the heights of all children of this age? 


a. 0.20 inch 
b. 0.31 inch 
c. 0.65 inch 
d. 1.21 inches 
e. 1.56 inches 


AP1.3 A large set of test scores has mean 60 and standard deviation 18. If each score is 
doubled, and then 5 is subtracted from the result, the mean and standard deviation of the 
new scores are 


a. mean 115 and standard deviation 31. 
b. mean 115 and standard deviation 36. 


c. mean 120 and standard deviation 6. 
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. mean 120 and standard deviation 31. 


e. mean 120 and standard deviation 36. 


AP1.4 For a certain experiment, the available experimental units are eight rats, of which four 
are female (F1, F2, F3, F4) and four are male (M1, M2, M3, M4). There are to be four 
treatment groups, A, B, C, and D. If a randomized block design is used, with the 
experimental units blocked by gender, which of the following assignments of treatments 
is impossible? 


a. A = (Fl, M1), B — (F2, M2), C — (F3, M3), D = (F4, M4) 
b. A = (F1, M2), B — (F2, M3), C — (F3, M4), D — (F4, M1) 
c. A = (F1, M2), B = (F3, F2), C — (F4, M1), D = (M3, M4) 
d. A — (F4, M1), B - (F2, M3), C = (F3, M2), D - (F1, M4) 
e. A = (F4, M1), B = (F1, M4), C = (F3, M2), D = (F2, M3) 


AP1.5 For a biology project, you measure the weight in grams (g) and the tail length in 
millimeters (mm) of a group of mice. The equation of the least-squares line for 
predicting tail length from weight is 


predicted tail length = 20 + 3 x weightPredicted tail length = 20+ 3 x weight 
Which of the following is not correct? 


a. The slope is 3, which indicates that a mouse’s predicted tail length should increase 
by about 3 mm for each additional gram of weight. 


b. The predicted tail length of a mouse that weighs 38 grams is 134 millimeters. 


c. By looking at the equation of the least-squares line, you can see that the correlation 
between weight and tail length is positive. 


d. If you had measured the tail length in centimeters instead of millimeters, the slope of 
the regression line would have been 3/10 = 0.3. 


e. Mice that have a weight of 0 grams will have a tail of length 20 mm. 


AP1.6 The figure shows a Normal density curve. Which of the following gives the best 
estimates for the mean and standard deviation of this Normal distribution? 
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a. p = 200, 0= 50 


b. p= 200, 0 = 25 
c p= 225,0=50 
d. p=225,0=25 
e. p= 225,0=275 


AP1.7 The owner of a chain of supermarkets notices that there is a positive correlation 
between the sales of beer and the sales of ice cream over the course of the previous 
year. During seasons when sales of beer were above average, sales of ice cream also 
tended to be above average. Likewise, during seasons when sales of beer were below 
average, Sales of ice cream also tended to be below average. Which of the following 
would be a valid conclusion from these facts? 


a. Sales records must be in error. There should be no association between beer and ice 
cream sales. 


b. Evidently, for a significant proportion of customers of these supermarkets, drinking 
beer causes a desire for ice cream or eating ice cream causes a thirst for beer. 


c. A scatterplot of monthly ice cream sales versus monthly beer sales would show that 
a straight line describes the pattern in the plot, but it would have to be a horizontal 
line. 


d. There is a clear negative association between beer sales and ice cream sales. 


e. The positive correlation is most likely a result of the variable temperature; that is, as 
temperatures increase, so do both beer sales and ice cream sales. 


AP1.8 Here are the IQ scores of 10 randomly chosen fifth-grade students: 


145 139 126 122 125 130 96 110 118 118 


Which of the following statements about this data set is not true? 


a 


The student with an IQ of 96 is considered an outlier by the 1.5 x JQR rule. 
b. The five-number summary of the 10 IQ scores is 96, 118, 123.5, 130, 145. 


c. If the value 96 were removed from the data set, the mean of the remaining 9 IQ 
scores would be greater than the mean of all 10 IQ scores. 


d. If the value 96 were removed from the data set, the standard deviation of the 
remaining 9 IQ scores would be less than the standard deviation of all 10 IQ scores. 


e. If the value 96 were removed from the data set, the JQR of the remaining 9 IQ scores 
would be less than the JQR of all 10 IQ scores. 


AP1.9 Before he goes to bed each night, Mr. Kleen pours dishwasher powder into his 
dishwasher and turns it on. Each morning, Mrs. Kleen weighs the box of dishwasher 
powder. From an examination of the data, she concludes that Mr. Kleen dispenses a 
rather consistent amount of powder each night. Which of the following statements is 
true? 


I. There is a high positive correlation between the number of days that have passed 
since the box of dishwasher powder was opened and the amount of powder left in 
the box. 

II. A scatterplot with days since purchase as the explanatory variable and total amount 
of dishwasher powder used as the response variable would display a strong positive 
association. 

III. The correlation between the amount of powder left in the box and the amount of 
powder used should be —1. 


a. I only 
b. Il only 

c. I only 

d. II and II only 


e. I, IJ, and III 

.P1.10 The General Social Survey (GSS), conducted by the National Opinion Research Center 
at the University of Chicago, is a major source of data on social attitudes in the United 
States. Once each year, 1500 adults are interviewed in their homes all across the 
country. The subjects are asked their opinions about sex and marriage; attitudes toward 
women, welfare, foreign policy; and many other issues. The GSS begins by selecting a 
sample of counties from the 3000 counties in the country. The counties are divided into 
urban, rural, and suburban; a separate sample of counties is chosen at random from each 
group. This is a 
a. simple random sample. 
b. systematic random sample. 
c. cluster sample. 
d. stratified random sample. 


e. voluntary response sample. 

.P1.11 You are planning an experiment to determine the effect of the brand of gasoline and the 
weight of a car on gas mileage measured in miles per gallon. You will use a single test 
car, adding weights so that its total weight is 3000, 3500, or 4000 pounds. The car will 
drive on a test track at each weight using each of Amoco, Marathon, and Speedway 
gasoline. Which is the best way to organize the study? 


a. Start with 3000 pounds and Amoco and run the car on the test track. Then do 3500 
and 4000 pounds. Change to Marathon and go through the three weights in order. 
Then change to Speedway and do the three weights in order once more. 

b. Start with 3000 pounds and Amoco and run the car on the test track. Then change to 


Marathon and then to Speedway without changing the weight. Then add weights to 
get 3500 pounds and go through the three gasolines in the same order. Then change 


to 4000 pounds and do the three gasolines in order again. 


c. Choose a gasoline at random, and run the car with this gasoline at 3000, 3500, and 
4000 pounds in order. Choose one of the two remaining gasolines at random and 
again run the car at 3000, then 3500, then 4000 pounds. Do the same with the last 
gasoline. 


d. There are nine combinations of weight and gasoline. Run the car several times using 
each of these combinations. Make all these runs in random order. 


e. Randomly select an amount of weight and a brand of gasoline, and run the car on the 
test track. Repeat this process a total of 30 times. 


.P1.12 A linear regression was performed using the following five data points: A(2, 22), B(10, 
4), C(6, 14), D(14, 2), E(18, —4). The residual for which of the five points has the 
largest absolute value? 


a A 
b. B 
ec. G 
d. D 
e E 


.P1.13 The frequency table summarizes the distribution of time that 140 patients at the 
emergency room of a small-city hospital waited to receive medical attention during the 


last month. 

Waiting time Frequency 
Less than 10 minutes 5 

At least 10 but less than 20 minutes 24 

At least 20 but less than 30 minutes 45 

At least 30 but less than 40 minutes 38 

At least 40 but less than 50 minutes 19 


At least 50 but less than 60 minutes 
At least 60 but less than 70 minutes 


Which of the following represents possible values for the median and JQR of waiting 
times for the emergency room last month? 


a. median = 27 minutes and JQR = 15 minutes 
b. median = 28 minutes and JQR = 25 minutes 
c. median = 31 minutes and JQR = 35 minutes 
d. median = 35 minutes and JQR = 45 minutes 
e. median = 45 minutes and JQR = 55 minutes 


.P1.14 Boxplots of two data sets are shown. 


Plot 1) +} 


Based on the boxplots, which of the following is true? 
a. The range of both plots is about the same. 
b. The means of both plots are approximately equal. 
c. Plot 2 contains more data points than Plot 1. 

d. The medians are approximately equal. 


e. Plot 1 is more symmetric than Plot 2. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


.P1.15 The manufacturer of exercise machines for fitness centers has designed two new 
elliptical machines that are meant to increase cardiovascular fitness. The two machines 
are being tested on 30 volunteers at a fitness center near the company’s headquarters. 
The volunteers are randomly assigned to one of the machines and use it daily for two 
months. A measure of cardiovascular fitness is administered at the start of the 
experiment and again at the end. The following stemplot contains the differences (After 
— Before) in the two scores for the two machines. Note that higher scores indicate larger 
gains in fitness. 


Machine A Machine B 


54 
876320 
97411 
61 


Key: 2| 1 represents a 
difference (After — Before) 
of 21 in fitness scores. 


a. Write a few sentences comparing the distributions of cardiovascular fitness gains 
from the two elliptical machines. 


b. Which machine should be chosen if the company wants to advertise it as achieving 
the highest overall gain in cardiovascular fitness? Explain your reasoning. 


c. Which machine should be chosen if the company wants to advertise it as achieving 
the most consistent gain in cardiovascular fitness? Explain your reasoning. 


d. Give one reason why the advertising claims of the company (the scope of inference) 
for this experiment would be limited. Explain how the company could broaden that 
scope of inference. 


.P1.16 Those who advocate for monetary incentives in a work environment claim that this type 
of incentive has the greatest appeal because it allows the winners to do what they want 
with their winnings. Those in favor of tangible incentives argue that money lacks the 
emotional appeal of, say, a weekend for two at a romantic country inn or elegant hotel, 
or a weeklong trip to Europe. 

A few years ago a national tire company, in an effort to improve sales of a new line of 
tires, decided to test which method—offering cash incentives or offering non-cash 
prizes such as vacations—was more successful in increasing sales. The company had 60 
retail sales districts of various sizes across the country and data on the previous sales 
volume for each district. 


a. Describe a completely randomized design using the 60 retail sales districts that would 
help answer this question. 


b. Explain how you would use the following excerpt from the table of random digits to 
do the random assignment that your design requires. Then use your method to make 
the first three assignments. 


07511 88915 41267 16853 84569 79367 32337 03316 
81486 69487 60513 09297 00412 71238 27649 39950 


c. One of the company’s officers suggested that it would be better to use a matched 
pairs design instead of a completely randomized design. Explain how you would 
change your design to accomplish this. 


.P1.17 In retail stores, there is a lot of competition for shelf space. There are national brands 
for most products, and many stores carry their own line of in-house brands, too. 
Because shelf space is not infinite, the question is how many linear feet to allocate to 
each product and which shelf (top, bottom, or somewhere in the middle) to put it on. 
The middle shelf is the most popular and lucrative, because many shoppers, if 
undecided, will simply pick the product that is at eye level. 


A local store that sells many upscale goods is trying to determine how much shelf space 
to allocate to its own brand of men’s personal-grooming products. The middle shelf 
space is randomly varied between 3 and 6 linear feet over the next 12 weeks, and 
weekly sales revenue (in dollars) from the store’s brand of personal-grooming products 
for men is recorded. Here is some computer output from the study, along with a 


scatterplot: 

Predictor Coef SE Coef T P 
Constant 317.940 31.32 10.15 0.000 
Shelf 152.680 6.445 23.69 0.000 
length 


S = 22,9212 R-Sq = 98.2% R-Sq(adj) = 98.1% 
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e. 
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Describe the relationship between shelf length and sales. 


. Write the equation of the least-squares regression line. Be sure to define any 


variables you use. 


If the store manager were to decide to allocate 5 linear feet of shelf space to the 
store’s brand of men’s grooming products, what is the best estimate of the weekly 
sales revenue? 


Interpret the value of s. 


Identify and interpret the coefficient of determination. 


.P1.18 The manager of the store in the preceding exercise calculated the residual for each point 
in the scatterplot and made a dotplot of the residuals. The distribution of residuals is 
roughly Normal with a mean of $0 and standard deviation of $22.92. 


a. 


What percent of the actual sales amounts do you expect to be within $5 of their 
expected sales amount? 


. The middle 95% of residuals should be between which two values? Use this 


information to give an interval of plausible values for the weekly sales revenue if 5 
linear feet are allocated to the store’s brand of men’s grooming products. 


Chapter 5 Probability: What Are the 


Chances? 
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Chapter 5 Review Exercises 


Chapter 5 AP® Statistics Practice Test 


INTRODUCTION 


Chance is all around us. You and your friend play rock-paper-scissors to determine who gets 
the last slice of pizza. A coin toss decides which team gets to receive the ball first in a football 
game. Many adults regularly play the lottery, hoping to win a big jackpot with a few lucky 
numbers. Others head to casinos or racetracks, hoping that some combination of luck and skill 
will pay off. People young and old play games of chance involving cards or dice or spinners. 
The traits that children inherit—hair and eye color, blood type, handedness, dimples, whether 
they can roll their tongues—are determined by the chance involved in which genes their 
parents pass along. 

The mathematics of chance behavior is called probability. Probability is the topic of this 
chapter. Here is an activity that gives you some idea of what lies ahead. 


ACTIVITY The “1 in 6 wins” game 


In this activity, you and your classmates will use simulation to test whether a company’s 
claim is believable. 

As a special promotion for its 20-ounce bottles of soda, a soft drink company printed a 
message on the inside of each bottle cap. Some of the caps said, “Please try again!” while 
others said, “You’re a winner!” The company advertised the promotion with the slogan “1 in 
6 wins a prize.” The prize is a free 20-ounce bottle of soda. 

Grayson’s statistics class wonders if the company’s claim holds true for the bottles at a 
nearby convenience store. To find out, all 30 students in the class go to the store and each 
student buys one 20-ounce bottle of the soda. Two of them get caps that say, “You’re a 
winner!” Does this result give convincing evidence that the company’s 1-in-6 claim is 
inaccurate? 

For now, let’s assume that the company is telling the truth, and that every 20-ounce 
bottle of soda it fills has a 1-in-6 chance of getting a cap that says, “You’re a winner!” We 
can model the status of an individual bottle with a six-sided die: let 1 through 5 represent 
“Please try again!” and 6 represent “You’re a winner!” 


1. Roll your die 30 times to imitate the process of the students in Grayson’s statistics class 
buying their sodas. How many of them won a prize? 

2. Your teacher will draw and label axes for a class dotplot. Plot on the graph the number of 
prize winners you got in Step 1. 

3. Repeat Steps 1 and 2, if needed, to get a total of at least 40 repetitions of the simulation for 
your class. 

4. Discuss the results with your classmates. What percent of the time did the simulation yield 


two or fewer prize winners in a class of 30 students, just by chance? Does it seem 
plausible (believable) that the company is telling the truth, but that the class just got 


unlucky? Or is there convincing evidence that the 1-in-6 claim is wrong? Explain your 
reasoning. 
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As the activity shows, simulation is a powerful method for modeling chance behavior. 
Section 5.1 begins by examining the idea of probability and then illustrates how simulation can 
be used to estimate probabilities. In Sections 5.2 and 5.3, we develop the basic rules and 
techniques of probability. 

Probability calculations are the basis for inference. When we produce data by random 
sampling or randomized comparative experiments, the laws of probability answer the question 
“What would happen if we repeated the random sampling or random assignment process many 
times?” Many of the examples, exercises, and activities in this chapter focus on the connection 
between probability and inference. 


SECTION 5.1 Randomness, Probability, and Simulation 


LEARNING TARGETS By the end of the section, you should be able to: 


e Interpret probability as a long-run relative frequency. 


e Use simulation to model chance behavior. 


Imagine tossing a coin 10 times. How likely are you to get a run of 3 or more consecutive 
heads? An airline knows that a certain percent of customers who purchase tickets will not show 
up for a flight. If the airline overbooks a particular flight, what are the chances that they’ ll have 
enough seats for the passengers who show up? A couple plans to have children until they have 
at least one boy and one girl. How many children should they expect to have? To answer these 
questions, you need a better understanding of how chance behavior operates. 


The Idea of Probability 


In football, a coin toss helps determine which team gets the ball first. Why do the rules of 
football require a coin toss? Because tossing a coin seems a “fair” way to decide. What does 
that mean exactly? The following activity should help shed some light on this question. 


ACTIVITY What is probability? 


If you toss a fair coin, what’s the probability that it shows heads? It’s 1/2, or 0.5, right? But 
what does probability 1/2 really mean? In this activity, you will investigate by flipping a coin 
several times. 


1. Toss your coin once. Record whether you get heads or tails. 


2. Toss your coin a second time. Record whether you get heads or tails. What proportion of 
your first two tosses is heads? 


3. Toss your coin 8 more times so that you have 10 tosses in all. Record whether you get 
heads or tails on each toss in a table like the one that follows. 


4. Calculate the overall proportion of heads after each toss and record these values in the 
bottom row of the table. For instance, suppose you got tails on the first toss and heads on 
the second toss. Then your overall proportion of heads would be 0/1 = 0.00 after the first 
toss and 1/2 = 0.50 after the second toss. 
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pePLe> 
¢ Let’s use technology to speed things up. Launch the Idea of Probability applet at 


highschool.bfwpub.com/tps6e. Set the number of tosses to 10 and click toss. What 
proportion of the tosses were heads? Click “Reset” and toss the coin 10 more times. What 
proportion of heads did you get this time? Repeat this process several more times. What 
do you notice? 


Probability of Heads: 0.5 


Heads = 4/10 = 0.400 
ey | 


Tails = 6/10 = 0.600 


Number of Tosses: 10 


Sa 1 


™ Show true probability 


0.7 
Heads 
0.5 
RESET 
| Reser | 0.4 


. What if you toss the coin 100 times? Reset the applet and have it do 100 tosses. Is the 
proportion of heads exactly equal to 0.5? Close to 0.5? 


. Keep on tossing without hitting “Reset.” What happens to the proportion of heads? 


. Asa class, discuss what the following statement means: “If you toss a fair coin, the 
probability of heads is 0.5.” 


. If you toss a coin, it can land heads or tails. If you “toss” a thumbtack, it can land with the 
point sticking up or with the point down. Does that mean the probability of a tossed 
thumbtack landing point up is 0.5? How can you find out? Discuss with your classmates. 


Figure 5.1 shows some results from the preceding activity. The proportion of tosses that 


land heads varies from 0.30 to 1.00 in the first 10 tosses. As we make more and more tosses, 
however, the proportion of heads gets closer to 0.5 and stays there. 


Proportion of heads 
Proportion of heads 


1 500 
(a) Number of tosses (b) Number of tosses 

FIGURE 5.1 (a) The proportion of heads in the first 10 tosses of a coin. (b) The proportion of 
heads in the first 500 tosses of a coin. 


When we watch coin tosses or the results of random sampling and random assignment 
closely, a remarkable fact emerges: Chance behavior is unpredictable in the short run but has a 
regular and predictable pattern in the long run. This is the basis for the idea of probability. 


DEFINITION Probability 


The probability of any outcome of a chance process is a number between 0 and 
1 that describes the proportion of times the outcome would occur in a very long 
series of repetitions. 


A repetition of a chance process is sometimes called a trial. 


Outcomes that never occur have probability 0. An outcome that happens on every repetition 
has probability 1. An outcome that happens half the time in a very long series of trials has 
probability 0.5. 

The fact that the proportion of heads in many tosses eventually closes in on 0.5 is 
guaranteed by the law of large numbers. You can see this in Figure 5.1(b). The horizontal line 
represents the probability, and the proportion of heads in the simulation approaches this value 
as the number of repetitions becomes large. 


DEFINITION Law of large numbers 


The law of large numbers says that if we observe more and more repetitions of 
any chance process, the proportion of times that a specific outcome occurs 
approaches its probability. 


Probability gives us a language to describe the long-term regularity of chance behavior that 
is guaranteed by the law of large numbers. The outcome of a coin toss and the sex of the next 
baby born in a local hospital are both random. So is the result of a random sample or a random 
assignment. Even life insurance is based on the fact that deaths occur at random among many 
individuals. Because men are more likely to die at a younger age than women, insurance 
companies sometimes charge a man up to 3 times more for a life insurance policy than a 


woman of the same age. 


Recall from Chapter 4 that random doesn’t mean “haphazard.” In statistics, random means “by 


chance.” 


EXAMPLE |Who drinks coffee? (>) 
Interpreting probability 


A. and |. Kruk/Shutterstock.com 
PROBLEM: According to The Book of Odds, the probability that a randomly selected U.S. 
adult drinks coffee on a given day is 0.56. 
a. Interpret this probability as a long-run relative frequency. 
b. If a researcher randomly selects 100 U.S. adults, will exactly 56 of them drink coffee that 
day? Explain your answer. 


SOLUTION: 


a. apr you take a very large random sample of U.S. adults, about 56% of them will drink 
coffee that day. 


The chance process consists of randomly selecting a U.S. adult and recording 


whether or not the person drinks coffee that day. 


b. Probably not. With only 100 randomly selected adults, the number who drink coffee 
that day may not be very close to 56. 


Probability describes what happens in many, many repetitions (way more than 


100) of a chance process. 


FOR PRACTICE, TRY EXERCISE 1 


Life insurance companies, casinos, and others who make important decisions based on 
probability rely on the long-run predictability of random behavior. 


UNDERSTANDING RANDOMNESS The idea of probability seems straightforward. It 
answers the question “What would happen if we did this many times?” Understanding chance 
behavior is important for making decisions, especially when our data collection process 
includes random sampling or random assignment. But understanding randomness isn’t always 
easy, as the following activity illustrates. 


ACTIVITY Investigating randomness 


In this activity, you and your classmates will test your ability to imitate chance behavior. 


1. Pretend that you are flipping a fair coin. Without actually flipping a coin, imagine the first 
toss. Write down the result you see in your mind, heads (H) or tails (T). 


2. Imagine a second coin flip. Write down the result. 


3. Keep doing this until you have recorded the results of 50 imaginary flips. Write your 
results in groups of 5 to make them easier to read, like this: HTHTH TTHHT, and so on. 


4. Arunis a repetition of the same result. In the example in Step 3, there is a run of two tails 
followed by a run of two heads in the first 10 coin flips. Read through your 50 imagined 
coin flips and find the longest run. 


5. Your teacher will draw and label a number line for a class dotplot. Plot on the graph the 
length of the longest run you got in Step 4. 


6. Use an actual coin, Table D, or technology to generate a similar list of 50 coin flips. Find 
the longest run that you have. 


7. Your teacher will draw and label a number line with the same scale immediately above or 
below the one in Step 5. Plot on the new dotplot the length of the longest run you got in 
Step 6. 


8. Compare the distributions of longest run from imagined tosses and random tosses. What 
do you notice? 


Eyebyte/Alamy 


The idea of probability is that randomness is predictable in the long run. Unfortunately, our 
intuition about chance behavior tries to tell us that randomness should also be predictable in the 
short run. When it isn’t, we look for some explanation other than chance variation. 


Suppose you toss a coin 6 times and get TTTTTT. Some people think that the next toss 
must be more likely to give a head. It’s true that in the long run, heads will appear half the time. 
What is a myth is that future outcomes must make up for an imbalance like six straight tails. 


Coins and dice have no memories. A coin doesn’t know that the first 6 outcomes were tails, 
and it can’t try to get a head on the next toss to even things out. Of course, things do even out 
in the long run. That’s the law of large numbers in action. After 10,000 tosses, the results of the 
first six tosses don’t matter. They are overwhelmed by the results of the next 9994 tosses. 


Some people use the phrase /aw of averages to refer to the misguided belief that the results of a 


chance process have to even out in the short run. 


When asked to predict the sex—boy (B) or girl (G)—of the next seven babies born in a 
local hospital, most people will guess something like B-G-B-G-B-G-G. Few people would say 
G-G-G-B-B-B-G because this sequence of outcomes doesn’t “look random.” In fact, these two 
sequences of births are equally likely. “Runs” consisting of several of the same outcomes in a 
row are surprisingly common in chance behavior. Many students are not aware of this fact 
when they imagine a sequence of 50 coin tosses in the “Investigating randomness” activity! 


Is there such a thing as a “hot hand” in basketball? Belief that runs must result from 
something other than “just chance” influences behavior. If a basketball player makes several 
consecutive shots, both the fans and her teammates believe that she has a “hot hand” and is 
more likely to make the next shot. Several early studies of the hot hand theory showed that runs 


of baskets made or missed are no more frequent in basketball than would be expected if the 
result of each shot is unrelated to the outcomes of the player’s previous shots. If a player makes 


half her shots in the long run, her made shots and misses behave just like tosses of a coin— 


which means that runs of makes and misses are more common than our intuition expects. 


Two more recent studies provide some evidence that there is a small hot hand effect for 
basketball players. These studies also suggest that “hot” shooters take riskier shots, which then 
masks the hot hand effect.2 


CHECK YOUR UNDERSTANDING 


‘ 
y 


1. Pedro drives the same route to work on Monday through Friday. His route includes one 
traffic light. According to the local traffic department, there is a 55% probability that the 
light will be red when Pedro reaches the light. Interpret the probability. 


2. Probability is a measure of how likely an outcome is to occur. Match one of the 
probabilities that follow with each statement. Be prepared to defend your answer. 


00.0010.30.60.9919 9-001 0.3 0.6 0.99 1 


a. This outcome is impossible. It can never occur. 
b. This outcome is certain. It will occur on every trial. 


c. This outcome is very unlikely, but it will occur once in a while in a long sequence of 
trials. 


d. This outcome will occur more often than not. 


3. A husband and wife decide to have children until they have at least one child of each sex. 
The couple has had seven girls in a row. Their doctor assures them that they are much 
more likely to have a boy next. Explain why the doctor is wrong. 


Simulation 


We can model chance behavior and estimate probabilities with a simulation. 


DEFINITION Simulation 


Simulation is the imitation of chance behavior, based on a model that accurately 
reflects the situation. 


You already have some experience with simulations. In the “Hiring discrimination—it just 
won’t fly!” activity in Chapter 1 (page 6), you drew beads or slips of paper to imitate a random 
lottery to choose which pilots would become captains. The “Analyzing the caffeine 
experiment” activity in Chapter 4 (page 273) asked you to shuffle and deal piles of cards to 
mimic the random assignment of subjects to treatments. The “1 in 6 wins” game that opened 
this chapter had you roll a die several times to simulate buying 20-ounce sodas and looking 
under the cap. 


The goal in each of these cases was to use simulation to answer a question of interest about 
some chance process. Different chance “devices” were used to perform the simulations—beads, 
slips of paper, cards, or dice. But the same basic strategy was followed each time. 


THE SIMULATION PROCESS 


e Describe how to use a chance device to imitate one trial (repetition) of the simulation. 


Tell what you will record at the end of each trial. 
e Perform many trials of the simulation. 


e Use the results of your simulation to answer the question of interest. 


For the 1-in-6 wins game, we wanted to estimate the probability of getting two or fewer 
prize winners in a class of 30 students if the company’s 1-in-6 wins claim is true. 


e We rolled a six-sided die 30 times to determine the outcome for each person’s bottle of soda: 
6 = wins a prize, 1 to 5 = no prize, and recorded the number of winners. The dotplot shows 
the number of winners in 40 trials of this simulation. 


e In 4 of the 40 trials, two or fewer of the students won a prize. So our estimate of the 
probability is 4/40 = 0.10 = 10%. According to these results, getting 2 winners isn’t very 
likely, but it isn’t unusual enough to conclude that the company is lying. 


) 2 4 6 8 10 12 


Simulated number of winners in a class of 30 students 


EXAMPLE |NASCAR cards and cereal boxes (>) 


Performing simulations 


Daniel Shirey/Getty Images 


PROBLEM: In an attempt to increase sales, a breakfast cereal company decides to offer a 
NASCAR promotion. Each box of cereal will contain a collectible card featuring one of the 
following NASCAR drivers: Joey Lagano, Kevin Harvick, Chase Elliott, Danica Patrick, or 
Jimmie Johnson. The company claims that each of the 5 cards is equally likely to appear in 
any box of cereal. A NASCAR fan decides to keep buying boxes of the cereal until she has 
all 5 drivers’ cards. She is surprised when it takes her 23 boxes to get the full set of cards. 
Does this outcome provide convincing evidence that the 5 cards are not equally likely? To 
help answer this question, we want to perform a simulation to estimate the probability that it 
will take 23 or more boxes to get a full set of 5 NASCAR collectible cards. 


a. Describe how to use a random number generator to perform one trial of the simulation. 
The dotplot shows the number of cereal boxes it took to get all 5 drivers’ cards in 50 trials. 
b. Explain what the dot at 20 represents. 


c. Use the results of the simulation to estimate the probability that it will take 23 or more 
boxes to get a full set of cards. Does this outcome provide convincing evidence that the 5 
cards are not equally likely? 


Simulated number of boxes required 
SOLUTION: 


Guectk= Joey Lagano, 2 = Kevin Harvick, 3 = Chase Elliott, 4 = Danica Patrick, and 5 
= Jimmie Johnson. Generate a random integer from 1 to 5 to simulate buying one box 
of cereal and looking at which card is inside. ‘Keep generating random integers until all 
five labels from 1 to 5 appear. Record the number of boxes it takes to get all 5 cards. 


You can use the command Randint(1,5) on the TI-83/84 to generate a random 


integer from 1 to 5. 


b. One trial where it took 20 boxes to get all 5 drivers’ cards. 


Be sure to tell what you will record at the end of a trial. 


ove ca oO = 
G. Probabilitysoso=o }OPAbility ™~ 50 a so there’s about a 0% chance it would take 23 or 


more boxes to get a full set. Because it 1s so unlikely that it would take 2 3 or more boxes 
to get a full set, this result provides convincing evidence that the 5 NASCAR drivers’ 
cards are not equally likely to appear in each box of cereal. 


FOR PRACTICE, TRY EXERCISE 9 


It took our NASCAR fan 23 boxes to complete the set of 5 cards. Does that mean the 
company didn’t tell the truth about how the cards were distributed? Not necessarily. Our 
simulation says that it’s very unlikely for someone to have to buy 23 boxes to get a full set if 
each card is equally likely to appear in a box of cereal. The evidence suggests that the 
company’s statement is incorrect. It’s still possible, however, that the NASCAR fan was just 
very unlucky. 

Here’s one more example that shows the simulation process in action. 


EXAMPLE | Golden ticket parking lottery (>) 


Performing simulations 


Ann Heath 


PROBLEM: Ata local high school, 95 students have permission to park on campus. Each 


month, the student council holds a “golden ticket parking lottery” at a school assembly. The 
two lucky winners are given reserved parking spots next to the school’s main entrance. Last 
month, the winning tickets were drawn by a student council member from the AP® Statistics 
class. When both golden tickets went to members of that same class, some people thought 
the lottery had been rigged. There are 28 students in the AP® Statistics class, all of whom are 
eligible to park on campus. We want to perform a simulation to estimate the probability that 
a fair lottery would result in two winners from the AP® Statistics class. 


a. Describe how you would use a table of random digits to carry out this simulation. 


b. Perform 3 trials of the simulation using the random digits given. Make your procedure 
clear so that someone can follow what you did. 
70708 41098 55181 94904 43563 56934 48394 51719 
70708 41098 55181 94904 43563 56934 48394 51719 
c. In 9 of the 100 trials of the simulation, both golden tickets were won by members of the 


AP® Statistics class. Do these results give convincing evidence that the lottery was not 
carried out fairly? Explain your reasoning. 


SOLUTION: 


a. Label the students in the AP® Statistics class from 01 to 28, and label the remaining 
students from 29 to 95. Numbers from 96 to oo will be skipped. Moving left to right 
across a row, look at pairs of digits until we come across two different labels from 01 to 
95. The two students with these labels win the prime parking spaces. Record whether or 
not both winners come from the AP® Statistics class. Perform many simulated lotteries. 
See what ‘percent of the time both winners come from this statistics class. 


Describe how to use a chance device to imitate one trial of the simulation. Tell 


what you will record at the end of each trial. 


Hi 904 43563 56934 48394 51719 
x Rpt YT00X\Y ¥ 


Perform many trials. 


There was one trial out of the first 3 inwhich both golden parking tickets went to 


members of the AP® Statistics class. 


Use the results of your simulation to answer the question of interest. 


c. No; there’s about a 9% chance of. getting both winners from the AP® Statistics class in 
a fair lottery. Because this probability isn’t very small, we don’t have convincing 
evidence that the lottery was unfair. Outcomes like this could occur by chance alone in a 


fair lottery. 


Does that mean the lottery was conducted fairly? Not necessarily. 


FOR PRACTICE, TRY EXERCISE 11 


In the golden ticket lottery example, we ignored repeated numbers from 01 to 95 within a 
given repetition. That’s because the chance process involved sampling students without 
replacement. In the NASCAR example, we allowed repeated numbers from 1 to 5 in a given 
repetition. That’s because we were selecting a small number of cards from a very large 
population of cards in thousands of cereal boxes. So the probability of getting, say, a Danica 
Patrick card in the next box of cereal was still very close to 1/5 even if we had already selected 
a Danica Patrick card. 


AP® EXAM TIP 


On the AP® Statistics exam, you may be asked to describe how to perform a 
simulation using rows of random digits. If so, provide a clear enough description of 
your process for the reader to get the same results from only your written 
explanation. Remember that every label needs to be the same length. In the golden 
ticket lottery example, the labels should be 01 to 95 (all two digits), not 1 to 95. 
When sampling without replacement, be sure to mention that repeated numbers 
should be ignored. 


CHECK YOUR UNDERSTANDING 


. 


,, A basketball announcer suggests that a certain player is a streaky shooter. That is, the 
announcer believes that if the player makes a shot, the player is more likely to make the next 
shot. As evidence, the announcer points to a recent game where the player took 30 shots and 
had a streak of 10 made shots in a row. Is this convincing evidence of streaky shooting by 


the player? Assume that this player makes 50% of the shots and that the results of a shot 
don’t depend on previous shots. 


1. Describe how you would carry out a simulation to estimate the probability that a 50% 
shooter who takes 30 shots in a game would have a streak of 10 or more made shots. 


The dotplot displays the results of 50 simulated games in which this player took 30 shots. 
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2. Explain what the two dots above 9 indicate. 


3. What conclusion would you draw about whether this player was streaky? Explain your 
answer. 


Section 5.1 Summary 


e Chance behavior is unpredictable in the short run but has a regular and predictable pattern in 
the long run. 


e The long-run relative frequency of an outcome after many repetitions of a chance process is 
its probability. A probability is a number between 0 (never occurs) and 1 (always occurs). 


e The law of large numbers says that in many repetitions of the same chance process, the 
proportion of times that a particular outcome occurs will approach its probability. 


e Simulation can be used to imitate chance behavior and to estimate probabilities. To perform 
a simulation: 


= Describe how to use a chance device to imitate one trial of the simulation. Tell what you 
will record at the end of each trial. 


» Perform many trials of the simulation. 


=: Use the results of your simulation to answer the question of interest. 


Section 5.1 Exercises 


1. pg 302 ») Another commercial If Aaron tunes into his favorite radio station at a 


randomly selected time, there is a 0.20 probability that a commercial will be playing. 


a. Interpret this probability as a long-run relative frequency. 


b. If Aaron tunes into this station at 5 randomly selected times, will there be exactly one 
time when a commercial is playing? Explain your answer. 
Genetics There are many married couples in which the husband and wife both carry a 


gene for cystic fibrosis but don’t have the disease themselves. Suppose we select one of 
these couples at random. According to the laws of genetics, the probability that their first 


child will develop cystic fibrosis is 0.25. 
a. Interpret this probability as a long-run relative frequency. 


b. If researchers randomly select 4 such couples, is one of these couples guaranteed to 
have a first child who develops cystic fibrosis? Explain your answer. 

Mammograms Many women choose to have annual mammograms to screen for breast 

cancer after age 40. A mammogram isn’t foolproof. Sometimes the test suggests that a 


woman has breast cancer when she really doesn’t (a “false positive”). Other times the test 
says that a woman doesn’t have breast cancer when she actually does (a “false negative”). 


Suppose the false negative rate for a mammogram is 0.10. 


a. Explain what this probability means. 


b. Which is a more serious error in this case: a false positive or a false negative? Justify 
your answer. 


4. Liar, liar! Sometimes police use a lie detector test to help determine whether a suspect is 
telling the truth. A lie detector test isn’t foolproof—sometimes it suggests that a person is 
lying when he or she is actually telling the truth (a “false positive”). Other times, the test 
says that the suspect is being truthful when he or she is actually lying (a “false negative”). 
For one brand of lie detector, the probability of a false positive is 0.08. 


a. Explain what this probability means. 
b. Which is a more serious error in this case: a false positive or a false negative? Justify 


your answer. 


5. Three pointers The figure shows the results of a basketball player attempting many 3- 
point shots. Explain what this graph tells you about chance behavior in the short run and 


long run. 


Percent made 


Number of shots 


6. Keep on tossing The figure shows the results of two different sets of 5000 coin tosses. 
Explain what this graph tells you about chance behavior in the short run and the long run. 
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7. Anunenlightened gambler 


a. A gambler knows that red and black are equally likely to occur on each spin of a 
roulette wheel. He observes that 5 consecutive reds have occurred and bets heavily on 
black at the next spin. Asked why, he explains that “black is due.” Explain to the 
gambler what is wrong with this reasoning. 


b. After hearing you explain why red and black are still equally likely after 5 reds on the 
roulette wheel, the gambler moves to a card game. He is dealt 5 straight red cards from 
a standard deck with 26 red cards and 26 black cards. He remembers what you said and 
assumes that the next card dealt in the same hand is equally likely to be red or black. 
Explain to the gambler what is wrong with this reasoning. 


8. Due for a hit A very good professional baseball player gets a hit about 35% of the time 
over an entire season. After the player failed to hit safely in six straight at-bats, a TV 
commentator said, “He is due for a hit.” Explain why the commentator is wrong. 


9. pg 305 ») Will Luke pass the quiz? Luke’s teacher has assigned each student in his 


class an online quiz, which is made up of 10 multiple-choice questions with 4 options 
each. Luke hasn’t been paying attention in class and has to guess on each question. 
However, his teacher allows each student to take the quiz three times and will record the 
highest of the three scores. A passing score is 6 or more correct out of 10. We want to 
perform a simulation to estimate the score that Luke will earn on the quiz if he guesses at 
random on all the questions. 


a. Describe how to use a random number generator to perform one trial of the simulation. 


The dotplot shows Luke’s simulated quiz score in 50 trials of the simulation. 
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Simulated quiz score 
b. Explain what the dot at 1 represents. 
c. Use the results of the simulation to estimate the probability that Luke passes the quiz. 


d. Doug is in the same class and claims to understand some of the material. If he scored 8 
points on the quiz, is there convincing evidence that he understands some of the 
material? Explain your answer. 


10. Double fault! A professional tennis player claims to get 90% of her second serves in. Ina 
recent match, the player missed 5 of her first 20 second serves. Is this a surprising result if 
the player’s claim is true? Assume that the player has a 0.10 probability of missing each 
second serve. We want to carry out a simulation to estimate the probability that she would 
miss 5 or more of her first 20 second serves. 


a. Describe how to use a random number generator to perform one trial of the simulation. 


The dotplot displays the number of second serves missed by the player out of the first 
20 second serves in 100 simulated matches. 
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b. Explain what the dot at 6 represents. 


c. Use the results of the simulation to estimate the probability that the player would miss 5 
or more of her first 20 second serves in a match. 


d. Is there convincing evidence that the player misses more than 10% of her second 
serves? Explain your answer. 


11. 306 Airport security The Transportation Security Administration (TSA) is 
pg 


12. 


13. 


responsible for airport safety. On some flights, TSA officers randomly select passengers 
for an extra security check prior to boarding. One such flight had 76 passengers—12 in 
first class and 64 in coach class. Some passengers were surprised when none of the 10 
passengers chosen for screening were seated in first class. We want to perform a 
simulation to estimate the probability that no first-class passengers would be chosen in a 
truly random selection. 


a. Describe how you would use a table of random digits to carry out this simulation. 


b. Perform one trial of the simulation using the random digits that follow. Copy the digits 
onto your paper and mark directly on or above them so that someone can follow what 
you did. 


71487 09984 29077 14863 61683 47052 62224 51025 


c. In 15 of the 100 trials of the simulation, none of the 10 passengers chosen was seated in 
first class. Does this result provide convincing evidence that the TSA officers did not 
carry out a truly random selection? Explain your answer. 


Scrabble In the game of Scrabble, each player begins by randomly selecting 7 tiles from a 
bag containing 100 tiles. There are 42 vowels, 56 consonants, and 2 blank tiles in the bag. 
Cait chooses her 7 tiles and is surprised to discover that all of them are vowels. We want 
to perform a simulation to determine the probability that a player will randomly select 7 
vowels. 


a. Describe how you would use a table of random digits to carry out this simulation. 


b. Perform one trial of the simulation using the random digits given. Copy the digits onto 
your paper and mark directly on or above them so that someone can follow what you 
did. 


00694 05977 19664 65441 20903 62371 22125 53340 


c. In 2 of the 1000 trials of the simulation, all 7 tiles were vowels. Does this result give 
convincing evidence that the bag of tiles was not well mixed? 


Bull’s-eye! In a certain archery competition, each player continues to shoot until he or she 
misses the center of the target twice. Quinn is one of the archers in this competition. Based 


14. 


on past experience, she has a 0.60 probability of hitting the center of the target on each 
shot. We want to design a simulation to estimate the probability that Quinn stays in the 
competition for at least 10 shots. Describe how you would use each of the following 
chance devices to perform one trial of the simulation. 


a. Slips of paper 
b. Random digits table 


c. Random number generator 


Free-throw practice At the end of basketball practice, each player on the team must shoot 
free throws until he makes 10 of them. Dwayne is a 70% free-throw shooter. That is, his 
probability of making any free throw is 0.70. We want to design a simulation to estimate 
the probability that Dwayne makes 10 free throws in at most 12 shots. Describe how you 
would use each of the following chance devices to perform one trial of the simulation. 


a. Slips of paper 
b. Random digits table 


c. Random number generator 


In Exercises 15—18, determine whether the simulation design is valid. Justify your answer. 


15. 


16. 


Smartphone addiction? A media report claims that 50% of U.S. teens with smartphones 
feel addicted to their devices.2 A skeptical researcher believes that this figure is too high. 
She decides to test the claim by taking a random sample of 100 U.S. teens who have 
smartphones. Only 40 of the teens in the sample feel addicted to their devices. Does this 
result give convincing evidence that the media report’s 50% claim is too high? To find 
out, we want to perform a simulation to estimate the probability of getting 40 or fewer 
teens who feel addicted to their devices in a random sample of size 100 from a very large 
population of teens with smartphones in which 50% feel addicted to their devices. 

Let 1 = feels addicted and 2 = doesn’t feel addicted. Use a random number generator 
to produce 100 random integers from 1 to 2. Record the number of 1’s in the simulated 
random sample. Repeat this process many, many times. Find the percent of trials on which 
the number of 1’s was 40 or less. 


Lefties A website claims that 10% of U.S. adults are left-handed. A researcher believes 
that this figure is too low. She decides to test this claim by taking a random sample of 20 
U.S. adults and recording how many are left-handed. Four of the adults in the sample are 
left-handed. Does this result give convincing evidence that the website’s 10% claim is too 
low? To find out, we want to perform a simulation to estimate the probability of getting 4 
or more left-handed people in a random sample of size 20 from a very large population in 
which 10% of the people are left-handed. 


Let 00 to 09 indicate left-handed and 10 to 99 represent right-handed. Move left to 


right across a row in Table D. Each pair of digits represents one person. Keep going until 
you get 20 different pairs of digits. Record how many people in the simulated sample are 
left-handed. Repeat this process many, many times. Find the proportion of trials in which 
4 or more people in the simulated sample were left-handed. 


17. Notebook check Every 9 weeks, Mr. Millar collects students’ notebooks and checks their 
homework. He randomly selects 4 different assignments to inspect for all of the students. 
Marino is one of the students in Mr. Millar’s class. Marino completed 20 homework 
assignments and did not complete 10 assignments. He is surprised when Mr. Millar only 
selects 1 assignment that he completed. Should he be surprised? To find out, we want to 
design a simulation to estimate the probability that Mr. Millar will randomly select 1 or 
fewer of the homework assignments that Marino completed. 


Get 30 identical slips of paper. Write “N” on 10 of the slips and “C” on the remaining 
20 slips. Put the slips into a hat and mix well. Draw 1 slip without looking to represent the 
first randomly selected homework assignment, and record whether Marino completed it. 
Put the slip back into the hat, mix again, and draw another slip representing the second 
randomly selected assignment. Record whether Marino completed this assignment. Repeat 
this process two more times for the third and fourth randomly selected homework 
assignments. Record the number out of the 4 randomly selected homework assignments 
that Marino completed in this trial of the simulation. Perform many trials. Find the 
proportion of trials in which Mr. Millar randomly selects 1 or fewer of the homework 
assignments that Marino completed. 


18. Random assignment Researchers recruited 20 volunteers—8 men and 12 women—to 
take part in an experiment. They randomly assigned the subjects into two groups of 10 
people each. To their surprise, 6 of the 8 men were randomly assigned to the same 
treatment. Should they be surprised? We want to design a simulation to estimate the 
probability that a proper random assignment would result in 6 or more of the 8 men 
ending up in the same group. 

Get 20 identical slips of paper. Write “M” on 8 of the slips and “W” on the remaining 
12 slips. Put the slips into a hat and mix well. Draw 10 of the slips without looking and 
place into one pile representing Group 1. Place the other 10 slips in a pile representing 
Group 2. Record the largest number of men in either of the two groups from this simulated 
random assignment. Repeat this process many, many times. Find the percent of trials in 
which 6 or more men ended up in the same group. 


19. Color-blind men About 7% of men in the United States have some form of red—green 
color blindness. Suppose we randomly select one U.S. adult male at a time until we find 
one who is red—green color-blind. Should we be surprised if it takes us 20 or more men? 
Describe how you would carry out a simulation to estimate the probability that we would 
have to randomly select 20 or more U.S. adult males to find one who is red—green color- 
blind. Do not perform the simulation. 


20. Taking the train According to New Jersey Transit, the 8:00 A.M. weekday train from 


Princeton to New York City has a 90% chance of arriving on time. To test this claim, an 
auditor chooses 6 weekdays at random during a month to ride this train. The train arrives 
late on 2 of those days. Does the auditor have convincing evidence that the company’s 
claim is false? Describe how you would carry out a simulation to estimate the probability 
that a train with a 90% chance of arriving on time each day would be late on 2 or more of 
6 days. Do not perform the simulation. 


21. Recycling Do most teens recycle? To find out, an AP® Statistics class asked an SRS of 
100 students at their school whether they regularly recycle. In the sample, 55 students said 
that they recycle. Is this convincing evidence that more than half of the students at the 
school would say they regularly recycle? The dotplot shows the results of taking 200 SRSs 
of 100 students from a population in which the true proportion who recycle is 0.50. 
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a. Explain why the sample result (55 out of 100 said “Yes”) does not give convincing 
evidence that more than half of the school’s students recycle. 


b. Suppose instead that 63 students in the class’s sample had said “Yes.” Explain why this 
result would give convincing evidence that a majority of the school’s students recycle. 


22. Brushing teeth, wasting water? A recent study reported that fewer than half of young 
adults turn off the water while brushing their teeth. Is the same true for teenagers? To find 
out, a group of statistics students asked an SRS of 60 students at their school if they 
usually brush with the water off. In the sample, 27 students said “Yes.” The dotplot shows 
the results of taking 200 SRSs of 60 students from a population in which the true 
proportion who brush with the water off is 0.50. 
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Explain why the sample result (27 of the 60 students said “Yes”) does not give 
convincing evidence that fewer than half of the school’s students brush their teeth with 
the water off. 


. Suppose instead that 18 of the 60 students in the class’s sample had said “Yes.” Explain 


why this result would give convincing evidence that fewer than 50% of the school’s 
students brush their teeth with the water off. 


Multiple Choice Select the best answer for Exercises 23-28. 


23. You read in a book about bridge that the probability that each of the four players is dealt 
exactly one ace is approximately 0.11. This means that 


24. 


a. 


b. 


in every 100 bridge deals, each player has 1 ace exactly 11 times. 


in 1 million bridge deals, the number of deals on which each player has 1 ace will be 
exactly 110,000. 


in a very large number of bridge deals, the percent of deals on which each player has 1 
ace will be very close to 11%. 


. ina very large number of bridge deals, the average number of aces in a hand will be 


very close to 0.11. 


. If each player gets an ace in only 2 of the first 50 deals, then each player should get an 


ace in more than 11% of the next 50 deals. 


If I toss a fair coin five times and the outcomes are TTTTT, then the probability that tails 
appears on the next toss is 


a. 0.5. 


b. 


C. 


less than 0.5. 


greater than 0.5. 


d. 0. 


e. 1. 


Exercises 25 to 27 refer to the following setting. A basketball player claims to make 47% of her 
shots from the field. We want to simulate the player taking sets of 10 shots, assuming that her 
claim is true. 


20. 


26. 


a7, 


To simulate the number of makes in 10 shot attempts, you would perform the simulation 
as follows: 


a. Use 10 random one-digit numbers, where 0—4 are a make and 5—9 are a miss. 

b. Use 10 random two-digit numbers, where 00—46 are a make and 47—99 are a miss. 
c. Use 10 random two-digit numbers, where 00-47 are a make and 48—99 are a miss. 
d. Use 47 random one-digit numbers, where 0 is a make and 1-9 are a miss. 


e. Use 47 random two-digit numbers, where 00-46 are a make and 47—99 are a miss. 


A total of 25 repetitions of the simulation were performed. The number of makes in each 
set of 10 simulated shots was recorded on the dotplot. What is the approximate probability 
that a 47% shooter makes 5 or more shots in 10 attempts? 


Simulated number of made shots 
a. 5/10 
b. 3/10 
c. 12/25 
d. 3/25 
e. 47/100 


Suppose this player attempts 10 shots in a game and makes only 3 of them. Does this 
provide convincing evidence that she is less than a 47% shooter? 


a. Yes, because 3/10 (30%) is less than 47%. 
b. Yes, because she never made 47% of her shots in the simulation. 


c. No, because it is plausible (believable) that she would make 3 or fewer shots by chance 
alone. 


d. No, because the simulation was only repeated 25 times. 


e. No, because more than half of the simulated results were less than 47%. 


28. Ten percent of U.S. households contain 5 or more people. You want to simulate choosing 
a household at random and recording “Yes” if it contains 5 or more people. Which of 
these is a correct assignment of digits for this simulation? 


a. Odd = Yes; Even = No 
b. 0= Yes; 1-9 = No 

c. 0-5 = Yes; 6-9 = No 
d. 0-4 = Yes; 5-9 = No 
e. None of these 


Recycle and Review 


29. AARP and Medicare (4.1) To find out what proportion of Americans support proposed 
Medicare legislation to help pay medical costs, the AARP conducted a survey of their 
members (people over age 50 who pay membership dues). One of the questions was: 
“Even if this plan won’t affect you personally either way, do you think it should be passed 
so that people with low incomes or people with high drug costs can be helped?” Of the 


respondents, 75% answered “Yes.”4 


a. Describe how undercoverage might lead to bias in this study. Explain the likely 
direction of the bias. 


b. Describe how the wording of the question might lead to bias in this study. Explain the 
likely direction of the bias. 


30. Waiting to park (1.3, 4.2) Do drivers take longer to leave their parking spaces when 
someone is waiting? Researchers hung out in a parking lot and collected some data. The 
graphs and numerical summaries display information about how long it took drivers to 
exit their spaces. 
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Descriptive Statistics: Time 


Waiting n Mean StDev Min Q1 Median Q3 Max 
No 20 44.42 14.10 33.76 35.61 39.56 48.48 84.92 
Yes 20 54.11 14.39 41.61 43.41 47.14 66.44 85.97 


a. Write a few sentences comparing these distributions. 


b. Can we conclude that having someone waiting causes drivers to leave their spaces more 
slowly? Why or why not? 


SECTION 5.2 Probability Rules 


LEARNING TARGETS By the end of the section, you should be able to: 


e Give a probability model for a chance process with equally likely outcomes and 
use it to find the probability of an event. 


Use basic probability rules, including the complement rule and the addition rule 


for mutually exclusive events. 


Use a two-way table or Venn diagram to model a chance process and calculate 
probabilities involving two events. 


Apply the general addition rule to calculate probabilities. 


The idea of probability rests on the fact that chance behavior is predictable in the long run. In 
Section 5.1, we used simulation to imitate chance behavior. Do we always need to repeat a 
chance process—tossing coins, rolling dice, drawing slips from a hat—many times to 
determine the probability of a particular outcome? Fortunately, the answer is no. 


Probability Models 


In Chapter 2, we saw that a Normal density curve could be used to model some distributions of 
data. In Chapter 3, we modeled linear relationships between two quantitative variables with a 
least-squares regression line. Now we’re ready to develop a model for chance behavior. 


Steve Gorton/Getty Images 


Many board games involve rolling dice. Imagine rolling two fair, six-sided dice—one that’s 
red and one that’s blue. How do we develop a probability model for this chance process? 
Figure 5.2 displays the sample space of 36 possible outcomes. Because the dice are fair, each 
of these outcomes will be equally likely and have probability 1/36. 
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FIGURE 5.2 The 36 possible outcomes from rolling two six-sided dice, one red and one blue. 
Each of these equally likely outcomes has probability 1/36. 
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DEFINITION Probability model, Sample space 


A probability model is a description of some chance process that consists of two 
parts: a list of all possible outcomes and the probability for each outcome. 


The list of all possible outcomes is called the sample space. 


A sample space can be very simple or very complex. If we toss a coin once, there are only 
two possible outcomes in the sample space, heads and tails. When Gallup takes a random 
sample of 1523 U.S. adults and asks a survey question, the sample space consists of all possible 
sets of responses from 1523 of the over 240 million adults in the country. 

A probability model does more than just assign a probability to each outcome. It allows us 
to find the probability of an event. 


DEFINITION Event 


An event is any collection of outcomes from some chance process. 


Events are usually designated by capital letters, like A, B, C, and so on. For rolling two six- 
sided dice, we can define event A as getting a sum of 5. We write the probability of event A as 
P(A) or P(sum is 5). 

It is fairly easy to find the probability of an event in the case of equally likely outcomes. 
There are 4 outcomes in event A: 


2d 


The probability that event A occurs is therefore 


P(A)=number of outcomes with sum of Stotal number of outcomes when rolling two dice=436= 
P(A) number of outcomes with sum of 5 - - =0111 


~ total number of outcomes when rolling two dice 36 


FINDING PROBABILITIES: EQUALLY LIKELY OUTCOMES 


If all outcomes in the sample space are equally likely, the probability that event A occurs 
can be found using the formula 


P(A)=number of outcomes in event Atotal number of outcomes in sample space 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


EXAMPLE | Spin the spinner (>) 
Probability models: Equally likely outcomes 


PROBLEM: A spinner has three equal sections: red, blue, and yellow. Suppose you spin the 
spinner two times. 


a. Give a probability model for this chance process. 

b. Define event A as spinning blue at least once. Find P(A). 

SOLUTION: 

a. Sample space: RR RBRY BR BB BY YR YB YY. Because the spinner has equal 
sections, each of these outcomes will be equally likely and have probability 1/9. 


Remember: A probability model consists of a list of all possible outcomes and the 


probability of each outcome. 


b. There are 5 outcomes with at least one blue: RB BR BB BY YB. So P(A)=59=0.556 
P(A) = 2 = 0.556 


If all outcomes in the sample space are equally likely, 


P(A)=number of outcomes in event Atotal number of outcomes in sample space 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 
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Basic Probability Rules 


Our work so far suggests three commonsense rules that a valid probability model must obey: 


.. If all outcomes in the sample space are equally likely, the probability that event A 
occurs is 
P(A)=number of outcomes in event Atotal number of outcomes in sample space 
P(A) = number of outcomes in event A 
~ total number of outcomes in sample space 


’. The probability of any event is a number between 0 and 1. This rule follows from the 
definition of probability in Section 5.1: the proportion of times the event would occur in 
many repetitions of the chance process. A proportion is a number between 0 and 1, so any 
probability is also a number between 0 and 1. 


3. All possible outcomes together must have probabilities that add up to 1. Because some 
outcome must occur on every trial of a chance process, the sum of the probabilities for all 
possible outcomes must be exactly 1. 


Here’s another rule that follows from the previous two: 


|. The probability that an event does not occur is 1 minus the probability that the event 
does occur. If an event occurs in (say) 70% of all trials, it fails to occur in the other 30%. 
The probability that an event occurs and the probability that it does not occur always add to 
100%, or 1. Earlier, we found that the probability of getting a sum of 5 when rolling two fair, 
six-sided dice is 4/36. What’s the probability that the sum is not 5? 
P(sum is not 5)=1—P(sum is 5)=1-436=3236=0.889 
4 32 
P(sum is not 5) = 1 — P(sum is 5) = 1 ——-=~~= 0.889 
(sum is not 5) (sum is 5) 36 36 
We refer to the event “not A” as the complement of A and denote it by A©. For that reason, 
this handy result is known as the complement rule. Using the complement rule in this setting 
is much easier than counting all 32 possible ways to get a sum that isn’t 5. 


DEFINITION Complement rule, Complement 


The complement rule says that P(AC) = 1-P(A), where AC is the complement of 


event A; that is, the event that A does not occur. 


Let’s consider one more event involving the chance process of rolling two fair, six-sided 
dice: getting a sum of 6. The outcomes in this event are 


1a aa hc 


(sum is 6) = 35 What's the probability that we get a sum of 5 ora 


So P(sum is 6)=536" 
sum of 6? 
P(sum is 5 or sum is 6) = P(sum is 5) + P(sum is 6) = 436+536=936=0.25 


4 9 9 
P(sum is 5 or sum is 6) = P(sum is 5) + P(sum is 6) = 36 { 7367 0.25 


Why does this formula work? Because the events “getting a sum of 5” and “getting a sum 


of 6” have no outcomes in common—that is, they can’t both happen at the same time. We say 
that these two events are mutually exclusive. As a result, this intuitive formula is known as the 


addition rule for mutually exclusive events. 


DEFINITION Mutually exclusive, Addition rule for mutually exclusive 
events 


Two events A and B are mutually exclusive if they have no outcomes in 
common and so can never occur together—that is, if P(A and B) = 0. 


The addition rule for mutually exclusive events A and B says that 


P(A or B) = P(A) + P(B)P(A oF B) = P(A) + P(B) 


Mutually exclusive events are sometimes called disjoint events. 


Oo Note that this rule works only for mutually exclusive events. We will soon develop a 
more general rule for finding P(A or B) that works for any two events. 


We can summarize the basic probability rules more concisely in symbolic form. 


BASIC PROBABILITY RULES 


e For any event A, 0 < P(A) <1. 
e If S is the sample space in a probability model, P(S) = 1. 
e In the case of equally likely outcomes, 


P(A)=number of outcomes in event Atotal number of outcomes in sample space 


number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


¢ Complement rule: P(A®) = 1 - P(A). 


e Addition rule for mutually exclusive events: If A and B are mutually exclusive, P(A or 
B) = P(A) + P(B). 


The earlier dice-rolling and spin the spinner settings involved equally likely outcomes. 
Here’s an example that illustrates use of the basic probability rules when the outcomes of a 
chance process are not equally likely. 


EXAMPLE | Avoiding blue M&M’S® (>) 
Basic probability rules 


Niels Poulsen std/Alamy 


PROBLEM: Suppose you tear open the corner of a bag of M&M’S® Milk Chocolate 
Candies, pour one candy into your hand, and observe the color. According to Mars, Inc., the 
maker of M&M’S, the probability model for a bag from its Cleveland factory is: 


Color Blue Orange Green Yellow Red Brown 
Probability 0.207 0.205 0.198 0.135 0.131 0.124 


a. Explain why this is a valid probability model. 
b. Find the probability that you don’t get a blue M&M. 
c. What’s the probability that you get an orange or a brown M&M? 


SOLUTION: 


a. The probability of each outcome is a number between o and 1. The sum of the 
probabilities is 0.207 + 0.205 + 0.198 + 0.135 + 0.131 + 0.124 =1. 


b. P (not blue) = 1 - P (blue) = 1 - 0.207 = 0.793 


Using the complement rule: P (A° )=1-P (A). 


Caer (orange or brown) = 


BP) (orange) + P (brown) 


= 0,205 + 0,124 = 0,329 


Using the addition rule for mutually exclusive events because an M&M® can't be 


both orange and brown. 
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For part (b) of the example, we could also use an expanded version of the addition rule for 


mutually exclusive events: 


P(not blue)=P(orange or green or yellow or red or brown)=P(orange)+P(green)+P(yellow)+P(re 


P(not blue) = 


P (orange or green or yellow or red or brown) 
P (orange) + P (green) + P (yellow) + P (red) + P (brown) 


= 0.205 + 0.198 + 0.135 + 0.131 + 0.124 


0.793 


Using the complement rule is much simpler than adding 5 probabilities together! 


CHECK YOUR UNDERSTANDING 


q 


y Suppose we choose an American adult at random. Define two events: 
A = the person has a cholesterol level of 240 milligrams per deciliter of blood 
(mg/dl) or above (high cholesterol) 
B = the person has a cholesterol level of 200 to <240 mg/dl (borderline high 


cholesterol) 


According to the American Heart Association, P(A) = 0.16 and P(B) = 0.29. 


1. Explain why events A and B are mutually exclusive. 


2. Say in plain language what the event “A or B” is. Then find P(A or B). 


3. Let C be the event that the person chosen has a cholesterol level below 200 mg/dl (normal 
cholesterol). Find P(C). 


Two-Way Tables, Probability, and the General Addition Rule 


So far, you have learned how to model chance behavior and some basic rules for finding the 
probability of an event. What if you’re interested in finding probabilities involving two events 
that are not mutually exclusive? For instance, a survey of all residents in a large apartment 
complex reveals that 68% use Facebook, 28% use Instagram, and 25% do both.2 Suppose we 
select a resident at random. What’s the probability that the person uses Facebook or uses 
Instagram? 

There are two different uses of the word or in everyday life. In a restaurant, when you are 
asked if you want “soup or salad,” the waiter wants you to choose one or the other, but not 
both. However, when you order coffee and are asked if you want “cream or sugar,” it’s OK to 
ask for one or the other or both. 

Mutually exclusive events A and B cannot both happen at the same time. For such events, 
“A or B” means that only event A happens or only event B happens. You can find P(A or B) 
with the addition rule for mutually exclusive events: 


P(A or B)=P(A)+P(B)P (A or B) = P(A) + P(B) 


How can we find P(A or B) when the two events are not mutually exclusive? Now we have 
to deal with the fact that “A or B” means one or the other or both. For instance, “uses Facebook 
or uses Instagram” in the scenario just described includes U.S. adults who do both. 


When you’re trying to find probabilities involving two events, like P(A or B), a two-way 
table can display the sample space in a way that makes probability calculations easier. 


EXAMPLE |Who has pierced ears? (>) 
Two-way tables and probability 


Big Cheese Photo/Superstock 


PROBLEM: Students in a college statistics class wanted to find out how common it is for 
young adults to have their ears pierced. They recorded data on two variables—gender and 
whether or not the student had a pierced ear—for all 178 people in the class. The two-way 
table summarizes the data. 


Gender 


Male Female Total 

' Yes 103 
Pierced ear 

No 75 

Total 90 88 178 


Suppose we choose a student from the class at random. Define event A as getting a male 
student and event B as getting a student with a pierced ear. 


a. Find P(B). 
b. Find P(A and B). Interpret this value in context. 
c. Find P(A or B). 


SOLUTION: 


D\ _ i je ; 7 103 = C 
a. P(B) = P(pierced ear) = 103178 = 0579" (B) = F (pierced ear) mg eS 


b. P(AandB) = P(male and pierced ear) = 19178 = 0.107 
é “ — > } > af 5) as —— 19 — 
P(AandB) = P (male and pierced ear) = = 0.107, Shere ab outaw AON Gane 
that a randomly selected student from this class is male and has a pierced ear. 
C. P(AorB)=P(male or pierced ear)=71+19+841 Jo = 174178 = 0.976 
P(AorB) = P(male or pierced ear) 


71419+84 174 ~ 
178 ~ 78 0.978 


In statistics, or means “one or the other or both.” So “male or pierced ear” includes 


(i) male but no pierced ear; (ii) pierced ear but not male; and (ili) male and pierced 
ear. 
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When we found P(male and pierced ear) in part (b) of the example, we could have 
described this as either P(A and B) or P(B and A). Why? Because “male and pierced ear” 
describes the same event as “pierced ear and male.” Likewise, P(A or B) is the same as P(B or 
A). Don’t get so caught up in the notation that you lose sight of what’s really happening! 

Part (c) of the example reveals an important fact about finding the probability P(A or B): 
we can’t use the addition rule for mutually exclusive events unless events A and B have no 
outcomes in common. In this case, there are 19 outcomes that are shared by events A and B— 
the students who are male and have a pierced ear. If we did add the probabilities of A and B, 
we'd get 90/178 + 103/178 = 193/178. This is clearly wrong because the probability is bigger 
than 1! As Figure 5.3 illustrates, outcomes common to both events are counted twice when we 
add the probabilities of these two events. 


Outcomes here are double-counted 


19 by P(A) + P(B) 


P(A and B) = 


Pierced ear 


FIGURE 5.3 Two-way table showing events A and B from the pierced-ear example. These 
events are not mutually exclusive, so we can't find P(A or B) by just adding the probabilities of 
the two events. 


We can fix the double-counting problem illustrated in the two-way table by subtracting the 
probability P(male and pierced ear) from the sum. That is, 


P(male or pierced ear)=P(male)+P(pierced ear) 
—P(male and pierced ear)=90/178+103/178-19/178=174/178 


P (male or pierced ear) = P(male) + P (pierced ear) — P (male and pierced ear) 
= 90/178 + 103/178 — 19/178 
= 174/178 


This result is known as the general addition rule. 


DEFINITION General addition rule 


If A and B are any two events resulting from some chance process, the general 
addition rule says that 


P(A or B)=P(A)+P(B)-P(A and B)P (A or B) = P(A) + P(B) — P(A and B) 


Sometimes it’s easier to label events with letters that relate to the context, as the following 
example shows. 


EXAMPLE | Facebook versus Instagram (>) 
General addition rule 
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Ann Heath 
PROBLEM: A survey of all residents in a large apartment complex reveals that 68% use 


Facebook, 28% use Instagram, and 25% do both. Suppose we select a resident at random. 
What’s the probability that the person uses Facebook or uses Instagram? 


SOLUTION: 


Let event F = uses Facebook and 1 = uses Instagram. 
P(E or I)=P(F)+P(1) — P(F and 1)=0.68+0.28 - 0.25=0.71 
P(ForlI) = P(F)+P(I) — P(F andI) 
0.68 + 0.28 — 0.25 
= 0.7% 
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AP® EXAM TIP 


Many probability problems involve simple computations that you can do on your 


calculator. It may be tempting to just write down your final answer without showing 
the supporting work. Don’t do it! A “naked answer,” even if it’s correct, will usually 
be penalized on a free-response question. 


What happens if we use the general addition rule for two mutually exclusive events A and 
B? In that case, P(A and B) = 0, and the formula reduces to 


P(A or B)=P(A)+P(B)-P(A and B)=P(A)+P(B)-0=P(A)+P(B) 


P(A or B) = P(A) + P(B) — P(AandB) = P(A) + P(B) —0= P(A) + P(B) 


In other words, the addition rule for mutually exclusive events is just a special case of the 
general addition rule. 

You might be wondering if there is also a rule for finding P(A and B). There is, but it’s not 
quite as intuitive. Stay tuned for that later. 


CHECK YOUR UNDERSTANDING 


fy 


v Yellowstone National Park staff surveyed a random sample of 1526 winter visitors to 
the park. They asked each person whether they belonged to an environmental club (like the 
Sierra Club). Respondents were also asked whether they owned, rented, or had never used a 
snowmobile. The two-way table summarizes the survey responses. 


Environmental club 


No Yes 

Snowmobile Never used 212 
experience Renter 77 
Owner 279 16 


Suppose we choose one of the survey respondents at random. 

1. What’s the probability that the person is an environmental club member? 
2. Find P(not a snowmobile renter). 

3. What’s P(environmental club member and not a snowmobile renter)? 

4. 


Find the probability that the person is not an environmental club member or is a 
snowmobile renter. 


Venn Diagrams and Probability 


We have seen that two-way tables can be used to illustrate the sample space of a chance 
process involving two events. So can Venn diagrams, like the one shown in Figure 5.4. 


Sample space 


FIGURE 5.4 A typical Venn diagram that shows the sample space and the relationship between 


two events A and B. 


DEFINITION Venn diagram 


A Venn diagram consists of one or more circles surrounded by a rectangle. Each 
circle represents an event. The region inside the rectangle represents the sample 
space of the chance process. 


In an earlier example, we looked at data from a survey on gender and ear piercings for a 
large group of college students. The chance process was selecting a student in the class at 
random. Our events of interest were A: is male and B: has a pierced ear. Here is the two-way 
table that summarizes the data: 


Gender 
Male Female Total 
. Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


The Venn diagram in Figure 5.5 displays the sample space in a slightly different way. There 
are four distinct regions in the Venn diagram. These regions correspond to the four (non-total) 
cells in the two-way table as follows. 


Male and 
pierced ear 


Male and 
no pierced ear 


FIGURE 5.5 The completed Venn diagram for the large group of college students. The circles 
represent the two events A = male and B = has a pierced ear. 


Female and 
pierced ear 


Female and 
no pierced ear 


Region in Venn diagram In words Count 
In the intersection of two circles Male and pierced ear 19 
Inside circle A, outside circle B Male and no pierced ear 71 
Inside circle B, outside circle A Female and pierced ear 84 
Outside both circles Female and no pierced ear 4 


Because Venn diagrams have uses in other branches of mathematics, some standard 
vocabulary and notation have been developed that will make our work with Venn diagrams a 
bit easier. 


© We introduced the complement of an event earlier. In Figure 5.6(a), the complement A© 
contains the outcomes that are not in A. 


e Figure 5.6(b) shows the event “A and B.” You can see why this event is also called the 
intersection of A and B. The corresponding notation is A n B. 


e The event “A or B” is shown in Figure 5.6(c). This event is also known as the union of A 
and B. The corresponding notation is A U B. 


ANB ANB 
tC) (b) @ 


FIGURE 5.6 The green shaded region in each Venn diagram shows: (a) the complement A° of 
event A, (b) the intersection of events A and B, and (c) the union of events A and B. 


DEFINITION Intersection, Union 


The event “A and B’” is called the intersection of events A and B. It consists of all 
outcomes that are common to both events, and is denoted A n B. 


The event “A or B” is called the union of events A and B. It consists of all 
outcomes that are in event A or event B, or both, and is denoted A U B. 


Here’s a way to keep the symbols straight: U for union; 1 for intersection. 


With this new notation, we can rewrite the general addition rule in symbols as 
P(AUB)=P(A)+P(B)-P(AnB)? (AU B) = P(A) + P(B) — P(AN B) 


This Venn diagram shows why the formula works in the pierced-ear example. 


Outcomes here are 
double-counted by 
P(A) + P(B) 


Event B 
pierced ear 
P(B) = 103/178 


Event A 
male 
P(A) = 90/178 


Event A and B male 
and pierced ear 
P(A B)= 19/178 


For mutually exclusive events A and B, the two events have no outcomes in common. So 


the corresponding Venn diagram consists of two non-overlapping circles. You can see from the 
figure at left why, in this special case, the general addition rule reduces to 


P(AUB)=P(A)+P(B)P(A U B) = P(A) + P(B) 


EXAMPLE | Facebook versus Instagram (>) 
Venn diagrams and probability 


Ann Heath 


PROBLEM: A survey of all residents in a large apartment complex reveals that 68% use 
Facebook, 28% use Instagram, and 25% do both. Suppose we select a resident at random. 


a. Make a Venn diagram to display the sample space of this chance process using the events 


F: uses Facebook and I: uses Instagram. 
b. Find the probability that the person uses neither Facebook nor Instagram. 


SOLUTION: 


—— 


(= B) 


b. P (no Facebook and no Instagram) O20 


i 


Start with the intersection: P(F n I) = 0.25. 

We know that 68% of residents use Facebook. That figure includes the 25% who 
also use Instagram. So 68% — 25% = 43% = 0.43 only use Facebook. 

We know that 28% of residents use Instagram. That figure includes the 25% who 


also use Facebook. So 28% — 25% = 3% = 0.03 only use Instagram. 

A total of 0.43 + 0.25 + 0.03 = 0.71 = 71% of residents use at least one of 
Facebook or Instagram. By the complement rule, 1 — 0.71 = 0.29 = 29% use neither 
Facebook nor Instagram. 


FOR PRACTICE, TRY EXERCISE 51 


In the preceding example, the event “uses neither Facebook nor Instagram” is the 
complement of the event “uses at least one of Facebook or Instagram.” To solve part (b) of the 
problem, we could have used our answer from the example on page 321 and the complement 
rule: 

P(neither Facebook nor Instagram)=1-—P(at least one of Facebook or Instagram)=1-—0.71=0.29 
P(neither Facebook nor Instagram) 
= 1 — P(at least one of Facebook or Instagram) 
= 1-—0.71 = 0.29 


As you’|I see in Section 5.3, the fact that “none” is the opposite of “at least 1” comes in handy 
for a variety of probability questions. 

An alternate solution to the example uses a two-way table. Here is a partially completed 
table with the information given in the problem statement. Do you see how we can fill in the 
missing entries? 


Facebook use 
Yes No Total 
Yes 28% 
No 


Total 68% 100% 


Instagram use 


100% — 68% = 32% of residents do not use Facebook. 
100% — 28% = 72% of residents do not use Instagram. 


e 68% — 25% = 43% of residents use Facebook but do not use Instagram. 


28% — 25% = 3% of residents use Instagram but do not use Facebook. 


© 32% — 3% = 72% — 43% = 29% of residents do not use Facebook and do not use Instagram. 


The completed table is shown here. We can see the desired probability marked in bold in 
the table: P(neither Facebook nor Instagram) = 0.29. 


Facebook use 


Yes No Total 
Yes 28% 
Instagram use 
No 72% 
Total 68% 32% 100% 


Section 5.2 Summary 


e A probability model describes a chance process by listing all possible outcomes in the 
sample space and giving the probability of each outcome. A valid probability model 
requires that all possible outcomes have probabilities that add up to 1. 


e An event is a collection of possible outcomes from the sample space. The probability of any 
event is a number between 0 and 1. 


e The event “A or B” is known as the union of A and B, denoted A U B. It consists of all 
outcomes in event A, event B, or both. 


e The event “A and B” is known as the intersection of A and B, denoted A n B. It consists of 
all outcomes that are common to both events. 


e To find the probability that an event occurs, we use some basic rules: 
« If all outcomes in the sample space are equally likely, 


P(A)=number of outcomes in event Atotal number of outcomes in sample space 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


. Complement rule: P(A) = 1 - P(A), where AC is the complement of event A; that is, the 
event that A does not happen. 
: General addition rule: For any two events A and B, 


« Addition rule for mutually exclusive events: Events A and B are mutually exclusive if 
they have no outcomes in common. If A and B are mutually exclusive, P(A or B) = P(A) + 


P(B). 
e A two-way table or Venn diagram can be used to display the sample space and to help find 
probabilities for a chance process involving two events. 


Section 5.2 Exercises 


31. pg 315 |») Four-sided dice A four-sided die is a pyramid whose four faces are labeled 
with the numbers 1, 2, 3, and 4 (see image). Imagine rolling two fair, four-sided dice and 
recording the number that is showing at the base of each pyramid. For instance, you would 
record a 4 if the die landed as shown in the figure. 


Fe 


a. Give a probability model for this chance process. 

b. Define event A as getting a sum of 5. Find P(A). 
32. Tossing coins Imagine tossing a fair coin 3 times. 

a. Give a probability model for this chance process. 


b. Define event B as getting more heads than tails. Find P(B). 


33. Grandkids Mr. Starnes and his wife have 6 grandchildren: Connor, Declan, Lucas, Piper, 
Sedona, and Zayne. They have 2 extra tickets to a holiday show, and will randomly select 
which 2 grandkids get to see the show with them. 


a. Give a probability model for this chance process. 


b. Find the probability that at least one of the two girls (Piper and Sedona) get to go to the 
show. 


34. Who’s paying? Abigail, Bobby, Carlos, DeAnna, and Emily go to the bagel shop for 
lunch every Thursday. Each time, they randomly pick 2 of the group to pay for lunch by 
drawing names from a hat. 


a. Give a probability model for this chance process. 
b. Find the probability that Carlos or DeAnna (or both) ends up paying for lunch. 
35. PQ 317 |») Mystery box Ms. Tyson keeps a Mystery Box in her classroom. If a student 


meets expectations for behavior, she or he is allowed to draw a slip of paper without 
looking. The slips are all of equal size, are well mixed, and have the name of a prize 


written on them. One of the “prizes”—extra homework— isn’t very desirable! Here is the 
probability model for the prizes a student can win: 


Prize Pencil Candy — Stickers 


Homework pass Extra homework 
Probability 0.40 0.25 0.15 


0.15 0.05 
a. Explain why this is a valid probability model. 
b. Find the probability that a student does not win extra homework. 


c. What’s the probability that a student wins candy or a homework pass? 


36. Languages in Canada Canada has two official languages, English and French. Choose a 
Canadian at random and ask, “What is your mother tongue?” Here is the distribution of 


responses, combining many separate languages from the broad Asia/Pacific region:® 


Language English French 


Asian/Pacific Other 
Probability 0.63 0.22 


0.06 0.09 


a. Explain why this is a valid probability model. 
b. What is the probability that the chosen person’s mother tongue is not English? 


c. What is the probability that the chosen person’s mother tongue is one of Canada’s 
official languages? 


37. Household size In government data, a household consists of all occupants of a dwelling 


unit. Choose an American household at random and count the number of people it 
contains. Here is the assignment of probabilities for the outcome. The probability of 
finding 3 people in a household is the same as the probability of finding 4 people. 


Number of people uy 


2 3.4 5 6 7+ 
Probability 0.25 


0.32 ans 0.07 0.03 0.01 
a. What probability should replace “?” in the table? Why? 


b. Find the probability that the chosen household contains more than 2 people. 


38. When did you leave? The National Household Travel Survey gathers data on the time of 
day when people begin a trip in their car or other vehicle. Choose a trip at random and 
record the time at which the trip started.2 Here is an assignment of probabilities for the 


outcomes: 

Time of day 10 P.M.—12:59 A.M. 1 A.M.—5:59 A.M. 6 A.M.—8:59 A.M. 
Probability 0.040 0.033 0.144 
Time of day 9 A.M.—12:59 P.M. 1 P.M.—3:59 P.M. 4 P.M.—6:59 P.M. 


Probability 0.234 0.208 ? 
Time of day 7 P.M.—9:59 P.M. 
Probability 0.123 


a. What probability should replace “?” in the table? Why? 


b. Find the probability that the chosen trip did not begin between 9 A.M. and 12:59 P.M. 


39. Education among young adults Choose a young adult (aged 25 to 29) at random. The 


40. 


41. 


probability is 0.13 that the person chosen did not complete high school, 0.29 that the 
person has a high school diploma but no further education, and 0.30 that the person has at 
least a bachelor’s degree. 


a. What must be the probability that a randomly chosen young adult has some education 
beyond high school but does not have a bachelor’s degree? Why? 


b. Find the probability that the young adult completed high school. Which probability rule 
did you use to find the answer? 


c. Find the probability that the young adult has further education beyond high school. 
Which probability rule did you use to find the answer? 


Preparing for the GMAT A company that offers courses to prepare students for the 
Graduate Management Admission Test (GMAT) has collected the following information 
about its customers: 20% are undergraduate students in business, 15% are undergraduate 
students in other fields of study, and 60% are college graduates who are currently 
employed. Choose a customer at random. 


a. What must be the probability that the customer is a college graduate who is not 
currently employed? Why? 


b. Find the probability that the customer is currently an undergraduate. Which probability 


rule did you use to find the answer? 


c. Find the probability that the customer is not an undergraduate business student. Which 


probability rule did you use to find the answer? 


pg 319 |») Who eats breakfast? Students in an urban school were curious about how 


many children regularly eat breakfast. They conducted a survey, asking, “Do you eat 
breakfast on a regular basis?” All 595 students in the school responded to the survey. The 
resulting data are shown in the two-way table.2 


Gender 
Male Female Total 
Eats breakfast Yes 300 
regularly No 295 


Total 320 275 595 


Suppose we select a student from the school at random. Define event F as getting a female 
student and event B as getting a student who eats breakfast regularly. 


a. Find P(B®). 
b. Find P(E and B®). Interpret this value in context. 


c. Find P(F or B®). 


42. Is this your card? A standard deck of playing cards (with jokers removed) consists of 52 
cards in four suits—clubs, diamonds, hearts, and spades. Each suit has 13 cards, with 
denominations ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, jack, queen, and king. The jacks, queens, and 
kings are referred to as “face cards.” Imagine that we shuffle the deck thoroughly and deal 
one card. Define events F: getting a face card and H: getting a heart. The two-way table 
summarizes the sample space for this chance process. 


Card 
Face card Nonface card Total 
. Heart is 
Suit 
Nonheart 39 
Total 52 


a. Find P(H°). 
b. Find P(H© and F). Interpret this value in context. 
c. Find P(H® or F). 


43. Cell phones The Pew Research Center asked a random sample of 2024 adult cell-phone 
owners from the United States their age and which type of cell phone they own: iPhone, 
Android, or other (including non-smartphones). The two-way table summarizes the data. 


Age 
18-34 35-54 55+ Total 
iPhone 467 
Type ofcell A nari 503 
phone 
Other 1054 
Total 517 637 870 2024 


Suppose we select one of the survey respondents at random. What’s the probability that: 
a. The person is not age 18 to 34 and does not own an iPhone? 
b. The person is age 18 to 34 or owns an iPhone? 


44. Middle school values Researchers carried out a survey of fourth-, fifth-, and sixth-grade 
students in Michigan. Students were asked whether good grades, athletic ability, or being 
popular was most important to them. The two-way table summarizes the survey data.2 


Grade 


Ath grade 5th grade 6th grade Total 

Grades 168 

most Athletic 98 
important 

Popular 69 

Total 92 108 135 335 


Suppose we select one of these students at random. What’s the probability of each of the 


following? 
a. The student is a sixth-grader or rated good grades as important. 


b. The student is not a sixth-grader and did not rate good grades as important. 


45. Roulette An American roulette wheel has 38 slots with numbers 1 through 36, 0, and 00, 
as shown in the figure. Of the numbered slots, 18 are red, 18 are black, and 2—the O and 
00—are green. When the wheel is spun, a metal ball is dropped onto the middle of the 
wheel. If the wheel is balanced, the ball is equally likely to settle in any of the numbered 
slots. Imagine spinning a fair wheel once. Define events B: ball lands in a black slot, and 
E: ball lands in an even-numbered slot. (Treat 0 and 00 as even numbers.) 


a. Make a two-way table that displays the sample space in terms of events B and E. 
b. Find P(B) and P(E). 
c. Describe the event “B and E” in words. Then find the probability of this event. 


d. Explain why P(B or E) # P(B) + P(E). Then use the general addition rule to compute 
P(B or E). 


46. Colorful disks A jar contains 36 disks: 9 each of four colors—red, green, blue, and 


47. 


48. 


49. 


50. 


ol. 


yellow. Each set of disks of the same color is numbered from 1 to 9. Suppose you draw 
one disk at random from the jar. Define events R: get a red disk, and N: get a disk with the 
number 9. 


a. Make a two-way table that describes the sample space in terms of events R and N. 
b. Find P(R) and P(N). 
c. Describe the event “R and N” in words. Then find the probability of this event. 


d. Explain why P(R or N) # P(R) + P(N) Then use the general addition rule to compute 
P(R or N). 


pg 321 |») Dogs and cats In one large city, 40% of all households own a dog, 32% 


own a Cat, and 18% own both. Suppose we randomly select a household. What’s the 
probability that the household owns a dog or a cat? 


Reading the paper In a large business hotel, 40% of guests read the Los Angeles Times. 
Only 25% read the Wall Street Journal. Five percent of guests read both papers. Suppose 
we select a hotel guest at random and record which of the two papers the person reads, if 
either. What’s the probability that the person reads the Los Angeles Times or the Wall 
Street Journal? 


Mac or PC? A recent census at a major university revealed that 60% of its students 
mainly used Macs. The rest mainly used PCs. At the time of the census, 67% of the 
school’s students were undergraduates. The rest were graduate students. In the census, 
23% of respondents were graduate students and used a Mac as their main computer. 
Suppose we select a student at random from among those who were part of the census. 
Define events G: is a graduate student and M: primarily uses a Mac. 


a. Find P(G U M). Interpret this value in context. 


b. Consider the event that the randomly selected student is an undergraduate student and 
primarily uses a PC. Write this event in symbolic form and find its probability. 


Gender and political party In January 2017, 52% of U.S. senators were Republicans and 
the rest were Democrats or Independents. Twenty-one percent of the senators were 
females, and 47% of the senators were male Republicans. Suppose we select one of these 
senators at random. Define events R: is a Republican and M: is male. 


a. Find P(R UM). Interpret this value in context. 


b. Consider the event that the randomly selected senator is a female Democrat or 
Independent. Write this event in symbolic form and find its probability. 


pg 324 |») Dogs and cats Refer to Exercise 47. 


a. Make a Venn diagram to display the outcomes of this chance process using events D: 
owns a dog and C: owns a cat. 


b. Find P(D n C°). 
52. Reading the paper Refer to Exercise 48. 


a. Make a Venn diagram to display the outcomes of this chance process using events L: 
reads the Los Angeles Times and W: reads the Wall Street Journal. 


b. Find P(L© n W). 


53. Union and intersection Suppose A and B are two events such that P(A) = 0.3, P(B) = 
0.4, and P(A U B) = 0.58. Find P(A n B). 


54. Union and intersection Suppose C and D are two events such that P(C) = 0.6, P(D) = 
0.45, and P(C U D) = 0.75. Find P(C n D). 


Multiple Choice Select the best answer for Exercises 55-58. 


55. The partially completed table that follows shows the distribution of scores on the 2016 
AP® Statistics exam. 


Score 1 fs 3 4 5 
Probability 0.235 0.155 0.249 0.217 4 


Suppose we randomly select a student who took this exam. What’s the probability that he 
or she earned a score of at least 3? 


a. 0.249 
b. 0.361 
c. 0.390 
d. 0.466 
e. 0.610 


56. In a sample of 275 students, 20 say they are vegetarians. Of the vegetarians, 9 eat both fish 
and eggs, 3 eat eggs but not fish, and 7 eat neither. Choose one of the vegetarians at 
random. What is the probability that the chosen student eats fish or eggs? 


a. 9/20 
b. 13/20 
c. 22/20 
d. 9/275 


e. 22/275 


Exercises 57 and 58 refer to the following setting. The casino game craps is based on rolling 


two dice. Here is the assignment of probabilities to the sum of the numbers on the up-faces 
when two dice are rolled: 


Outcome 2 3 4 5 6) 7 8 9 10 11 12 
Probability 1/36 2/36 3/36 4/36 5/36 6/36 5/386 4/36 3/36 2/36 1/36 


57. The most common bet in craps is the “pass line.” A pass line bettor wins immediately if 
either a 7 or an 11 comes up on the first roll. This is called a natural. What is the 
probability that a natural does not occur? 


a. 2/36 
b. 6/36 
c. 8/36 
d. 16/36 


e. 28/36 


58. If a player rolls a 2, 3, or 12, it is called craps. What is the probability of getting craps or 
an even sum on one roll of the dice? 


a. 4/36 
b. 18/36 
c. 20/36 
d. 22/36 
e. 32/36 


Recycle and Review 


59. Crawl before you walk (3.1, 3.2, 4.3) At what age do babies learn to crawl? Does it take 
longer to learn in the winter, when babies are often bundled in clothes that restrict their 
movement? Perhaps there might even be an association between babies’ crawling age and 
the average temperature during the month they first try to crawl (around 6 months after 
birth). Data were collected from parents who brought their babies to the University of 
Denver Infant Study Center to participate in one of a number of studies. Parents reported 
the birth month and the age at which their child was first able to creep or crawl a distance 
of 4 feet within one minute. Information was obtained on 414 infants (208 boys and 206 
girls). Crawling age is given in weeks, and average temperature (in degrees Fahrenheit) is 
given for the month that is 6 months after the birth month.12 


Birth month Average crawling age Average temperature 
January 29.84 66 
February 30.52 73 


March 29.70 72 


April 31.84 63 
May 28.58 52 
June 31.44 39 
July 33.64 33 
August 32.82 30 
September 33.83 33 
October 33.35 af 
November 33.38 48 
December 32.32 57 


a. Make an appropriate graph to display the relationship between average temperature and 
average crawling age. Describe what you see. 


Some computer output from a least-squares regression analysis of the data is shown. 


Term Coef SE Coef T-Value P-Value 
Constant 35.68 1.32 27.08 0.000 

Average -0.0777 0.0251 -3.10 0.011 

temperature 

S = 1.31920 R-Sq = 48.96% R-Sq(adj)= 43.86% 


b. What is the equation of the least-squares regression line that describes the relationship 
between average temperature and average crawling age? Define any variables that you 
use. 


c. Interpret the slope of the regression line. 


d. Can we conclude that warmer temperatures 6 months after babies are born causes them 
to crawl sooner? Justify your answer. 


60. Treating low bone density (4.2, 4.3) Fractures of the spine are common and serious 
among women with advanced osteoporosis (low mineral density in the bones). Can taking 
strontium ranelate help? A large medical trial was conducted to investigate this question. 
Researchers recruited 1649 women with osteoporosis who had previously had at least one 
fracture for an experiment. The women were assigned to take either strontium ranelate or a 
placebo each day. All the women were taking calcium supplements and receiving standard 


medical care. One response variable was the number of new fractures over 3 years. 


a. Describe a completely randomized design for this experiment. 


b. Explain why it is important to keep the calcium supplements and medical care the same 
for all the women in the experiment. 


c. The women who took strontium ranelate had statistically significantly fewer new 
fractures, on average, than the women who took a placebo over a 3-year period. Explain 
what this means to someone who knows little statistics. 


SECTION 5.3 Conditional Probability and Independence 


LEARNING TARGETS By the end of the section, you should be able to: 
Calculate and interpret conditional probabilities. 


Determine if two events are independent. 


Use the general multiplication rule to calculate probabilities. 


Use a tree diagram to model a chance process involving a sequence of 
outcomes and to calculate probabilities. 


When appropriate, use the multiplication rule for independent events to calculate 
probabilities. 


The probability of an event can change if we know that some other event has occurred. For 
instance, suppose you toss a fair coin twice. The probability of getting two heads is 1/4 because 
the sample space consists of the 4 equally likely outcomes 


HH HT TH Tr HH HT TH TT 


Suppose that the first toss lands tails. Now what’s the probability of getting two heads? It’s 0. 
Knowing that the first toss is a tail changes the probability that you get two heads. 


This idea is the key to many applications of probability. 


What Is Conditional Probability? 


Let’s return to the college statistics class from Section 5.2. Earlier, we used the two-way table 
shown on the next page to find probabilities involving events A: is male and B: has a pierced 
ear for a randomly selected student. 


Gender 
Male Female Total 
. Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


Here is a summary of our previous results: 


P(A)=P(male)=90/178P(A nB)=P(male and pierced ear)=19/178P(B)=P(pierced ear)=103/178P( 
P(A) = P(male) = 90/178 P(A B) = P(male and pierced ear) = 19/178 


P(B) = P(pierced ear) = 103/178 P(AUB) = P(male or pierced ear) = 174/178 


Now let’s turn our attention to some other interesting probability questions. 


. If we know that a randomly selected student has a pierced ear, what is the probability 
that the student is male? There are 103 students in the class with a pierced ear. We can 
restrict our attention to this group, since we are told that the chosen student has a pierced ear. 
Because there are 19 males among the 103 students with a pierced ear, the desired 


probability is 
P(male given pierced ear)=19/103=0.184=18.4% 
P(male given pierced ear) = 19/103 = 0.184 = 18.4% 
Gender 
Male Female Total 
. Yes 103 
Pierced ear 

No 15 
Total 90 88 178 


’. If we know that a randomly selected student is male, what’s the probability that the 
student has a pierced ear? This time, our attention is focused on the males in the class. 
Because 19 of the 90 males in the class have a pierced ear, 


P(pierced ear given male)=19/90=0.211=21.1% 
P(pierced ear given male) = 19/90 = 0.211 = 21.1% 


Gender 
Male Female Total 
; Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


These two questions sound alike, but they actually ask two very different things. Each of 


these probabilities is an example of a conditional probability. The name comes from the fact 
that we are trying to find the probability that one event will happen under the condition that 


some other event is already known to have occurred. We often use the phrase “given that” to 


signal the condition. 


DEFINITION Conditional probability 


The probability that one event happens given that another event is known to have 
happened is called a conditional probability. The conditional probability that 
event B happens given that event A has happened is denoted by P(B | A). 


With this new notation available, we can restate the answers to the two questions just posed 


as 
P(male|pierced ear)=P(A|B)=19/103andP(pierced ear|male)=P(B|A)=19/90 


P(male|pierced ear) = P(A|B) = 19/103 
and 
P(pierced ear|male) = P(B|A) = 19/90 


Here’s an example that illustrates how conditional probability works in a familiar setting. 


EXAMPLE |A Titanic disaster (>) 
Two-way tables and conditional probabilities 


Ss ——es 

Images Group/REX/Shutterstock 

PROBLEM: In 1912, the luxury liner Titanic, on its first voyage across the Atlantic, struck 
an iceberg and sank. Some passengers got off the ship in lifeboats, but many died. The two- 
way table gives information about adult passengers who survived and who died, by class of 
travel. 


Class of travel 


First Second Third Total 

Survival Survived 442 
status Died 765 
Total 319 261 627 1207 


Suppose we randomly select one of the adult passengers from the Titanic. Define events F: 
first-class passenger, S: survived, and T: third-class passenger. 


a. Find P(T|S). Interpret this value in context. 


b. Given that the chosen person is not a first-class passenger, what’s the probability that she 
or he survived? Write your answer as a probability statement using correct symbols for 
the events. 


SOLUTION: 


a. P (TIS) = P(third-class passenger | survived) = 151/442 = 0.342. Given that the 


randomly chosen person survived, there ts about aA 34.2% chance that she OT he was a 
third-class passenger. 


To answer part (a), only consider values in the “Survived” row. 


b. P(survivedinot first-class passenger)=P(SIF C)=9441 51261+627=245888=0.276 
P(survived|not first-class passenger ) 


= P(S|F°) = = = 8 = 0.276 


2614627 888 


To answer part (b), only consider values in the “Second class” and “Third class” 


columns. 
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Is there is a connection between conditional probability and conditional relative frequency 
from Chapter 1? Yes! In part (a) of the example, we found the conditional probability P(third- 
class passenger|survived) = 151/442 = 0.342. In Chapter 1, we asked, “What proportion of 
survivors were third-class passengers?” Our answer was also 151/442 = 0.342. This is a 
conditional relative frequency because we are finding the percent or proportion of third-class 
passengers among those who survived. So a conditional probability is just a conditional relative 
frequency that comes from a chance process—in this case, randomly selecting an adult 
passenger. 


Let’s look more closely at how conditional probabilities are calculated using the data from 
the college statistics class. From the two-way table that follows, we see that 


P(male|pierced ear)=19103=number of students who are male and have a pierced earnumber of : 


-_ 19 number of students who are male and have a pierced ear 
P(male|pierced ear) = > 


103 — number of students with a pierced ear 
Gender 
Male Female Total 
. Yes 103 
Pierced ear 
No 15 
Total 90 88 178 


What if we focus on probabilities instead of numbers of students? Notice that 


P(male and pierced ear)P(pierced ear)=19178103178=19103=P(male | pierced ear) 
19 


P(male and pierced ear) . 178 _ 49 


P(pierced ear) 103 103 
178 


= P (male | pierced ear) 


This observation leads to a general formula for computing a conditional probability. 


CALCULATING CONDITIONAL PROBABILITIES 


To find the conditional probability P(A | B), use the formula 


P(A | B)=P(A and B)P(B)=P(A nm B)P(B)=P(both events occur)P(given event occurs) 
P(A\B) - P(AandB) P(AMB) _ P(both events occur) 
~ P(B) ~— P(B):~—CséP( given event occurs) 


By the same reasoning, 
P(B| A) - P(BandA) P(BNM A) 
P(B | A)=P(B and A)P(A)=P(B n A)P(A) P(A) P(A) 


EXAMPLE | Facebook or Instagram? (>) 
Calculating conditional probability 


PROBLEM: A survey of all residents in a large apartment complex reveals that 68% use 
Facebook, 28% use Instagram, and 25% do both. Suppose we select a resident at random. 
Given that the person uses Facebook, what’s the probability that she or he uses Instagram? 


SOLUTION: 

P(Instagram|Facebook)=P(IIF)=P(I A F)P(F)=0.250.68=0.368 
3 : _ ' _ PUINF) — 0.25 _ 

P(Instagram|Facebook) = P(I|F) = a Le 0.368 
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Refer back to the example. If the person chosen is an Instagram user, what is the probability 
that he or she uses Facebook? By the conditional probability formula, it’s 


P(Facebook|Instagram)=P(F|I)=P(F n I)P(1)=0.250.28=0.893 
P(FNI) 0.25 


—— = ().893 


P(Facebook|Instagram) = P(F|I) = Pi) 0.28 


AP® EXAM TIP 


You can write statements like P (A | B) if events A and B are clearly defined ina 
problem. Otherwise, it’s probably easier to use contextual labels, like P (I | F) in the 
preceding example. Or you can just use words: P (Instagram | Facebook). 


If the chosen resident uses Instagram, it is extremely likely that he or she uses Facebook. 
However, if the chosen resident uses Facebook, he or she is not nearly so likely to use 
Instagram. 

You could also use a two-way table to help you find these conditional probabilities. Here’s 
the table that we made for this setting in Section 5.2 (page 325). It is easy to see that 


P(Instagram|Facebook)=0.250.68=0.368andP(Facebook|Instagram)=0.250.28=0.893 


0.25 | 0.25 
P(Instagram|Facebook) =e 0.368 and P(Facebook|Instagram) = 0.893 


Facebook use 


Yes No Total 
Yes 28% 
Instagram use 
No 72% 
Total 68% 32% 100% 


CHECK YOUR UNDERSTANDING 


+. 


”, Yellowstone National Park surveyed a random sample of 1526 winter visitors to the 
park. They asked each person whether he or she owned, rented, or had never used a 
snowmobile. Respondents were also asked whether they belonged to an environmental 
organization (like the Sierra Club). The two-way table summarizes the survey responses. 


Environmental club 


No Yes Total 

; Never used 657 

Snowmobile Renter 574 
experience 

Owner 295 

Total 1221 305 1526 


Suppose we randomly select one of the survey respondents. Define events E: environmental 
club member, S: snowmobile owner, and N: never used. 

1. Find P(N | E). Interpret this value in context. 

2. Given that the chosen person is not a snowmobile owner, what’s the probability that she or 
he is an environmental club member? Write your answer as a probability statement using 
correct symbols for the events. 

3. Is the chosen person more likely to not be an environmental club member if he or she has 
never used a snowmobile, is a snowmobile owner, or a snowmobile renter? Justify your 


answer. 


Conditional Probability and Independence 


Suppose you toss a fair coin twice. Define events A: first toss is a head, and B: second toss is a 
head. We know that P(A) = 1/2 and P(B) = 1/2. 
e What’s P(B | A)? It’s the conditional probability that the second toss is a head given that the 
first toss was a head. The coin has no memory, so P(B | A) = 1/2. 
© What’s P(B | A©)? It’s the conditional probability that the second toss is a head given that 
the first toss was not a head. Getting a tail on the first toss does not change the probability of 
getting a head on the second toss, so P(B | A°) = 1/2. 
In this case, P(B | A) = P(B | A©) = P(B). Knowing the outcome of the first toss does not 
change the probability that the second toss is a head. We say that A and B are independent 
events. 


DEFINITION Independent events 

A and B are independent events if Knowing whether or not one event has 
occurred does not change the probability that the other event will happen. In other 
words, events A and B are independent if 


P(AIB)=P(AIBC)=P(AyP(AIB) = P(AIB®) = P(A) 


Alternatively, events A and B are independent if 


P(BIA)=P(BIAC)=P(By’ BIA) = PBIA™) = PB) 


Let’s contrast the coin-toss scenario with our earlier pierced-ear example. In that case, the 
chance process involved randomly selecting a student from a college statistics class. The events 
of interest were A: is male, and B: has a pierced ear. Are these two events independent? 


Gender 
Male Female Total 
: Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


e Suppose that the chosen student is male. We can see from the two-way table that P(pierced 
ear | male) = P(B | A) = 19/90 = 0.211. 

e Suppose that the chosen student is female. From the two-way table, we see that P(pierced ear 
| female) = P(B | A°) = 84/88 = 0.955. 


Knowing that the chosen student is a male changes (greatly reduces) the probability that the 
student has a pierced ear. So these two events are not independent. 


Another way to determine if two events A and B are independent is to compare P(A | B) to 
P(A) or P(B | A) to P(B). For the pierced-ear setting, 


P(pierced ear|male)=P(B|A)=19/90=0.211" (pierced ear|male) = P(B|A) = 19/90 = 0.211 
The unconditional probability that the chosen student has a pierced ear is 
P(pierced ear)=P(B)=103/178=0.579" (Pierced ear) = P(B) = 103/178 = 0.579 


Again, knowing that the chosen student is male changes (reduces) the probability that the 
individual has a pierced ear. So these two events are not independent. 


EXAMPLE | Gender and handedness (>) 
Checking for independence 


Graeme Harris/Getty Images 


PROBLEM: Is there a relationship between gender and handedness? To find out, we used 

Census At School’s Random Data Selector to choose an SRS of 100 Australian high school 
students who completed a survey. The two-way table summarizes the relationship between 

gender and dominant hand for these students. 


Gender 
Male Female Total 
i Right 90 
Dominant hand 
Left 10 
Total 46 54 100 


Suppose we choose one of the students in the sample at random. Are the events “male” and 
“left-handed” independent? Justify your answer. 


SOLUTION: 


P(left-handedimale)=7/46=0.152" (left-handed |male) = 7/46 = 0.152 
P(left-handedifemale)=3/5.4=0.056" (left-handed|female) = 3/54 = 0.056 


Because these probabilities are not equal, the events “male” and “left-handed” are not 
independent. Knowing that the student is male increases the probability that the student 


is left-handed. 
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In the example, we could have also determined that the two events are not independent by 
showing that 


P(left-handed | male)=7/46=0.152P(left-handed)=10/100=0.100 
P (left-handed | male) = 7/46 = 0.152 # P (left-handed) = 10/100 = 0.100 


Or we could have focused on whether knowing that the chosen student is left-handed changes 
the probability that the person is male. Because 


P(male | left-handed)=7/10=0.70#P(male|right-handed)=39/90=0.433 
P (male | left-handed) = 7/10 = 0.70 4 P (male|right-handed) = 39/90 = 0.433 


the events “male” and “left-handed” are not independent. 
You might have thought, “Surely there’s no connection between gender and handedness. 
The events ‘male’ and ‘left-handed’ are bound to be independent.” As the example shows, you 


can’t use your intuition to check whether events are independent. To be sure, you have to 
calculate some probabilities. 


Left- 
handed 

@ Right- 
handed 


Percent 


Male Female 


Gender 


Is there a connection between independence of events and association between two 
variables? Yes! In the preceding example, we found that the events “male” and “left-handed” 
were not independent for the sample of 100 Australian high school students. Knowing a 
student’s gender helped us predict his or her dominant hand. By what you learned in Chapter 1, 
there is an association between gender and handedness for the students in the sample. The 
segmented bar graph shows the association in picture form. 


Does that mean an association exists between gender and handedness in the larger 
population? Maybe. If there is no association between the variables in the population, it would 
be surprising to choose a random sample of 100 students for which P(left-handed | male), 
P(left-handed | female), and P(left-handed) were exactly equal. But these probabilities should 
be close to equal if there’s no association between the variables in the population. How close is 
close? We’Il discuss this issue further in Chapter 11. 


CHECK YOUR UNDERSTANDING 


. 


y For each chance process given, determine whether the events are independent. Justify 

your answer. 

1. Shuffle a standard deck of cards, and turn over the top card. Put it back in the deck, shuffle 
again, and turn over the top card. Define events A: first card is a heart, and B: second card 
is a heart. 

2. Shuffle a standard deck of cards, and turn over the top two cards, one at a time. Define 
events A: first card is a heart, and B: second card is a heart. 


3. The 28 students in Mr. Tabor’s AP® Statistics class took a quiz on conditional probability 
and independence. The two-way table summarizes the class’s quiz results based on gender 
and whether the student got an A. Choose a student from the class at random. The events 
of interest are “female” and “got an A.” 


Gender 
Female Male 
Yes 1 
GotanA 
No 6 


The General Multiplication Rule 


Suppose that A and B are two events resulting from the same chance process. We can find the 
probability P(A or B) with the general addition rule: 


P(A or B)=P(A)+P(B)-P(A and B)? (A or B) = P(A) + P(B) — P(A and B) 


How do we find the probability that both events happen, P(A and B)? 

Consider this situation: about 55% of high school students participate in a school athletic 
team at some level. Roughly 6% of these athletes go on to play on a college team in the 
NCAA.12 What percent of high school students play a sport in high school and go on to play 
on an NCAA team? About 6% of 55%, or roughly 3.3%. 

Let’s restate the situation in probability language. Suppose we select a high school student 
at random. What’s the probability that the student plays a sport in high school and goes on to 
play on an NCAA team? The given information suggests that 


P(high school sport)=0.55 and P(NCAA team | high school sport)=0.06 
P (high school sport) = 0.55 and P (NCAA team | high school sport) = 0.06 


By the logic just stated, 


P(high school sport and NCAA team)=P(high school sport)-P(NCAA team | high school sport)= 
P(high school sport and NCAA team) 


= P(high school sport) - P(/(NCAA team | high school sport) 
(0.55)(0.06)=0.033 = (0-55)(0.06) = 0.033 


This is an example of the general multiplication rule. 


DEFINITION General multiplication rule 


For any chance process, the probability that events A and B both occur can be 
found using the general multiplication rule: 


P(A and B)=P(AnB)=P(A)-P(BIA)/ (A and B) = P(A B) = P(A) - P(B\A) 


The general multiplication rule says that for both of two events to occur, first one must 
occur. Then, given that the first event has occurred, the second must occur. To confirm that this 
result is correct, start with the conditional probability formula 


ay _P(BNA) 
P(BIA)=P(BNA)P(A) P(A) 


The numerator gives the probability we want because P(B n A) is the same as P(A rn B). 
Multiply both sides of the previous equation by P(A) to get 


P(A):P(BJA)=P(AnByP(A) : P(BIA) = P(A B) 


EXAMPLE | Teens and social media (>) 
The general multiplication rule 


PROBLEM: The Pew Internet and American Life Project reported that 79% of teenagers 
(ages 13 to 17) use social media, and that 39% of teens who use social media feel pressure to 
post content that will be popular and get lots of comments or likes.42 Find the probability 
that a randomly selected teen uses social media and feels pressure to post content that will be 
popular and get lots of comments or likes. 


SOLUTION: 

P (use social media and feel pressure) =P (use social media)-P (feel pressure | use social media)=(0.79) 
(0.39) = 0.308 

P (use social media and feel pressure) = P (use social media) - P (feel pressure | use social media) 


= (0.79) (0.39) = 0.308 
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Tree Diagrams and Conditional Probability 


Shannon hits the snooze button on her alarm on 60% of school days. If she hits snooze, there is 
a 0.70 probability that she makes it to her first class on time. If she doesn’t hit snooze and gets 
up right away, there is a 0.90 probability that she makes it to class on time. Suppose we select a 
school day at random and record whether Shannon hits the snooze button and whether she 
arrives in class on time. Figure 5.7 shows a tree diagram for this chance process. 


P(on time | hits snooze) 


Vv On time 
0.70_ — 


SN 
Se vy On time 
0.40™ 0.90 — 
S| Doesn’t hit | ~ 
snooze button} ——~_ 
0.10 ~~ 
Late 


FIGURE 5.7 A tree diagram displaying the sample space of randomly choosing a school day 
and noting if Shannon hits the snooze button or not and whether she gets to her first class on 
time. 
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There are only two possible outcomes at the first “stage” of this chance process: Shannon 
hits the snooze button or she doesn’t. The first set of branches in the tree diagram displays 
these outcomes with their probabilities. The second set of branches shows the two possible 
results at the next “stage” of the process—Shannon gets to her first class on time or arrives late 
—and the probability of each result based on whether or not she hit the snooze button. Note 
that the probabilities on the second set of branches are conditional probabilities, like P(on 
time | hits snooze) = 0.70. 


DEFINITION Tree diagram 


A tree diagram shows the sample space of a chance process involving multiple 
stages. The probability of each outcome is shown on the corresponding branch of 
the tree. All probabilities after the first stage are conditional probabilities. 


We can ask some interesting questions related to the tree diagram: 


e What is the probability that Shannon hits the snooze button and is late for class on a 


randomly selected school day? The general multiplication rule provides the answer: 
P(hits snooze and late)=P(hits snooze)-P(late | hits snooze)=(0.60)(0.30)=0.18 
P (hits snooze and late) = P (hits snooze) - P (late | hits snooze) 
= (0.60) (0.30) 
= 0.18 


There is an 18% chance that Shannon hits the snooze button and is late for class. Note that 
the previous calculation amounts to multiplying probabilities along the branches of the tree 
diagram. 


What’s the probability that Shannon is late to class on a randomly selected school day? 
Figure 5.8 on the next page illustrates two ways this can happen: Shannon hits the snooze 
button and is late or she doesn’t hit snooze and is late. Because these outcomes are mutually 
exclusive, 


P(late)=P(hits snooze and late)+P(doesn’t hits snooze and late) 
P (late) = P (hits snooze and late) + P (doesn’t hits snooze and late) 


On time 


30 al Late | P(hits snooze and late) 
= (0.06)(0.30) = 0.18 


P(late) 
On time = 0.18 + 0.04 = 0.22 


~ 
ee 


0.10 al Late | P(doesn’t hit snooze and late) 
= (0.40)(0.10) = 0.04 


Doesn't hit 
snooze button 


FIGURE 5.8 Tree diagram showing the two possible ways that Shannon can be late to class 
on arandomly selected day. 


The general multiplication rule tells us that 


P(doesn’t hit snooze and late)=P(doesn’t hit snooze)-P(late | doesn’t hit snooze)=(0.40) 
(0.10)=0.04 


P (doesn’t hit snooze and late) = P (doesn’t hit snooze) - P (late | doesn’t hit snooze) 
= (0.40) (0.10) 
= 0.04 


So P(late) = 0.18 + 0.04 = 0.22.. There is a 22% chance that Shannon will be late to class. 


Suppose that Shannon is late for class on a randomly chosen school day. What is the 
probability that she hit the snooze button that morning? To find this probability, we start 
with the given information that Shannon is late, which is displayed on the second set of 
branches in the tree diagram, and ask whether she hit the snooze button, which is shown on 
the first set of branches. We can use the information from the tree diagram and the 


conditional probability formula to do the required calculation: 
P(hit snooze button | late)=P(hit snooze button and late)P(late)=0.180.22=0.818 
P(hit snooze button and late) 
P (hit snooze button | late) = ———_——____—_—_ 
P(late) 
0.18 
0.22 
= 0.818 


Given that Shannon is late for school on a randomly selected day, there is a 0.818 probability 
that she hit the snooze button. 


This method for solving conditional probability problems that involve “going backward” in a tree 
diagram is sometimes referred to as Bayes’s theorem. It was developed by the Reverend Thomas 
Bayes in the 1700s. 


Some interesting conditional probability questions—like this one about P(hit snooze button 
| late)—involve “going in reverse” on a tree diagram. Note that we just use the conditional 
probability formula and plug in the appropriate values to answer such questions. 


EXAMPLE |Do people read more ebooks or print books? » 
Tree diagrams and probability 


Burlingham/Shuterstock.com 


PROBLEM: Recently, Harris Interactive reported that 20% of millennials, 25% of Gen 
Xers, 21% of baby boomers, and 17% of matures (age 68 and older) read more ebooks than 
print books. According to the U.S. Census Bureau, 34% of those 18 and over are millennials, 
22% are Gen Xers, 30% are baby boomers, and 14% are matures. Suppose we select one 
U.S. adult at random and record which generation the person is from and whether she or he 
reads more ebooks or print books. 


a. Draw a tree diagram to model this chance process. 


b. Find the probability that the person reads more ebooks than print books. 


c. Suppose the chosen person reads more ebooks than print books. What’s the probability 
that she or he is a millennial? 


SOLUTION: 
a. 
0.20 Reads more ebooks 
Millennial 
0.80 Reads more print books 
0.25 Reads more ebooks 
Gen Xer 
Randomly 0.75 Reads more print books 
chosen 
U.S. adult Baby 0.21 Reads more ebooks 
Boomer 0.79 Reads more print books 
0.17 Reads more ebooks 
Millennial 
0.83 Reads more print books 
b. 


P(reads more ebooks) = (0.34)(0.20)+(0.22)(0.25)+(0.30)(0.21)+(0.14)(0.17)= 0.0680+0.0550+0.06304 


P(millennial | reads more ebooks) = P(millennial and reads more ebooks)P(reads more ebook 


P(AMB) 
P(A|B) =_—_—— 


P(A|B)=P(AnB)P(B) P(B) 
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One of the most important applications of conditional probability is in the area of drug and 
disease testing. 


EXAMPLE |Mammograms (>) 
Tree diagrams and conditional probability 


Sam Edwards/Getty Images 


PROBLEM: Many women choose to have annual mammograms to screen for breast cancer 
after age 40. A mammogram isn’t foolproof. Sometimes the test suggests that a woman has 
breast cancer when she really doesn’t (a “false positive”). Other times, the test says that a 
woman doesn’t have breast cancer when she actually does (a “false negative”). 

Suppose that we know the following information about breast cancer and mammograms in a 
particular population: 


e One percent of the women aged 40 or over in this population have breast cancer. 
e For women who have breast cancer, the probability of a negative mammogram is 0.03. 


e For women who don’t have breast cancer, the probability of a positive mammogram is 
0.06. 

A randomly selected woman aged 40 or over from this population tests positive for breast 

cancer in a mammogram. Find the probability that she actually has breast cancer. 


SOLUTION: 
Positive 
Hes tvenet 0.97 mammogram 
cancer 
0.01 0.03 Negative 
mammogram 
Positive 
Doesn't have 
breast cancer 
0.94 Nagai 


mammogram 


Start by making a tree diagram to summarize the possible outcomes. 
e Because 1% of women in this population have breast cancer, 99% don't have 
breast cancer. 


Of those women who do have breast cancer, 3% would test negative ona 
mammogram. The remaining 97% would (correctly) test positive. 

Among the women who don't have breast cancer, 6% would test positive ona 
mammogram. The remaining 94% would (correctly) test negative. 


P (breast cancer | positive mammogram) =P(breast cancer and positive mammogram) (positive mamm 
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Are you surprised by the final result of the example—given that a randomly selected 


woman from the population in question has a positive mammogram, there is only about a 14% 


chance that she has breast cancer? Most people are. Sometimes a two-way table that includes 
counts is more convincing. 


To make calculations simple, we’!l suppose that there are exactly 10,000 women aged 40 or 


over in this population, and that exactly 100 have breast cancer (that’s 1% of the women). 


e How many of those 100 would have a positive mammogram? It would be 97% of 100, or 97 


of them. That leaves 3 who would test negative. 


e How many of the 9900 women who don’t have breast cancer would get a positive 
mammogram? Six percent of them, or (9900)(0.06) = 594 women. The remaining 9900 — 
594 = 9306 would test negative. 


e In total, 97 + 594 = 691 women would have positive mammograms and 3 + 9306 = 9309 
women would have negative mammograms. 


This information is summarized in the two-way table. 


Has breast cancer? 


Yes No Total 

Mammogram Positive 691 
result Negative 9309 
Total 100 9900 10,000 


Given that a randomly selected woman has a positive mammogram, the two-way table shows 
that the conditional probability 


P(breast cancer|positive mammogram)=97/691=0.14 
P(breast cancer|positive mammogram) = 97/691 = 0.14 


This example illustrates an important fact when considering proposals for widespread 


testing for serious diseases or illegal drug use: if the condition being tested is uncommon in the 
population, many positives will be false positives. The best remedy is to retest any individual 


who tests positive. 


CHECK YOUR UNDERSTANDING 


q 


y A computer company makes desktop, laptop, and tablet computers at factories in two 
states: California and Texas. The California factory produces 40% of the company’s 
computers and the Texas factory makes the rest. Of the computers made in California, 25% 
are desktops, 30% are laptops, and the rest are tablets. Of those made in Texas, 10% are 
desktops, 20% are laptops, and the rest are tablets. All computers are first shipped to a 
distribution center in Missouri before being sent out to stores. Suppose we select a computer 
at random from the distribution center and observe where it was made and whether it is a 


desktop, laptop, or tablet 14 

1. Construct a tree diagram to model this chance process. 

2. Find the probability that the computer is a tablet. 

3. Given that a tablet computer is selected, what is the probability that it was made in 


California? 


The Multiplication Rule for Independent Events 


What happens to the general multiplication rule in the special case when events A and B are 
independent? In that case, P(B | A) = P(B). We can simplify the general multiplication rule as 


follows: 


P(A and B)=P(AnB)==P(A)-P(BJA)P(A):P(B) 
P(AandB) = P(ANB) =  P(A)-P(BIA) 
— P(A)-P(B) 


This result is known as the multiplication rule for independent events. 


DEFINITION Multiplication rule for independent events 
If A and B are independent events, the probability that A and B both occur is 


P(A and B)=P(AnB)=P(A)-P(B)/ (A and B) = P(A B) = P(A) - P(B) 


og Note that this rule applies only to independent events. 


Suppose that Pedro drives the same route to work on Monday through Friday. His route 
includes one traffic light. The probability that the light will be green when Pedro arrives is 
0.42, yellow is 0.03, and red is 0.55. 


.. What’s the probability that the light is green on Monday and red on Tuesday? Let event 
A be green light on Monday and event B be red light on Tuesday. These two events are 
independent because knowing that the light was green on Monday doesn’t help us predict the 
color of the light on Tuesday. By the multiplication rule for independent events, 

P(green on Monday and red on Tuesday)=P(A and B) =P(A)-P(B)=(0.42)(0.55)=0.231 
P (green on Monday and red on Tuesday) = P(A and B) 

P(A)- P(B) 

(0.42) (0.55) 

= 0.231 


There’s about a 23% chance that the light will be green on Monday and red on Tuesday. 


». What’s the probability that Pedro finds the light red on Monday through Friday? We 
can extend the multiplication rule for independent events to more than two events: 


P(red Monday and red Tuesday and red Wednesday and red Thursday and red 
Friday)=P(red Monday)-P(red Tuesday)-P(red Wednesday)-P(red Thursday)-P(red 
Friday)= (0.55)(0.55)(0.55)(0.55)(0.55)=(0.55)5=0.0503 
P(red Monday and red Tuesday and red Wednesday and red Thursday and red Friday) 


=P(red Monday) - P(red Tuesday) - P(red Wednesday) - P(red Thursday) - P(red Friday) 
=(0.55)(0.55)(0.55) (0.55) (0.55) 
=(0.55)° 
= 0.0503 
There is about a 5% chance that Pedro will encounter a red light on all five days in a work 
week. 


EXAMPLE |The Challenger disaster (») 
Multiplication rule for independent events 


Thom Baur/AP Images 


PROBLEM: On January 28, 1986, the space shuttle Challenger exploded on takeoff. All 
seven crew members were killed. Following the disaster, scientists and statisticians helped 
analyze what went wrong. They determined that the failure of O-ring joints in the shuttle’s 
booster rockets was to blame. Under the cold conditions that day, experts estimated that the 
probability that an individual O-ring joint would function properly was 0.977. But there were 
six of these O-ring joints, and all six had to function properly for the shuttle to launch safely. 
Assuming that O-ring joints succeed or fail independently, find the probability that the 
shuttle would launch safely under similar conditions. 


SOLUTION: 
P(O-ring 1 OK and O-ring 2 OK and O-ring 3 OK and O-ring 4 OK and O-ring 5 OK and O- 
ring 6 OK)=P(O-ring 1 OK)-P(O-ring 2 OK)-P(O-ring 3 OK)-P(O-ring 4 OK)-P(O-ring 5 OK)-P(O- 
ring 6 OK)= (0.977)(0.977)(0.977)(0.977)(0.977)(0.977)=(0.977)6=0.870 
P(O-ring 1 OK and O-ring 2 OK and O-ring 3 OK and O-ring 4 OK and O-ring 5 OK and O-ring 6 OK) 


= P(QO-ring 1 OK) - P(O-ring 2 OK) - P(O-ring 3 OK) - P(O-ring 4 OK) - P(O-ring 5 OK) - P(O-ring 6 OK) 
= (0.977)(0.977) (0.977) (0.977) (0.977) (0.977) 

= (0.977)° 

= 0.870 
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The multiplication rule for independent events can also be used to help find P(at least one). 
In the preceding example, the shuttle would not launch safely under similar conditions if 1 or 2 
or 3 or 4or 5 or all 6 O-ring joints fail—that is, if at least one O-ring fails. The only possible 
number of O-ring failures excluded is 0. So the events “at least one O-ring joint fails” and “no 
O-ring joints fail” are complementary events. By the complement rule, 

P(at least one O-ring fails)=1—P(no O-ring fails)=1—0.87=0.13 
P (at least one O-ring fails) = 1-— P(no O-ring fails) 

1 — 0.87 
= 0.13 


That’s a very high chance of failure! As a result of this analysis following the Challenger 
disaster, NASA made important safety changes to the design of the shuttle’s booster rockets. 


EXAMPLE | Rapid HIV testing (>) 
Finding the probability of “at least one” 


PROBLEM: Many people who visit clinics to be tested for HIV, the virus that causes AIDS, 
don’t come back to learn their test results. Clinics now use “rapid HIV tests” that give a 
result while the client waits. In a clinic in Malawi, for example, use of rapid tests increased 
the percentage of clients who learned their test results from 69% to over 99%. 


The trade-off for fast results is that rapid tests are less accurate than slower laboratory tests. 
Applied to people who have no HIV antibodies, one rapid test has a probability of about 
0.004 of producing a false positive (i.e., of falsely indicating that antibodies are present)./2 
If a clinic tests 200 randomly selected people who are free of HIV antibodies, what is the 
probability that at least one false positive will occur? Assume that test results for different 


individuals are independent. 


SOLUTION: 


P(no false positives)=P(all 200 tests negative)=(0.996)(0.996)--- 


(0.996)=0.996200=0.4486P(at least one false positive)=1 — 0.4486 = 0.5514 
P (no false positives) = P (all 200 tests negative) 


— (0.996) (0.996) - - - (0.996) 
= 0.9967 
= 0.4486 

P (at least one false positive) = 1-— 0.4486 = 0.5514 


Start by finding P (no false positives). 


The probability that any individual test result is negative is 1 — 0.004 = 0.996. 
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USING THE MULTIPLICATION RULE FOR INDEPENDENT EVENTS WISELY The 
multiplication rule P(A and B) = P(A) - P(B) holds if A and B are independent but not 
otherwise. The addition rule P(A or B) = P(A) + P(B) holds if A and B are mutually exclusive 
but not otherwise. o@ Resist the temptation to use these simple rules when the conditions 
that justify them are not met. 
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Hagar the Horrible used with the permission of King Features Syndicate and the Cartoonist Group. All rights reserved. 


EXAMPLE | Watch the weather! (») 
Beware lack of independence! 


PROBLEM: Hacienda Heights and La Puente are two neighboring suburbs in the Los 
Angeles area. According to the local newspaper, there is a 50% chance of rain tomorrow in 
Hacienda Heights and a 50% chance of rain in La Puente. Does this mean that there is a (0.5) 
(0.5) = 0.25 probability that it will rain in both cities tomorrow? 


SOLUTION: 


No: it is not appropriate to multiply the two probabilities, because “raining tomorrow in 
Hacienda Heights” and “raining tomorrow in La Puente” are not independent events. aie tt 
is raining in one of these locations, there is a high probability that it is raining in the 
other location because they are geographically close to each other. 
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Is there a connection between mutually exclusive and independent? Let’s start with a new 
chance process. Choose a U.S. adult at random. Define event A: the person is male, and event 
B: the person is pregnant. It’s fairly clear that these two events are mutually exclusive (can’t 
happen together)! Are they also independent? 

If you know that event A has occurred, does this change the probability that event B 
happens? Of course! If we know the person is male, then the chance that the person is pregnant 
is 0. But the probability of selecting someone who is pregnant is greater than 0. Because P(B | 
A) # P(B), the two events are not independent. Two mutually exclusive events (with nonzero 
probabilities) can never be independent, because if one event happens, the other event is 
guaranteed not to happen. 


CHECK YOUR UNDERSTANDING 


‘. 


,, Questions 1 and 2 refer to the following setting. New Jersey Transit claims that its 8:00 
A.M. train from Princeton to New York has probability 0.9 of arriving on time. Assume that 
this claim is true. 


1. Find the probability that the train arrives late on Monday but on time on Tuesday. 
2. What’s the probability that the train arrives late at least once in a 5-day week? 


3. Government data show that 8% of adults are full-time college students and that 15% of 
adults are age 65 or older. If we randomly select an adult, is P(full-time college student 


and age 65 or older) = (0.08)(0.15)? Why or why not? 


Section 5.3 Summary 


e A conditional probability describes the probability that one event happens given that 
another event is already known to have happened. 


e One way to calculate a conditional probability is to use the formula 


P(A|B)=P(A and B)P(B)=P(AnB)P(B)=P(both events occur)P(given event occurs) 
P(AIB) - P(AandB) P(ANMB)  P(both events occur) 
ae) = P(B) ~—-P(B) ~~_—_—«~P(given event occurs) 


e When knowing whether or not one event has occurred does not change the probability that 
another event happens, we say that the two events are independent. Events A and B are 
independent if 


P(A|B)=P(A|BC)=P(A)P(AIB) = P(AIB®) = P(A) 


or, alternatively, if 
P(BIA)=P(BIAC)=P(B)P(BIA) = P(BIA®) = P(B) 


e Use the general multiplication rule to calculate the probability that events A and B both 
occur: 


P(A and B)=P(AnB)=P(A)-P(BIA)! (A and B) = P(AMB) = P(A) - P(BIA) 
e When a chance process involves multiple stages, a tree diagram can be used to display the 
sample space and to help answer questions involving conditional probability. 


e In the special case of independent events, the multiplication rule becomes 


P(A and B)=P(AnB)=P(A)-P(B)P (A and B) = P(A B) = P(A) - P(B) 


Section 5.3 Exercises 


61. pg 332 _») Superpowers A random sample of 415 children from England and the 


United States who completed a survey in a recent year was selected. Each student’s 
country of origin was recorded along with which superpower they would most like to 
have: the ability to fly, ability to freeze time, invisibility, superstrength, or telepathy 
(ability to read minds). The data are summarized in the two-way table. 


Country 
England U.S. Total 
Fly 54 45 99 
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Freeze time 96 
Superpower Invisibility 67 
Superstrength 43 
Telepathy 110 
Total 415 


Suppose we randomly select one of these students. Define events E: England, T: telepathy, 
and S: superstrength. 


a. Find P(T | E). Interpret this value in context. 


b. Given that the student did not choose superstrength, what’s the probability that this 
child is from England? Write your answer as a probability statement using correct 
symbols for the events. 


Get rich A survey of 4826 randomly selected young adults (aged 19 to 25) asked, “What 
do you think are the chances you will have much more than a middle-class income at age 


302” The two-way table summarizes the responses.1® 
Gender 
Female Male Total 
Almost no chance 194 
Some chance but 712 
_ probably not 

Opinion A 50-50 chance 1416 
A good chance 1421 
Almost certain 1083 
Total 2367 2459 4826 


Choose a survey respondent at random. Define events G: a good chance, M: male, and N: 
almost no chance. 


a. Find P(G | M). Interpret this value in context. 


b. Given that the chosen survey respondent didn’t say “almost no chance,” what’s the 
probability that this person is female? Write your answer as a probability statement 
using correct symbols for the events. 


Body image A random sample of 1200 U.S. college students was asked, “What is your 
perception of your own body? Do you feel that you are overweight, underweight, or about 
right?” The two-way table below summarizes the data on perceived body image by 
gender12 


Gender 
Female Male 


About right 295 


Body image 
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Overweight 163 fz 
Underweight of fe 


Suppose we randomly select one of the survey respondents. 


a. Given that the person perceived his or her body image as about right, what’s the 
probability that the person is female? 


b. If the person selected is female, what’s the probability that she did not perceive her 
body image as overweight? 


Temperature and hatching How is the hatching of water python eggs influenced by the 
temperature of a snake’s nest? Researchers randomly assigned newly laid eggs to one of 
three water temperatures: cold, neutral, or hot. Hot duplicates the extra warmth provided 
by the mother python, and cold duplicates the absence of the mother. 


Nest temperature 


Cold Neutral Hot 

. Hatched 75 
Hatching status - 

Didn’t hatch 11 18 29 


Suppose we select one of the eggs at random. 


a. Given that the chosen egg was assigned to hot water, what is the probability that it 
hatched? 


b. If the chosen egg hatched, what is the probability that it was not assigned to hot water? 


Foreign-language study Choose a student in grades 9 to 12 at random and ask if he or she 
is studying a language other than English. Here is the distribution of results: 


Language Spanish French German All others None 
Probability 0.26 0.09 0.03 0.03 0.59 
a. What’s the probability that the student is studying a language other than English? 


b. What is the probability that a student is studying Spanish given that he or she is 
studying some language other than English? 


Income tax returns Here is the distribution of the adjusted gross income (in thousands of 
dollars) reported on individual federal income tax returns in a recent year: 


Income <25 25-49 50-99 100—499 >500 
Probability 0.431 0.248 0.215 0.100 0.006 


a. What is the probability that a randomly chosen return shows an adjusted gross income 
of $50,000 or more? 


b. Given that a return shows an income of at least $50,000, what is the conditional 
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probability that the income is at least $100,000? 


Tall people and basketball players Select an adult at random. Define events T: person is 
over 6 feet tall, and B: person is a professional basketball player. Rank the following 
probabilities from smallest to largest. Justify your answer. 


P(T) P(B) P (TB) P (BIT) 


Teachers and college degrees Select an adult at random. Define events D: person has 
earned a college degree, and T: person’s career is teaching. Rank the following 
probabilities from smallest to largest. Justify your answer. 


P (D) P(T) P (DIT) P (T|D) 


pg 333 |p») Dogs and cats In one large city, 40% of all households own a dog, 32% 


own a Cat, and 18% own both. Suppose we randomly select a household and learn that the 
household owns a cat. Find the probability that the household owns a dog. 


Mac or PC? A recent census at a major university revealed that 60% of its students 
mainly used Macs. The rest mainly used PCs. At the time of the census, 67% of the 
school’s students were undergraduates. The rest were graduate students. In the census, 
23% of respondents were graduate students and used a Mac as their main computer. 
Suppose we select a student at random from among those who were part of the census and 
learn that the person mainly uses a Mac. Find the probability that the person is a graduate 
student. 


pg 336 |») Who owns a home? What is the relationship between educational 


achievement and home ownership? A random sample of 500 U.S. adults was selected. 
Each member of the sample was identified as a high school graduate (or not) and as a 
homeowner (or not). The two-way table summarizes the data. 


High school graduate 


Yes No Total 
Yes 340 
Homeowner 
No 160 
Total 310 190 500 


Are the events “homeowner” and “high school graduate” independent? Justify your 
answer. 


Is this your card? A standard deck of playing cards (with jokers removed) consists of 52 
cards in four suits—clubs, diamonds, hearts, and spades. Each suit has 13 cards, with 
denominations ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, jack, queen, and king. The jacks, queens, and 
kings are referred to as “face cards.” Imagine that we shuffle the deck thoroughly and deal 
one card. The two-way table summarizes the sample space for this chance process based 
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on whether or not the card is a face card and whether or not the card is a heart. 


Type of card 


Face card Nonface card Total 
; Heart 13 
Suit 
Nonheart 39 
Total 52 


Are the events “heart” and “face card” independent? Justify your answer. 


Cell phones The Pew Research Center asked a random sample of 2024 adult cell-phone 
owners from the United States their age and which type of cell phone they own: iPhone, 
Android, or other (including non-smartphones). The two-way table summarizes the data. 


Age 
18-34 35-54 55+ Total 
iPhone 467 
Type ofcell “A ndroid 503 
phone 
Other 1054 
Total 517 637 870 2024 


Suppose we select one of the survey respondents at random. 
a. Find P(iPhone | 18-34). 


b. Use your answer from part (a) to help determine if the events “iPhone” and “18-34” are 
independent. 


Middle school values Researchers carried out a survey of fourth-, fifth-, and sixth-grade 
students in Michigan. Students were asked whether good grades, athletic ability, or being 


popular was most important to them. The two-way table summarizes the survey data.12 
Grade 
Ath Sth 6th Total 
Grades 168 
Most 
Important “Athletic 98 
Popular 69 
Total 92 108 135 a0 


Suppose we select one of these students at random. 
a. Find P(athletic | 5th grade). 


b. Use your answer from part (a) to help determine if the events “Sth grade” and “athletic” 
are independent. 
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Rolling dice Suppose you roll two fair, six-sided dice—one red and one green. Are the 
events “sum is 7” and “green die shows a 4” independent? Justify your answer. (See 
Figure 5.2 on page 314 for the sample space of this chance process.) 


Rolling dice Suppose you roll two fair, six-sided dice—one red and one green. Are the 
events “sum is 8” and “green die shows a 4” independent? Justify your answer. (See 
Figure 5.2 on page 314 for the sample space of this chance process.) 


pg 338 |») Free downloads? Illegal music downloading is a big problem: 29% of 


Internet users download music files, and 67% of downloaders say they don’t care if the 
music is copyrighted.48 Find the probability that a randomly selected Internet user 
downloads music and doesn’t care if it’s copyrighted. 


At the gym Suppose that 10% of adults belong to health clubs, and 40% of these health 
club members go to the club at least twice a week. Find the probability that a randomly 
selected adult belongs to a health club and goes there at least twice a week. 


Box of chocolates According to Forrest Gump, “Life is like a box of chocolates. You 
never know what you’re gonna get.” Suppose a candymaker offers a special “Gump box” 
with 20 chocolate candies that look alike. In fact, 14 of the candies have soft centers and 6 
have hard centers. Suppose you choose 3 of the candies from a Gump box at random. Find 
the probability that all three candies have soft centers. 


Sampling students A statistics class with 30 students has 10 males and 20 females. 
Suppose you choose 3 of the students in the class at random. Find the probability that all 
three are female. 


pg 341 |») Filler up! Ina certain month, 88% of automobile drivers filled their 


vehicles with regular gasoline, 2% purchased midgrade gas, and 10% bought premium 
gas./2 Of those who bought regular gas, 28% paid with a credit card; of customers who 
bought midgrade and premium gas, 34% and 42%, respectively, paid with a credit card. 
Suppose we select a customer at random. 


a. Draw a tree diagram to model this chance process. 
b. Find the probability that the customer paid with a credit card. 


c. Suppose the chosen customer paid with a credit card. What’s the probability that the 
customer bought premium gas? 


Media usage and good grades The Kaiser Family Foundation released a study about the 
influence of media in the lives of young people aged 8-18.22 In the study, 17% of the 
youth were classified as light media users, 62% were classified as moderate media users, 
and 21% were classified as heavy media users. Of the light users who responded, 74% 
described their grades as good (A’s and B’s), while only 68% of the moderate users and 
52% of the heavy users described their grades as good. Suppose that we select one young 
person from the study at random. 
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a. Draw a tree diagram to model this chance process. 
b. Find the probability that this person describes his or her grades as good. 


c. Suppose the chosen person describes his or her grades as good. What’s the probability 


that he or she is a heavy user of media? 


342 First serve Tennis great Andy Murray made 60% of his first serves in a 
pg. ey 


recent season. When Murray made his first serve, he won 76% of the points. When 
Murray missed his first serve and had to serve again, he won only 54% of the points.2! 
Suppose you randomly choose a point on which Murray served. You get distracted before 
seeing his first serve but look up in time to see Murray win the point. What’s the 


probability that he missed his first serve? 


Lactose intolerance Lactose intolerance causes difficulty in digesting dairy products that 
contain lactose (milk sugar). It is particularly common among people of African and Asian 
ancestry. In the United States (not including other groups and people who consider 
themselves to belong to more than one race), 82% of the population is White, 14% is 
Black, and 4% is Asian. Moreover, 15% of Whites, 70% of Blacks, and 90% of Asians are 
lactose intolerant.22 Suppose we select a U.S. person at random and find that the person is 
lactose intolerant. What’s the probability that she or he is Asian? 


HIV testing Enzyme immunoassay (EIA) tests are used to screen blood specimens for the 
presence of antibodies to HIV, the virus that causes AIDS. Antibodies indicate the 
presence of the virus. The test is quite accurate but is not always correct. A false positive 
occurs when the test gives a positive result but no HIV antibodies are actually present in 
the blood. A false negative occurs when the test gives a negative result but HIV antibodies 
are present in the blood. Here are approximate probabilities of positive and negative EIA 
outcomes when the blood tested does and does not actually contain antibodies to HIV: 22 


Test result 


Antibodies present 
Truth 


Antibodies absent 


Suppose that 1% of a large population carries antibodies to HIV in their blood. Imagine 
choosing a person from this population at random. If the person’s EIA test is positive, 
what’s the probability that the person has the HIV antibody? 


Metal detector A boy uses a homemade metal detector to look for valuable metal objects 
on a beach. The machine isn’t perfect—it beeps for only 98% of the metal objects over 
which it passes, and it beeps for 4% of the nonmetallic objects over which it passes. 
Suppose that 25% of the objects that the machine passes over are metal. Choose an object 
from this beach at random. If the machine beeps when it passes over this object, find the 
probability that the boy has found a metal object. 


87. Fundraising by telephone Tree diagrams can organize problems having more than two 
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stages. The figure shows probabilities for a charity calling potential donors by 
telephone.24 Each person called is either a recent donor, a past donor, or a new prospect. 
At the next stage, the person called either does or does not pledge to contribute, with 
conditional probabilities that depend on the donor class to which the person belongs. 
Finally, those who make a pledge either do or don’t actually make a contribution. Suppose 
we randomly select a person who is called by the charity. 


0.8 —=— Contribute 
0.4 Pledge «< 
Recent aa 
/donor “ ead ~ Not 
0.6 . No pledge 
0.5 0.6 Contribute 
0.3 — Pledge « i 
0.3 » 
d Past oi 0.4 Not 
donor 
Md No pledge 
(0,2 
\ 0.5 —= Contribute 
0.1 Pledge « 
\ New me 0.5 ™ Not 
prospect 
0.9 No pledge 


a. What is the probability that the person contributed to the charity? 


b. Given that the person contributed, find the probability that he or she is a recent donor. 


HIV and confirmation testing Refer to Exercise 85. Many of the positive results from 
EIA tests are false positives. It is therefore common practice to perform a second EIA test 
on another blood sample from a person whose initial specimen tests positive. Assume that 
the false positive and false negative rates remain the same for a person’s second test. Find 
the probability that a person who gets a positive result on both EIA tests has HIV 
antibodies. 


pg 345 \») Merry and bright? A string of Christmas lights contains 20 lights. The 


lights are wired in series so that if any light fails, the whole string will go dark. Each light 
has probability 0.98 of working for a 3-year period. The lights fail independently of each 
other. Find the probability that the string of lights will remain bright for 3 years. 


Get rid of the penny Harris Interactive reported that 29% of all U.S. adults favor 
abolishing the penny. Assuming that responses from different individuals are independent, 
what is the probability of randomly selecting 3 U.S. adults who all say that they favor 
abolishing the penny? 

pg 346 ») Is the package late? A shipping company claims that 90% of its shipments 


arrive on time. Suppose this claim is true. If we take a random sample of 20 shipments 
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made by the company, what’s the probability that at least 1 of them arrives late? 


On a roll Suppose that you roll a fair, six-sided die 10 times. What’s the probability that 
you get at least one 6? 


pg 347 |») Who’s pregnant? According to the Current Population Survey (CPS), 27% 


of U.S. females are older than 55. The Centers for Disease Control and Prevention (CDC) 
report that 6% of all U.S. females are pregnant. Suppose that these results are accurate. If 
we randomly select a U.S. female, is P(pregnant and over 55) = (0.06)(0.27) = 0.0162? 
Why or why not? 


Late flights An airline reports that 85% of its flights arrive on time. To find the 
probability that a random sample of 4 of this airline’s flights into LaGuardia Airport in 
New York City on the same night all arrive on time, can we multiply (0.85)(0.85)(0.85) 
(0.85)? Why or why not? 


Fire or medical? Many fire stations handle more emergency calls for medical help than 
for fires. At one fire station, 81% of incoming calls are for medical help. Suppose we 
choose 4 incoming calls to the station at random. 


a. Find the probability that all 4 calls are for medical help. 
b. What’s the probability that at least 1 of the calls is not for medical help? 


c. Explain why the calculation in part (a) may not be valid if we choose 4 consecutive 
calls to the station. 


Broken links Internet sites often vanish or move so that references to them can’t be 
followed. In fact, 87% of Internet sites referred to in major scientific journals still work 
within two years of publication.22 Suppose we randomly select 7 Internet references from 
scientific journals. 


a. Find the probability that all 7 references still work two years later. 
b. What’s the probability that at least 1 of them doesn’t work two years later? 


c. Explain why the calculation in part (a) may not be valid if we choose 7 Internet 
references from one issue of the same journal. 


Mutually exclusive versus independent The two-way table summarizes data on the 
gender and eye color of students in a college statistics class. Imagine choosing a student 
from the class at random. Define event A: student is male, and event B: student has blue 


eyes.28 
Gender 
Male Female Total 
Blue 10 
Eye color 
Brown 40 
Total 50 


a. Copy and complete the two-way table so that events A and B are mutually exclusive. 
b. Copy and complete the two-way table so that events A and B are independent. 


c. Copy and complete the two-way table so that events A and B are not mutually 
exclusive and not independent. 


98. Independence and association The two-way table summarizes data from an experiment 
comparing the effectiveness of three different diets (A, B, and C) on weight loss. 
Researchers randomly assigned 300 volunteer subjects to the three diets. The response 
variable was whether each subject lost weight over a 1-year period. 


Diet 
A B C Total 
; Yes 180 
Lost weight? 
No 120 
Total 90 100 110 300 


a. Suppose we randomly select one of the subjects from the experiment. Show that the 
events “Diet B” and “Lost weight” are independent. 


b. Copy and complete the table so that there is no association between type of diet and 
whether a subject lost weight. 


c. Copy and complete the table so that there is an association between type of diet and 
whether a subject lost weight. 


99. Checking independence Suppose A and B are two events such that P(A) = 0.3, P(B) = 
0.4, and P(A n B) = 0.12. Are events A and B independent? Justify your answer. 


100. Checking independence Suppose C and D are two events such that 
P(C) = 0.6, P(D) = 0.45, and P(C n D) = 0.3. Are events C and D independent? Justify 
your answer. 


101. The geometric distributions You are tossing a pair of fair, six-sided dice in a board 
game. Tosses are independent. You land in a danger zone that requires you to roll 
doubles (both faces showing the same number of spots) before you are allowed to play 
again. 


a. What is the probability of rolling doubles on a single toss of the dice? 


b. What is the probability that you do not roll doubles on the first toss, but you do on the 
second toss? 


c. What is the probability that the first two tosses are not doubles and the third toss is 
doubles? This is the probability that the first doubles occurs on the third toss. 


d. Do you see the pattern? What is the probability that the first doubles occurs on the kth 
toss? 


102. Matching suits A standard deck of playing cards consists of 52 cards with 13 cards in 
each of four suits: spades, diamonds, clubs, and hearts. Suppose you shuffle the deck 
thoroughly and deal 5 cards face-up onto a table. 


a. What is the probability of dealing five spades in a row? 


b. Find the probability that all 5 cards on the table have the same suit. 
Multiple Choice Select the best answer for Exercises 103-106. 


03. An athlete suspected of using steroids is given two tests that operate independently of 
each other. Test A has probability 0.9 of being positive if steroids have been used. Test B 
has probability 0.8 of being positive if steroids have been used. What is the probability 
that neither test is positive if the athlete has used steroids? 


a. 0.08 
b. 0.28 
c. 0.02 
d. 0.38 


e. 0.72 

04. In an effort to find the source of an outbreak of food poisoning at a conference, a team of 
medical detectives carried out a study. They examined all 50 people who had food 
poisoning and a random sample of 200 people attending the conference who didn’t get 
food poisoning. The detectives found that 40% of the people with food poisoning went to 
a cocktail party on the second night of the conference, while only 10% of the people in 
the random sample attended the same party. Which of the following statements is 
appropriate for describing the 40% of people who went to the party? (Let F = got food 
poisoning and A = attended party.) 


a. P(F|A) = 0.40 
b. P(A|E°) = 0.40 
c. P(F|A®) = 0.40 
d. P(AC\EF) = 0.40 
e. P(A|F) = 0.40 
05. Suppose a loaded die has the following probability model: 


Outcome 1 2 3 4 5 6 
Probability 0.3 0.1 0.1 O21 0.1 0.3 


If this die is thrown and the top face shows an odd number, what is the probability that 


the die shows a 1? 
a. 0.10 
b. 0.17 
c. 0.30 
d. 0.50 


e. 0.60 
06. If P(A) = 0.24, P(B) = 0.52, and A and B are independent events, what is P(A or B)? 


a. 0.1248 
b. 0.28 
c. 0.6352 
d. 0.76 


e. The answer cannot be determined from the information given. 


Recycle and Review 


07. BMI (2.2, 5.2, 5.3) Your body mass index (BMI) is your weight in kilograms divided by 
the square of your height in meters. Online BMI calculators allow you to enter weight in 
pounds and height in inches. High BMI is a common but controversial indicator of being 
overweight or obese. A study by the National Center for Health Statistics found that the 
BMI of American young women (ages 20 to 29) is approximately Normally distributed 


with mean 26.8 and standard deviation 7.4.22 


a. People with BMI less than 18.5 are often classed as “underweight.” What percent of 
young women are underweight by this criterion? 


b. Suppose we select two American young women in this age group at random. Find the 
probability that at least one of them is classified as underweight. 

08. Snappy dressers (4.2, 4.3) Matt and Diego suspect that people are more likely to agree to 
participate in a survey if the interviewers are dressed up. To test this idea, they went to 
the local grocery store to survey customers on two consecutive Saturday mornings at 10 
A.M. On the first Saturday, they wore casual clothing (tank tops and jeans). On the 
second Saturday, they dressed in button-down shirts and nicer slacks. Each day, they 
asked every fifth person who walked into the store to participate in a survey. Their 
response variable was whether or not the person agreed to participate. Here are their 
results: 


Clothing 
Casual Nice 
Agreed 27 
Declined 36 23 


Participation 


a. Calculate the difference (Casual — Nice) in the proportion of subjects that agreed to 
participate in the survey in the two groups. 


b. Assume the study design is equivalent to randomly assigning shoppers to the “casual” 
or “nice” groups. A total of 100 trials of a simulation were performed to see what 
differences in proportions would occur due only to chance variation in this random 
assignment. Use the results of the simulation in the following dotplot to determine if 
the difference in proportions from part (a) is statistically significant. Explain your 
reasoning. 
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Simulated difference (Casual — Nice) in proportion who agreed 


c. What flaw in the design of this experiment would prevent Matt and Diego from 
drawing a cause-and-effect conclusion about the impact of an interviewer’s attire on 
nonresponse in a survey? 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


A Statistics teacher has 40 students in his class, 23 females and 17 males. At the beginning 
of class on a Monday, the teacher planned to spend time reviewing an assignment due that 
day. Unknown to the teacher, only 19 of the females and 11 of the males had completed 
the assignment. The teacher plans to randomly select students to do problems from the 
assignment on the whiteboard. 


a. What is the probability that a randomly selected student has completed the 
assignment? 


b. Are the events “selecting a female” and “selecting a student who completed the 
assignment” independent? Justify your answer. 


Suppose that the teacher randomly selects 4 students to do a problem on the whiteboard 
and only 2 of the students had completed the assignment. 


c. Describe how to use a table of random digits to estimate the probability that 2 or 
fewer of the 4 randomly selected students completed the assignment. 


d. Complete three trials of your simulation using the random digits below and use the 
results to estimate the probability described in part (c). 


12975 13258 13048 45144 12321 81940 00360 02428 
96767 35964 23822 96012 94951 65194 50842 Sost2 
37609 59057 66967 83401 60705 02384 90597 93600 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 


7 te 


Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 5 Review 


Section 5.1: Randomness, Probability, and Simulation 


In this section, you learned about the idea of probability. The law of large numbers says that 
when you repeat a chance process many, many times, the relative frequency of an outcome will 
approach a single number. This single number is called the probability of the outcome—how 
often we expect the outcome to occur in a very large number of repetitions of the chance 
process. Be sure to remember the “large” part of the law of large numbers. Although clear 
patterns emerge in a large number of repetitions, we shouldn’t expect such regularity in a small 
number of repetitions. 


Simulation is a powerful tool that we can use to imitate a chance process and estimate a 
probability. To perform a simulation, describe how to use a chance device to imitate one trial of 
the simulation. Tell what you will record at the end of each trial. Then perform many trials, and 
use the results of your simulation to answer the question of interest. If you are using random 
digits to perform your simulation, be sure to consider whether digits can be repeated within 
each trial. 


Section 5.2: Probability Rules 


In this section, you learned that chance behavior can be described by a probability model. 
Probability models have two parts, a list of possible outcomes (the sample space) and a 
probability for each outcome. The probability of each outcome in a probability model must be 
between 0 and 1, and the probabilities of all the outcomes in the sample space must add to 1. 

An event is a collection of possible outcomes from the sample space. The complement rule 
says the probability that an event occurs is 1 minus the probability that the event doesn’t occur. 
In symbols, the complement rule says that P(A) = 1 — P(A). Given two events A and B from 
some chance process, use the general addition rule to find the probability that event A or event 
B occurs: 


P(A or B)=P(AUB)=P(A)+P(B)-P(An B)P(A or B) = P(A U B) = P(A) + P(B) = P(A ‘a B) 
If the events A and B have no outcomes in common, use the addition rule for mutually 
exclusive events: P(A U B) = P(A) + P(B). 
Finally, you learned how to use two-way tables and Venn diagrams to display the sample 
space for a chance process involving two events. Using a two-way table or a Venn diagram is a 


helpful way to organize information and calculate probabilities involving the union (A or B) 
and the intersection (A and B) of two events. 


Section 5.3: Conditional Probability and Independence 


In this section, you learned that a conditional probability describes the probability of an event 


occurring given that another event is known to have already occurred. To calculate the 
probability that event A occurs given that event B has occurred, use the formula 


AIB) = P( ANB) 7 P(A and B) 
P(A|B)=P(AnB)P(B)=P(A and B)P(B) /  -P(B) P(B) 

Two-way tables and tree diagrams are useful ways to organize the information provided in a 
conditional probability problem. Two-way tables are best when the problem describes the 
number or proportion of cases with certain characteristics. Tree diagrams are best when the 
problem provides the conditional probabilities of different events or describes a sequence of 
events. 

Use the general multiplication rule for calculating the probability that event A and event B 
both occur: 


P(A and B)=P(AnB)=P(A)-P(BIA)P (A and B) = P(A B) = P(A) - P(BIA) 


If knowing whether or not event B occurs doesn’t change the probability that event A occurs, 
then events A and B are independent. That is, events A and B are independent if P(A | B) = 
P(A | B®) = P(A). If events A and B are independent, use the multiplication rule for 
independent events to find the probability that events A and B both occur: P(ANB) = P(A) - 
P(B). 
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Chapter 5 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R5.1 |») Butter side down Researchers at Manchester Metropolitan University in England 
determined that if a piece of toast is dropped from a 2.5-foot-high table, the probability 
that it lands butter side down is 0.81. 


a. Explain what this probability means. 


b. Suppose that the researchers dropped 4 pieces of toast, and all of them landed butter 
side down. Does that make it more likely that the next piece of toast will land with the 
butter side up? Explain your answer. 


R5.2 |») Butter side down Refer to the preceding exercise. Maria decides to test this 


probability and drops 10 pieces of toast from a 2.5-foot table. Only 4 of them land butter 
side down. Maria wants to perform a simulation to estimate the probability that 4 or 
fewer pieces of toast out of 10 would land butter side down if the researchers’ 0.81 
probability value is correct. 


a. Describe how you would use a table of random digits to perform the simulation. 


b. Perform 3 trials of the simulation using the random digits given. Copy the digits onto 
your paper and mark directly on or above them so that someone can follow what you 


did. 
29077 14863 61683 47052 62224 51025 
95052 90908 73592 75186 87136 95761 
27102 56027 55892 33063 41842 81868 


c. The dotplot displays the results of 50 simulated trials of dropping 10 pieces of toast. 
Is there convincing evidence that the researchers’ 0.81 probability value is incorrect? 
Explain your answer. 


Simulated number of butter side down 


R5.3 jp) Rock smashes scissors Almost everyone has played the game rock-paper-scissors at 


some point. Two players face each other and, at the count of 3, make a fist (rock), an 
extended hand, palm side down (paper), or a “V” with the index and middle fingers 
(scissors). The winner is determined by these rules: rock smashes scissors; paper covers 
rock; and scissors cut paper. If both players choose the same object, then the game is a 
tie. Suppose that Player 1 and Player 2 are both equally likely to choose rock, paper, or 
scissors. 


a. Give a probability model for this chance process. 
b. Find the probability that Player 1 wins the game on the first throw. 

R5.4 |») What kind of vehicle? Randomly select a new vehicle sold in the United States in a 
certain month.22 The probability distribution for the type of vehicle chosen is given here. 


Vehicle type Passenger car Pickup truck SUV Crossover Minivan 
Probability 0.46 0.15 0.10 ? 0.05 


a. What is the probability that the vehicle is a crossover? How do you know? 
b. Find the probability that the vehicle is not an SUV or a minivan. 


c. Given that the vehicle is not a passenger car, what is the probability that it is a pickup 
truck? 


R5.5 |p) Drive to exercise? The two-way table summarizes the responses of 120 people to a 


survey in which they were asked, “Do you exercise for at least 30 minutes four or more 
times per week?” and “What kind of vehicle do you drive?” 


Car type 
Sedan SUV Truck 
Exercise? nee es 
No 24 


Suppose one person from this sample is randomly selected. 
a. Find the probability that the person drives an SUV. 


b. Find the probability that the person drives a sedan or exercises for at least 30 minutes 
four or more times per week. 


c. Find the probability that the person does not drive a truck, given that she or he 
exercises for at least 30 minutes four or more times per week. 


R5.6 |) Mike’s pizza You work at Mike’s pizza shop. You have the following information 


about the 9 pizzas in the oven: 3 of the 9 have thick crust and 2 of the 3 thick-crust pizzas 
have mushrooms. Of the remaining 6 pizzas, 4 have mushrooms. 


a. Are the events “thick-crust pizza” and “pizza with mushrooms” mutually exclusive? 


Justify your answer. 


b. Are the events “thick-crust pizza” and “pizza with mushrooms” independent? Justify 
your answer. 


c. Suppose you randomly select 2 of the pizzas in the oven. Find the probability that 
both have mushrooms. 


R5.7 |p) Does the new hire use drugs? Many employers require prospective employees to 


take a drug test. A positive result on this test suggests that the prospective employee uses 
illegal drugs. However, not all people who test positive use illegal drugs. The test result 
could be a false positive. A negative test result could be a false negative if the person 
really does use illegal drugs. Suppose that 4% of prospective employees use drugs, and 
that the drug test has a false positive rate of 5%, and a false negative rate of 10%.22 
Imagine choosing a prospective employee at random. 


a. Draw a tree diagram to model this chance process. 
b. Find the probability that the drug test result is positive. 


c. If the prospective employee’s drug test result is positive, find the probability that she 
or he uses illegal drugs. 


R5.8 _p») Lucky penny? Harris Interactive reported that 33% of U.S. adults believe that 


finding and picking up a penny is good luck. Assuming that responses from different 
individuals are independent, what is the probability of randomly selecting 10 U.S. adults 
and finding at least 1 person who believes that finding and picking up a penny is good 
luck? 


Chapter 5 AP® Statistics Practice Test 


Section 


I: Multiple Choice Select the best answer for each question. 


Questions T5.1 to T5.3 refer to the following setting. A group of 125 truck owners were asked 
what brand of truck they owned and whether or not the truck has four-wheel drive. The results 
are summarized in the two-way table below. Suppose we randomly select one of these truck 


owners. 


Brand of 


Four-wheel drive? 


No 

Ford 17 

truck Chevy 18 
Dodge 10 


T5.1 What is the probability that the person owns a Dodge or has four-wheel drive? 


a 
b. 
C. 
d. 


e. 


. 20/80 
20/125 
80/125 
90/125 
110/125 


T5.2 What is the probability that the person owns a Chevy, given that the truck has four- 
wheel drive? 


a. 32/50 
b. 

c 
d. 


e. 


32/80 

32/125 
50/125 
80/125 


T5.3 Which one of the following is true about the events “Owner has a Chevy” and 
“Owner’s truck has four-wheel drive”? 


a. These two events are mutually exclusive and independent. 


b 


C 
d. 


. These two events are mutually exclusive, but not independent. 
These two events are not mutually exclusive, but they are independent. 
These two events are neither mutually exclusive nor independent. 


These two events are mutually exclusive, but we do not have enough information to 
determine if they are independent. 


T5.4 A spinner has three equally sized regions: blue, red, and green. Jonny spins the spinner 
3 times and gets 3 blues in a row. If he spins the spinner 297 more times, how many 
more blues is he most likely to get? 


a. 
b. 


C. 


d. 


e. 


a7 
99 
100 
101 
103 


Questions T5.5 and 15.6 refer to the following setting. Wilt is a fine basketball player, but his 
free-throw shooting could use some work. For the past three seasons, he has made only 56% of 
his free throws. His coach sends him to a summer clinic to work on his shot, and when he 
returns, his coach has him step to the free-throw line and take 50 shots. He makes 34 shots. Is 
this result convincing evidence that Wilt’s free-throw shooting has improved? We want to 
perform a simulation to estimate the probability that a 56% free-throw shooter would make 34 
or more in a sample of 50 shots. 


T5.5 Which of the following is a correct way to perform the simulation? 


a. 


e. 


Let integers from 1 to 34 represent making a free throw and 35 to 50 represent 
missing a free throw. Generate 50 random integers from 1 to 50. Count the number 
of made free throws. Repeat this process many times. 


. Let integers from 1 to 34 represent making a free throw and 35 to 50 represent 


missing a free throw. Generate 50 random integers from 1 to 50 with no repeats 
allowed. Count the number of made free throws. Repeat this process many times. 


Let integers from 1 to 56 represent making a free throw and 57 to 100 represent 
missing a free throw. Generate 50 random integers from 1 to 100. Count the number 
of made free throws. Repeat this process many times. 


Let integers from 1 to 56 represent making a free throw and 57 to 100 represent 
missing a free throw. Generate 50 random integers from 1 to 100 with no repeats 
allowed. Count the number of made free throws. Repeat this process many times. 


None of the above is correct. 


T5.6 The dotplot displays the number of made shots in 100 simulated sets of 50 free throws 
by someone with probability 0.56 of making a free throw. 
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34 36 


Which of the following is an appropriate statement about Wilt’s free-throw shooting 
based on this dotplot? 


a. 


If Wilt were still only a 56% shooter, the probability that he would make at least 34 
of his shots is about 0.03. 


If Wilt were still only a 56% shooter, the probability that he would make at least 34 
of his shots is about 0.97. 


If Wilt is now shooting better than 56%, the probability that he would make at least 
34 of his shots is about 0.03. 


If Wilt is now shooting better than 56%, the probability that he would make at least 
34 of his shots is about 0.97. 


If Wilt were still only a 56% shooter, the probability that he would make at least 34 
of his shots is about 0.01. 


T5.7 The partially complete table that follows shows the distribution of scores on the AP® 
Statistics exam for a class of students. 


Score 1 2 3 4 5 
Probability 0.10 0.20 22? 0.25 0.15 


Select a student from this class at random. If the student earned a score of 3 or higher 


on the AP® Statistics exam, what is the probability that the student scored a 5? 


a. 0.150 
b. 

G, 
d. 


e. 


0.214 
0.300 
0.428 
0.700 


T5.8 In a class, there are 18 girls and 14 boys. If the teacher selects two students at random 
to attend a party with the principal, what is the probability that the two students are the 
same sex? 


a. 
b. 


Cc. 


0.49 
0.50 
0.51 


d. 0.52 
e. 0.53 


T5.9 Suppose that a student is randomly selected from a large high school. The probability 
that the student is a senior is 0.22. The probability that the student has a driver’s license 
is 0.30. If the probability that the student is a senior or has a driver’s license is 0.36, 
what is the probability that the student is a senior and has a driver’s license? 


a. 0.060 
b. 0.066 
c. 0.080 
d. 0.140 
e. 0.160 


T5.10 The security system in a house has two units that set off an alarm when motion is 
detected. Neither one is entirely reliable, but one or both always go off when there is 
motion anywhere in the house. Suppose that for motion in a certain location, the 
probability that detector A goes off and detector B does not go off is 0.25, and the 
probability that detector A does not go off is 0.35. What is the probability that detector 
B goes off? 


a. 0.1 
b. 0.35 
c. 0.4 
d. 0.65 
e. 0.75 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


15.11 The two-way table summarizes data on whether students at a certain high school eat 
regularly in the school cafeteria by grade level. 


Grade 
Total 
Eatin Yes 495 
cafeteria? No 310 
Total 148 209 210 238 805 


a. If you choose a student at random, what is the probability that the student eats 
regularly in the cafeteria and is not a 10th-grader? 


b. If you choose a student at random who eats regularly in the cafeteria, what is the 


probability that the student is a 10th-grader? 


c. Are the events “10th-grader” and “eats regularly in the cafeteria” independent? 
Justify your answer. 


15.12 Three machines—A, B, and C—are used to produce a large quantity of identical parts at 
a factory. Machine A produces 60% of the parts, while Machines B and C produce 30% 
and 10% of the parts, respectively. Historical records indicate that 10% of the parts 
produced by Machine A are defective, compared with 30% for Machine B and 40% for 
Machine C. Suppose we randomly select a part produced at the factory. 


a. Find the probability that the part is defective. 


b. If the part is inspected and found to be defective, what’s the probability that it was 
produced by Machine B? 


T5.13 At Dicey Dave’s Diner, the dinner buffet usually costs $12.99. Once a month, Dave 
sponsors “lucky buffet” night. On that night, each patron can either pay the usual price 
or roll two fair, six-sided dice and pay a number of dollars equal to the product of the 
numbers showing on the two faces. The table shows the sample space of this chance 
process. 


First die 


Second 
die 


a. A customer decides to play Dave’s “lucky buffet” game. Find the probability that the 
customer will pay less than the usual cost of the buffet. 


b. A group of 4 friends comes to Dicey Dave’s Diner to play the “lucky buffet” game. 
Find the probability that all 4 of these friends end up paying less than the usual cost 
of the buffet. 


c. Find the probability that at least 1 of the 4 friends ends up paying more than the 
usual cost of the buffet. 


15.14 Based on previous records, 17% of the vehicles passing through a tollbooth have out-of- 
state plates. A bored tollbooth worker decides to pass the time by counting how many 
vehicles pass through until he sees two with out-of-state plates. We would like to 
perform a simulation to estimate the average number of vehicles it takes to find two 


with out-of-state plates.22 


a. Describe how you would use a table of random digits to perform the simulation. 


b. Perform 3 trials of the simulation using the random digits given here. Copy the digits 
onto your paper and mark directly on or above them so that someone can follow 
what you did. 


41050 92031 06449 05059 59884 31880 
53115 84469 94868 57967 05811 84514 
84177 06757 17613 15582 51506 81435 


Chapter 6 Random Variables 


Stefano Paterna/Alamy 


Chapter 6 Ap® Statistics Practice Test 


INTRODUCTION 


Do you drink bottled water or tap water? According to an online news report, about 75% of 


people drink bottled water regularly. Some people do so because they believe bottled water is 
safer than tap water. (There’s little evidence to support this belief.) Others say they prefer the 
taste of bottled water. Can people really tell the difference? 


ACTIVITY Bottled water versus tap water 


This activity will give you and your classmates a chance to discover whether or not you can 
taste the difference between bottled water and tap water. 


i 


Before class begins, your teacher will prepare numbered stations with cups of water. You 
will be given an index card with a station number on it. 


. Go to the corresponding station. Pick up three cups (labeled A, B, and C) and take them 


back to your seat. 


. Your task is to determine which one of the three cups contains the bottled water. Drink all 


the water in Cup A first, then the water in Cup B, and finally the water in Cup C. Write 
down the letter of the cup that you think held the bottled water. Do not discuss your results 
with any of your classmates yet! 


. While you’re tasting, your teacher will make a chart on the board like this one: 


Station number Bottled water cup? Truth 


. When you are told to do so, go to the board and record your station number and the letter 


of the cup you identified as containing bottled water. 


. Your teacher will now reveal the truth about the cups of drinking water. How many 


students in the class identified the bottled water correctly? What percent of the class is 
this? 


. Let’s assume that no one in your class can distinguish tap water from bottled water. In that 


case, students would just be guessing which cup of water tastes different. If so, what’s the 
probability that an individual student would guess correctly? 


. How many correct identifications would you need to provide convincing evidence that the 


students in your class aren’t just guessing? With your classmates, design and carry out a 
simulation to answer this question. What do you conclude about your class’s ability to 
distinguish tap water from bottled water? 


magnetcreative/E+/Getty Images 


The ABC News program 20/20 set up a blind taste test in which people were asked to rate four 


different brands of bottled water and New York City tap water without knowing which they were 
drinking. Can you guess the result? Tap water came out the clear winner in terms of taste. 


When Mr. Hogarth’s class did the preceding activity, 13 out of 21 students made correct 
identifications. If we assume that the students in his class can’t tell tap water from bottled 
water, then each one is basically guessing, with a 1/3 chance of being correct. So we’d expect 
about one-third of Mr. Hogarth’s 21 students (i.e., about 7 students) to guess correctly. How 
likely is it that 13 or more of the 21 students would guess correctly? To answer this question 
without a simulation, we need a different kind of probability model from the ones we saw in 


Chapter 5. 

Section 6.1 introduces the concept of a random variable, a numerical outcome of some 
chance process (like the 13 students who guessed correctly in Mr. Hogarth’s class). Each 
random variable has a probability distribution that gives us information about the likelihood 
that a specific event happens (like 13 or more correct guesses out of 21) and about what’s 
expected to happen if the chance behavior is repeated many times. Section 6.2 examines the 
effect of transforming and combining random variables on the shape, center, and variability of 
their probability distributions. In Section 6.3, we’ll look at two random variables with 
probability distributions that are used enough to have their own names— binomial and 
geometric. 


SECTION 6.1 Discrete and Continuous Random Variables 


LEARNING TARGETS By the end of the section, you should be able to: 


e Use the probability distribution of a discrete random variable to calculate the 
probability of an event. 


Make a histogram to display the probability distribution of a discrete random 
variable and describe its shape. 


Calculate and interpret the mean (expected value) of a discrete random variable. 


Calculate and interpret the standard deviation of a discrete random variable. 


Use the probability distribution of a continuous random variable (uniform or 
Normal) to calculate the probability of an event. 


A probability model describes the possible outcomes of a chance process and the likelihood 
that those outcomes will occur. For example, suppose you toss a fair coin 3 times. The sample 
space for this chance process is 


HHH HHT HTH THH HTT THI TTH TIT 
HHH HHT HTH THH HTT THT TTH TTT 
Because there are 8 equally likely outcomes, the probability is 1/8 for each possible outcome. 


Define the random variable X = the number of heads obtained in 3 tosses. The value of X 
will vary from one set of tosses to another, but it will always be one of the numbers 0, 1, 2, or 
3. How likely is X to take each of those values? It will be easier to answer this question if we 
group the possible outcomes by the number of heads obtained: 


X=0: TTT X=1: HTT THT TTH X=2: HHT HTH THH X=3: HHH 
A =0:TTT A =1:HTT THT TTH X =2:HHT HTH THH XA = 3: HHH 


We can summarize the probability distribution of X in a table: 


Value 0 il 2 3 
Probability 1/8 3/8 3/8 1/8 


DEFINITION Random variable, Probability distribution 


A random variable takes numerical values that describe the outcomes of a 
chance process. 


The probability distribution of a random variable gives its possible values and 
their probabilities. 


We use capital, italic letters (like X or Y) to designate random variables. There are two main 
types of probability distributions, corresponding to two types of random variables: discrete and 
continuous. 


Discrete Random Variables 


The random variable X in the coin-tossing setting is a discrete random variable. 


DEFINITION Discrete random variable 


A discrete random variable X takes a fixed set of possible values with gaps 
between them. 


We can list the possible values of X = the number of heads in 3 tosses of a coin as 0, 1, 2, 3. 
Note that there are gaps between these values on a number line. For instance, a gap exists 
between X=1X = 1 and X=2X = 2 because X cannot take values such as 1.2 or 1.84. 


The probability distribution of X is 


Value 0 1 2 3 
Probability 1/8 3/8 3/8 1/8 


This probability distribution is valid because all the probabilities are between 0 and 1, and 
their sum is 


1/8-+3/8+3/8+1/8=11/8 + 3/8 + 3/8+1/8=1 


PROBABILITY DISTRIBUTION FOR A DISCRETE RANDOM VARIABLE 


The probability distribution of a discrete random variable X lists the values x; and their 


probabilities pj: 


Value X41 X2 X3 
Probability P41 p2 p3 


For the probability distribution to be valid, the probabilities p; must satisfy two 
requirements: 


1. Every probability p; is a number between 0 and 1, inclusive. 


2. The sum of the probabilities is 1:p1+p2+p3+...=1) ' Pi + P2 + Pst. --= 1, 


We can use the probability distribution of a discrete random variable to find the probability 
of an event. For instance, what’s the probability that we get at least one head in three tosses of 


the coin? In symbols, we want to find P(X= pP (X = 1) We know that 
P(X21)=P(X=1 or X=2 or K=3)o a SE On Sek ee nid) 
Because the events X=1X = 1, X=2X = 2, and X=3X = 3 are mutually exclusive, we can add 


their probabilities to get the answer: 


P(X21)=P(X=1)+P(X=2)+P(X=3)=3/8+3/8+1/8=7/8 
P(X >1) = P(X =1)+ P(X =2)+ P(X =3) 
= 3/8+3/8+1/8=7/8 


Or we could use the complement rule from Chapter 5: 
P(X>1) = 1—P(X<1)=1-P(X=0) 
P(X>1)=1-P(X<1)=1-P(X=0)=1-1/8=7/8 = 1-1/8=7/8 


EXAMPLE | Apgar scores: Babies’ health at birth (>) 
Discrete random variables 


susaro/iStock/Getty Images 


PROBLEM: In 1952, Dr. Virginia Apgar suggested five criteria for measuring a baby’s 
health at birth: skin color, heart rate, muscle tone, breathing, and response when stimulated. 
She developed a 0-1-2 scale to rate a newborn on each of the five criteria. A baby’s Apgar 
score is the sum of the ratings on each of the five scales, which gives a whole-number value 
from 0 to 10. Apgar scores are still used today to evaluate the health of newborns. Although 
this procedure was later named for Dr. Apgar, the acronym APGAR also represents the five 
scales: Appearance, Pulse, Grimace, Activity, and Respiration. 

What Apgar scores are typical? To find out, researchers recorded the Apgar scores of over 
2 million newborn babies in a single year. Imagine selecting a newborn baby at random. 
(That’s our chance process.) Define the random variable X = Apgar score of a randomly 
selected newborn baby. The table gives the probability distribution of X. 


Value x; 0 t Z 3 4 5 6 i 8 9 10 
Probability pj ??? 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


a. Write the event “the baby has an Apgar score of 0” in terms of X. Then find its 
probability. 


b. Doctors decided that Apgar scores of 7 or higher indicate a healthy newborn baby. What’s 
the probability that a randomly selected newborn baby is healthy? 


SOLUTION: 
a. P(X=0)=1-(0.006+0.0074+:*:+0.053)=1-0.999=0.001 
P(X=0) = 1—(0.006 +0.007+--- + 0.053) 
= 1-—0.999 
= 0.001 


Use the complement rule: 


E060) 0c 20) 


P(X >7) = 0.099 + 0.319 + 0. 437 + 0.053 
b. P(X=7)=0.099+0.319+0.437+0.053=0.908 = 0.908 


The probability of choosing a healthy baby is 
P(X 27) =P(X=7)+P(XK=8)+ 


P(X =9) + P(X =10) 


FOR PRACTICE, TRY EXERCISE 1 


Note that the probability of randomly selecting a newborm whose Apgar score is at least 7 
is not the same as the probability that the baby’s Apgar score is greater than 7. The latter 
probability is 

P(X>7)=P(X=8)+P(X=9)+P(X=10)=0.319+0.437+0.053=0.809 
P(X>7) = P(X =8)+ P(X =9)+ P(X =10) 
0.319 + 0.437 + 0.053 
= 0.809 


The outcome X = 7 is included in “at least 7” but is not included in “greater than 7.” ow Be 


sure to consider whether to include the boundary value in your calculations when dealing 
with discrete random variables. 


Analyzing Discrete Random Variables: Describing Shape 


When we analyzed distributions of quantitative data in Chapter 1, we made it a point to discuss 
their shape, center, and variability. We’ll do the same with probability distributions of random 


variables. 


For the discrete random variable X = Apgar score of a randomly selected newborn baby, the 
probability distribution is 


Value x; 0 al 2 3 4 5 6 7 8 9 10 
Probability p; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


We can display the probability distribution with a histogram. Values of the variable go on the 
horizontal axis and probabilities go on the vertical axis. There is one bar in the histogram for 
each value of X. The height of each bar gives the probability for the corresponding value of the 
variable. 


Figure 6.1 shows a histogram of the probability distribution of X. This distribution is 
skewed to the left with a single peak at an Apgar score of 9. 


Probability 


0 1 2 3 4 5 6 7 8 9 10 

X = Apgar score 
FIGURE 6.1 Histogram of the probability distribution for the random variable X = Apgar score of 
a randomly selected newborn baby. 


A probability distribution histogram is really just a relative frequency histogram because 
probabilities are long-run relative frequencies. 


EXAMPLE |Pete’s Jeep Tours (>) 
Displaying a probability distribution 


Judy M. Starnes 


PROBLEM: Pete’s Jeep Tours offers a popular day trip in a tourist area. There must be at 


least 2 passengers for the trip to run, and the vehicle will hold up to 6 passengers. Pete 
charges $150 per passenger. Let C = the total amount of money that Pete collects on a 
randomly selected trip. The probability distribution of C is given in the table. 


Total collected c; 300 450 600 750 900 
Probability p; 0.15 0.25 0.35 0.20 0.05 


Make a histogram of the probability distribution. Describe its shape. 


SOLUTION: 


0.4 
0.3 


02 


Probability 


0.7 


0.0 
0 6 150 300 #50» 600s «#50 =: 900 
C = Money collected (4) 


The graph 1s roughly symmetric and has a single peak at $600. 


Remember: Values of the variable go on the horizontal axis and probabilities go on 


the vertical axis. Don’t forget to properly label and scale each axis! 


FOR PRACTICE, TRY EXERCISE 5 


Notice the use of the label C (collects) for the random variable in the example. Sometimes 
we prefer contextual labels like this to the more generic X and Y. 


CHECK YOUR UNDERSTANDING 
”, Indiana University Bloomington posts the grade distributions for its courses online.2 
Suppose we choose a student at random from a recent semester of this university’s Business 


Statistics course. The student’s grade on a 4-point scale (with A = 4) is a random variable X 
with this probability distribution: 


Value 6) 1 2 3 4 


Probability 0.011 0.032 ??? 0.362 0.457 


1. Write the event “the student got a C” using probability notation. Then find this 
probability. 


2. Explain in words what P(X>3)P (X = 3) means. What is this probability? 
3. Make a histogram of the probability distribution. Describe its shape. 


Measuring Center: The Mean (Expected Value) of a Discrete 
Random Variable 


In Chapter 1, you learned how to summarize the center of a distribution of quantitative data 
with either the median or the mean. For discrete random variables, the mean is typically used to 
summarize the center of a probability distribution. The mean x # of a quantitative data set with 
n observations is 


x =sum of data valuesnumber of data values=x1+x2+:--+xnn=), xin 
sum ofdatavalues = Fy +@Qte+en © > vj 


tS 
number of data values n n 


How do we find the mean of a discrete random variable? 

Consider the random variable C = the total amount of money that Pete collects on a 
randomly selected jeep tour from the previous example. The probability distribution of C is 
given in the table. 


Total collected c; 300 450 600 750 900 
Probability p; 0.15 0.25 0.35 0.20 0.05 


What’s the average amount of money that Pete collects on his jeep tours? 


Imagine a hypothetical 100 trips. Pete will collect $300 on 15 of these trips, $450 on 25 
trips, $600 on 35 trips, $750 on 20 trips, and $900 on 5 trips. Pete’s average amount collected 
for these trips is 
300-15+450-25+600-35+750-20+900-5100=300-15100+450-25100+600-35100+750-20100+900 

300-15+-450-25-+4-600-35-+-750-20+-900-5 
100 
300-15 450-25 600-35 750-20 900-5 
~ 100° 100 ° 100 ' 100 ° 100 
= 300 (0.15) + 450 (0.25) + 600 (0.35) + 750 (0.20) + 900 (0.05) 
= 562.50 


The third line of the calculation is just the values of the random variable C times their 


corresponding probabilities. 


That is, the mean of the discrete random variable C is wC=$562.50He = $562.50 This is 
also known as the expected value of C, denoted by E(C). 


The mean (expected value) of any discrete random variable is found in a similar way. It is 
an average of the possible outcomes, but a weighted average in which each outcome is 
weighted by its probability. 


DEFINITION Mean (expected value) of a discrete random variable 


The mean (expected value) of a discrete random variable is its average value 
over many, many repetitions of the same chance process. 


Suppose that X is a discrete random variable with probability distribution 


Value X1— X2 X3 
Probability Pi ~P2 pz 


To find the mean (expected value) of X, multiply each possible value of X by its 
probability, then add all the products: 


px = E(X) = 2p; + top2 + 3p3 +... 
uX=E(X)=x1p1+x2p2+x3p3+...=¥. xipi = Pi 


Recall that the mean is the balance point of a distribution. For Pete’s distribution of money 
collected on a randomly selected jeep tour, the histogram balances at p~C=562.50HC ~ 562.50 
How do we interpret this value? If we randomly select many, many jeep tours, Pete will make 
about $562.50 per trip, on average. 


0.4 


e 0.3 
= 
— 
B 02 
= 
[—W) 
0.1 
0.0 


0 1500 = 3300. 450 600——s 750—s( 900 
B,. = 562.50 


C = Money collected ($) 


EXAMPLE | Apgar scores: What’s typical? (>) 


Finding and interpreting the mean 


PROBLEM: Earlier, we defined the random variable X to be the Apgar score of a randomly 
selected newborn baby. The table gives the probability distribution of X once again. 


Value x; 0 1 2 3 4 5 6 ii 8 g 10 
Probability p; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


Calculate and interpret the expected value of X. 


SOLUTION: 


E(X)=).X=(0)(0.001)+(1)(0.006)+...+(10)(0.053)=8.128 
E(X) = px = (0)(0.001) + (1)(0.006) +... + (10)(0.053) = 8.128 


Bee ape =z = S- Xip, 


af many, many newborns are randomly selected, their average Apgar score will be about 
8.128. 


FOR PRACTICE, TRY EXERCISE 7 


Notice that the mean Apgar score, 8.128, is not a possible value of the random variable X 
because it is not a whole number between 0 and 10. The non-integer value of the mean 
shouldn’t bother you if you think of the mean (expected value) as a long-run average over 
many repetitions. 


AP® EXAM TIP 


If the mean of a random variable has a non-integer value but you report it as an 
integer, your answer will not get full credit. 


How can we find the median of a discrete random variable? In Chapter 1, we defined the 
median as “the midpoint of a distribution, the number such that about half the observations are 
smaller and about half are larger.” The median of a discrete random variable is the 50th 
percentile of its probability distribution. We can find the median by adding a cumulative 
probability row to the probability distribution table, and then locating the smallest value for 
which the cumulative probability equals or exceeds 0.5. For the distribution of amount of 
money collected on Pete’s Jeep Tours, we see that the median is $600. 


Total collected 300 450 
Probability 0.15 0.25 
Cumulative probability 0.15 0.40 


750 900 
0.20 0.05 
0.95 1.00 


Measuring Variability: The Standard Deviation (and Variance) ofa 
Discrete Random Variable 


With the mean as our measure of center for a discrete random variable, it shouldn’t surprise 
you that we’ll use the standard deviation as our measure of variability. In Chapter 1, we defined 
the standard deviation sx** of a distribution of quantitative data as the typical distance of the 
values in the data set from the mean. To get the standard deviation, we started by “averaging” 


A: — 2 . 2 
the squared deviations from the mean (xi-x y2(z i-Z) to get the sample variance sx2* and 
then took the square root: 


oO 
Sx=(x1-x )2+(x2-x )2+...+(xn-x )2n-1 \ n—1 


We can modify this approach to calculate the standard deviation of a discrete random 


variable X. Start by finding a weighted average of the squared deviations (xi-— ux)2(© i — ux)” 
of the values of the variable X from its mean pX/* . The probability distribution gives the 
appropriate weight for each squared deviation. We call this weighted average the variance of 


X, denoted by oX27%. Then take the square root to get the standard deviation oX?*. 


DEFINITION Standard deviation of a discrete random variable, 
Variance 


The standard deviation of a discrete random variable measures how much the 
values of the variable typically vary from the mean. 


Suppose that X is a discrete random variable with probability distribution 


Value X41 x2 x3 
Probability P1 p2 p3 


and that pW, is the mean of X. The variance of X is 


Var(X)=o0 X2=(x1-pX) 2p 1+ (x 2- UX) 2p 2+ (x 3-pX) 2p3t+...=y (xi-pX)2pi 
Var (X) = 0% = (m1 —px)*pit+ (v2—px) ?p2t+ (x3 — px) *ps + 


= D(a —px)"p 
The standard deviation of X is the square root of the variance: 
oX=0X2 =(x1-pX)2p1+(x2-pX)2p2+(x3-pX)2p3+...=y) (xi-pX)2pi 


= J (ei - px) x) pi 


Let’s return to the random variable C = the total amount of money that Pete collects on a 
randomly selected jeep tour. The left two columns of the following table give the probability 
distribution. Recall that the mean of C is fc = 562.50. The third column of the table shows the 
squared deviation of each value from the mean. The fourth column gives the weighted squared 
deviations. 


Total Probability Squared deviation from the mean Weighted squared deviation 
collected ¢; Pi (Cj-He)? (Cj-He)*P; 

300 0.15 (300 — 562.50)2 (300 — 562.50)2 (0.15) = 
10335.94 

450 0.25 (450 - 562.50)2 (450 — 562.50)2 (0.25) = 3164.06 

600 0.35 (600 — 562.50)2 (600 — 562.50)2 (0.35) = 

492.19 
750 0.20 (750 — 562.50)2 (750 — 562.50)2 (0.20) = 7031.25 
900 0.05 (900 — 562.50)2 (900 — 562.50)2 (0.05) = 5695.31 


Sum = 26,718.75 


Adding the weighted average of the squared deviations in the fourth column gives the 
variance of C: 


Var(C)=0C2=¥ (ci-pC)2pi=(300-562.50)2(0.15)+(450-562.50)2(0.25)+-+:+ 
(900-562.50)2(0.05)=10335.94+3164.06+---+5695.31=26,718.75 (squared dollars) 
Var(C) = if = IC — pc)’ pi 
= (300 — 562.50)? (0.15) + (450 — 562.50)” (0.25) + - + (900 — 562.50)” (0.05) 
= 10335.94 + 3164.06 + --- + 5695.31 
= 26,718.75 (squared dollars) 


The standard deviation of C is the square root of the variance: 
6C=26,718.75=$163.4640 = 40) ABTS = B1ES.46 


The amount of money that Pete collects on a randomly selected trip typically varies from the 
mean of $562.50 by about $163.46. 


EXAMPLE |How much do Apgar scores vary? (>) 
Finding and interpreting the standard deviation 


Juanmonino/Getty Images 


PROBLEM: Earlier, we defined the random variable X to be the Apgar score of a randomly 
selected newborn baby. The table gives the probability distribution of X once again. In the 
last example, we calculated the mean Apgar score of a randomly chosen newborn to be pl, = 


8.128. 


Value x; 0 ef Zz 3 4 ce) 6 a 8 g 10 
Probability p; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


Calculate and interpret the standard deviation of X. 


SOLUTION: 


oX2=(0-8.128)2(0.001)+(1-8.128)2(0.006)+:--+(10-8.128)2(0.053)=2.066 
o2. = (0 — 8.128)" (0.001) + (1 — 8.128)" (0.006) + --- + (10 — 8.128)" (0.053) = 2.066 


oX = 2.066 = 1.4377x = V 2.066 = 1.437 


A randomly selected newborn baby’s Apgar score will typically vary from the mean (8.128) 
by about 1.437 units. 


a). es ie Me: 
Start by calculating the variance oX2=)> (xi-X)2pi?* - XL (ti — Hx) ia 


FOR PRACTICE, TRY EXERCISE 13 


You can use your calculator to graph the probability distribution of a discrete random 
variable and to calculate measures of center and variability, as the following Technology 
Corner illustrates. 


12. Technology Corner | ANALYZING DISCRETE RANDOM 
VARIABLES (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


Let’s explore what the TI-83/84 can do using the random variable X = Apgar score of a 
randomly selected newborn. 


1. Enter the values of the random variable in list L1 and the corresponding probabilities in 
list L2. 


LatcD=O 


2. To graph a histogram of the probability distribution: 
e In the statistics plot menu, define Plot 1 to be a histogram with Xlist: L1 and Freq: L2. 


e Adjust your window settings as follows: Xmin = -1, Xmax = 11, Xscl = 1, Ymin = 
—0.1, Ymax = 0.5, Yscl = 0.1. 


* Press (GRAPH. 
ORMAL FLOAT AUTO REAL RADIAN HP 


Pioti:LasL2 


min=9 
max<ie n=.437 


3. To calculate the mean and standard deviation of the random variable, use one-variable 
Statistics with the values in L1 and the probabilities (relative frequencies) in L2. Press 
STAT), arrow to the CALC menu, and choose 1-Var Stats. 

OS 2.55 or later: In the dialog box, specify List: L1 and FreqList: L2. Then choose 
Calculate. 
Older OS: Execute the command 1-Var Stats L1,L2. 


ORMAL FLOAT AUTO REAL RADIAN MP fl 


x=8.128 
=x=8.128 
=x?=68.13 
Sx= 
ox=1.437225104 
n=1 
minx=0 

4Q1=8 


Note: If you leave Freq: L2 and try to calculate summary statistics for a quantitative data set 
that does not include frequencies, you will likely get an error message. Be sure to clear Freq 
when you are done with calculations to avoid this issue. 


AP® EXAM TIP 


If you are asked to calculate the mean or standard deviation of a discrete random 
variable on a free response question, you must show numerical values substituted 


into the appropriate formula, as in the previous two examples. Feel free to use 
ellipses (...) if there are many terms in the summation, as we did. You may then use 
the method described in Technology Corner 12 to perform the calculation with 1-Var 
Stats. Writing only 1-Var Stats L1, L2 and then giving the correct values of the mean 
and standard deviation will not earn credit for showing work. 


CHECK YOUR UNDERSTANDING 


i. 


,, Indiana University Bloomington posts the grade distributions for its courses online.2 
Suppose we choose a student at random from a recent semester of this university’s Business 
Statistics course. The student’s grade on a 4-point scale (with A = 4) is a random variable X 
with this probability distribution: 


Value 0 1 2 3 4 
Probability 0.011 0.032 0.138 0.362 0.457 


1. Calculate and interpret the mean of X. 


2. Calculate and interpret the standard deviation of X. 


Continuous Random Variables 


When we use Table D of random digits to select an integer from 0 to 9, the result is a discrete 
random variable (call it X). The probability distribution assigns probability 1/10 to each of the 
10 equally likely values of X. 


Suppose we want to choose a number at random between 0 and 9, allowing any number 
between 0 and 9 as the outcome (like 0.84522 or 7.1111119). Calculator and computer random 
number generators will do this. The sample space of this chance process is the entire interval of 
values between 0 and 9 on the number line. If we define Y = randomly generated number 
between 0 and 9, then Y is a continuous random variable. 


DEFINITION Continuous random variable 


A continuous random variable can take any value in an interval on the number 
line. 


Most discrete random variables result from counting something, like the number of siblings 
that a randomly selected student has. Continuous random variables typically result from 
measuring something, like the height of a randomly selected student or the time it takes that 
student to run a mile. 


How can we find the probability P(3<Y<7)P (3 < ¥ <7) that the random number generator 
produces a value between 3 and 7? As in the case of selecting a random digit, we would like all 
possible outcomes to be equally likely. But we cannot assign probabilities to each individual 
value of Y and then add them, because there are infinitely many possible values. 

The probability distribution of a continuous random variable is described by a density 
curve. Recall from Chapter 2 that any density curve has area exactly 1 underneath it, 
corresponding to a total probability of 1. We use areas under the density curve to assign 
probabilities to events. 

For the continuous random variable Y = randomly generated number between 0 and 9, its 
probability distribution is a uniform density curve with constant height 1/9 on the interval from 
0 to 9. Note that this probability distribution is valid because the total area under the density 
curve is 


Area=basexheight=9x 1/9-1ATea = base x height = 9 x 1/9-1 


Figure 6.2 shows the probability distribution of Y with the area of interest shaded. The area 
under the density curve between 3 and 7 is 


Area=basexheight=4x1/9= 4jgAtea = base x height = 4 x 1/9 = 4/9 
So P(3<= Y <& 7) = 4/9 = 0.444. 


am, 


912s #46 6 7 8 8 
Y = Randomly generated number 
FIGURE 6.2 The probability distribution of the continuous random variable Y = randomly 
generated number between 0 and 9. The shaded area represents P(3 < Y < 7). 


HOW TO FIND PROBABILITIES FOR A CONTINUOUS RANDOM 
VARIABLE 


The probability of any event involving a continuous random variable is the area under the 
density curve and directly above the values on the horizontal axis that make up the event. 


Consider a specific outcome from the random number generator setting, such as P(Y = 7). 
The probability of this event is the area under the density curve that’s above the point 7.0000 
... on the horizontal axis. But this vertical line segment has no width, so the area is 0. In fact, 
all continuous probability distributions assign probability 0 to every individual outcome. For 
that reason, 


P(3<Y<7)=P(3<Y<7)=P(3<Y<7)=P(3<Y<7)=0.444 
P(3<Y<7)=P(3<Y<7)=P(3<Y<7)=P(3<Y <7) =0.444 


Remember: The probability distribution for a continuous random variable assigns probabilities 
to intervals of outcomes rather than to individual outcomes. 


EXAMPLE | Will it be quicker to walk to work? |») 
Continuous random variables 


PROBLEM: Selena works at a bookstore in the Denver International Airport. She takes the 
airport train from the main terminal to get to work each day. The airport just opened a new 
walkway that would allow Selena to get from the main terminal to the bookstore in 4 
minutes. She wonders if it will be faster to walk or take the train to work. Let Y = Selena’s 
journey time to work (in minutes) by train on a randomly selected day. The probability 
distribution of Y can be modeled by a uniform density curve on the interval from 2 to 5 
minutes. Find the probability that it will be quicker for Selena to take the train than to walk 
that day. 


SOLUTION: 


| Density curve } 


aINN 


20 25 3.0 
Y = Journey time (min) 


I ii T T 
Ss 4.0 4.5 £0 


Start by drawing the density curve with the area of interest shaded. The height of the 


curve needs to be 1/3 so that 
Area = base x height = 3 x 1/3 = 1 


Shaded area = base = height = 2 x 1/3 = 2/3P(Y < 4) = 2/3 = 0.667 
Shaded area = base x height =2 x 1/3=2/3 


P(Y < 4) = 2/3 = 0.667 
There is a 66. 7% chance that it will be quicker ‘for Selena to take the train to work on a 


randomly selected day. 
FOR PRACTICE, TRY EXERCISE 23 


The density curves that are most familiar to us are the Normal distributions from Chapter 2. 


Normal curves can be probability distributions as well as models for data. 


EXAMPLE | Young women’s heights (>) 
Normal probability distributions 


Alamy Images 


PROBLEM: The heights of young women can be modeled by a Normal distribution with 
mean p! = 64 inches and standard deviation o=2.7¢ = 2.7 inches. Suppose we choose a young 


woman at random and let Y = her height (in inches). Find P(68<Y<70)? (68 < ¥ < 70). 


Interpret this value. 


SOLUTION: 


\ | 
55.9 58.6 61.3 64 66.7 62 69.4 70 74.1 


Y = Young woman's height Cin.) 


1. Draw a Normal distribution. 
2. Perform calculations—show your work! 


(i) Standardize and use Table A or technology; or 
(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


_ 68-64 
eee a 
= DS =2.22 


Z = 70-642.7 = DOOM 2.7 
Using Table A: 0.9868 — 0.9306 = 0.0562 
Using technology: normalcdf(lower:1.48, upper:2.22, mean:o, SD:1) = 0.0562 


P(68 < Ys 70) =P (1.48 < Z < 2.22) 


it. normalcdf{(lower:68, upper:70, mean:64, S‘D:2. 7) = OHO oul 
The probability that a randomly selected young woman has a height between 68 and 70 
inches is about 0.06. 


FOR PRACTICE, TRY EXERCISE 27 


AP® EXAM TIP 


Students often do not get full credit on the AP® Statistics exam because they only 
use option (ii) with “calculator-speak” to show their work on Normal calculation 


questions— for example, normalcdf(68,70,64,2.7). This is not considered clear 
communication. To get full credit, follow the two-step process above, making sure to 
carefully label each of the inputs in the calculator command if you use technology in 
Step 2: normalcdf(lower:68, upper:70, mean:64, SD: 2.7). 


The calculation in the preceding example is the same as those we did in Chapter 2. Only the 
language of probability is new. 


What about the mean and standard deviation for continuous random variables? The 
probability distribution of a continuous random variable X is described by a density curve. 
Chapter 2 showed us how to find the mean of the distribution: it is the point at which the area 
under the density curve would balance if it were made out of solid material. The mean lies at 
the center of symmetric density curves such as Normal curves. We can locate the standard 
deviation of a Normal distribution from its inflection points. Exact calculation of the mean and 


standard deviation for most continuous random variables requires advanced mathematics.2 


Section 6.1 Summary 


e A random variable takes numerical values determined by the outcome of a chance process. 
The probability distribution of a random variable gives its possible values and their 
probabilities. There are two types of random variables: discrete and continuous. 


e A discrete random variable has a fixed set of possible values with gaps between them. 


: A valid probability distribution assigns each of these values a probability between 0 and 1 
such that the sum of all the probabilities is exactly 1. 


« The probability of any event is the sum of the probabilities of all the values that make up 
the event. 


« We can display the probability distribution as a histogram, with the values of the variable 
on the horizontal axis and the probabilities on the vertical axis. 


e A continuous random variable can take any value in an interval on the number line. 


« A valid probability distribution for a continuous random variable is described by a density 
curve with area 1 underneath. 


« The probability of any event is the area under the density curve directly above the values 
on the horizontal axis that make up the event. 


e We can describe the shape of a probability distribution histogram or density curve in the 
same way as we did a distribution of quantitative data—by identifying symmetry or 
skewness and any major peaks. 


e Use the mean to summarize the center of a probability distribution. The mean of a random 
variable iy is the balance point of the probability distribution histogram or density curve. 


« The mean is the long-run average value of the variable after many repetitions of the chance 
process. It is also known as the expected value of the random variable, E(X). 


= If X is a discrete random variable, the mean is the average of the values of X, each weighted 
by its probability: 
uX=E(X)=y. xipi=x1p1+x2p2+x3p3+---4* ~ E(X) = > ai = iP aba + S33 1° 


e Use the standard deviation to summarize the variability of a probability distribution. The 
standard deviation of a random variable oX°?* measures how much the values of the 
variable typically vary from the mean. 


= If X is a discrete random variable, the variance of X is the “average” squared deviation of 
the values of the variable from their mean: 


OX2=)' (xi-pX)2pi=(K1-pX)2p1+(x2—-pX)2p2+(x3-pX)2p3+... 
ox = } (x; — px)’ pi = (e1 — wx)’pi + (2 — wx)’ p2 + (es — px)" p3 +... 


The standard deviation of X is the square root of the variance. 


6.1 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


12. Analyzing discrete random variables Page 370 


Section 6.1 Exercises 


1. pg 363 |») Kids and toys In an experiment on the behavior of young children, each 


subject is placed in an area with five toys. Past experiments have shown that the 
probability distribution of the number X of toys played with by a randomly selected 
subject is as follows: 


Number of toys x; 0 1 2 3 4 5 
Probability p; 0.03 0.16 0.30 0.23 0.17 22? 


a. Write the event “child plays with 5 toys” in terms of X. Then find its probability. 


b. What’s the probability that a randomly selected subject plays with at most 3 toys? 


2. Spell-checking Spell-checking software catches “nonword errors,” which result in a string 
of letters that is not a word, as when “the” is typed as “teh.” When undergraduates are 
asked to write a 250-word essay (without spell-checking), the number Y of nonword errors 


in a randomly selected essay has the following probability distribution. 


Value y; 0 1 2 3 4 
Probability p; 0.1 22? 0.3 0.3 0.1 


a. Write the event “one nonword error” in terms of Y. Then find its probability. 


b. What’s the probability that a randomly selected essay has at least two nonword errors? 


Get on the boat! A small ferry runs every half hour from one side of a large river to 
the other. The probability distribution for the random variable Y = money collected 
(in dollars) on a randomly selected ferry trip is shown here. 


Money collected 0 5 10 15 20 25 
Probability 0.02 0.05 0.08 0.16 0.27 0.42 


a. Find P(X>20)P(X > 20). Interpret this result. 
p 


b. Express the event “at least $20 is collected” in terms of Y. What is the probability of 
this event? 


Skee Ball Ana is a dedicated Skee Ball player (see photo) who always rolls for the 50- 
point slot. The probability distribution of Ana’s score X on a randomly selected roll of the 
ball is shown here. 


Score 10 20 30 40 50 
Probability 0.32 0.27 0.19 0.15 0.07 
a. Find P(y<20)P(¥ < 29). Interpret this result. 
p 


b. Express the event “Anna scores at most 20” in terms of X. What is the probability of 
this event? 


Stan Rohrer/Getty Images 


pg 365 Get on the boat! Refer to Exercise 3. Make a histogram of the probability 
BZ 


distribution. Describe its shape. 


Skee Ball Refer to Exercise 4. Make a histogram of the probability distribution. Describe 
its shape. 


pg 367 |») Get on the boat! Refer to Exercise 3. Find the mean of Y. Interpret this 


value. 
Money collected 0 5 10 15 20 25 
Probability 0.02 0.05 0.08 0.16 0.27 0.42 


Skee Ball Refer to Exercise 4. Find the mean of X. Interpret this value. 


Score 10 20 30 40 50 
Probability OF 0.27 0.19 0.15 0.07 


Benford’s law Faked numbers in tax returns, invoices, or expense account claims often 
display patterns that aren’t present in legitimate records. Some patterns, like too many 
round numbers, are obvious and easily avoided by a clever crook. Others are more subtle. 
It is a striking fact that the first digits of numbers in legitimate records often follow a 
model known as Benford’s law.‘ Call the first digit of a randomly chosen legitimate 
record X for short. The probability distribution for X is shown here (note that a first digit 
cannot be 0). 


First digit x; uD 2 3 4 5 6 7 8 9 
Probability p; 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 
a. A histogram of the probability distribution is shown. Describe its shape. 
b. Calculate and interpret the expected value of X. 


0.30 - 
0.25 


= 0.20 - 
3 
2 0.15 
° 
he 
& 0.10 

0.05 

0.00 

012 3 4 5 6 7 8 9 
X = First digit 


. Working out Choose a person aged 19 to 25 years at random and ask, “In the past seven 
days, how many times did you go to an exercise or fitness center or work out?” Call the 
response Y for short. Based on a large sample survey, here is the probability distribution of 


y2 
Days y; 0 1 2 3 4 5 6 7 
Probability p; 0.68 0.05 0.07 0.08 0.05 0.04 0.01 0.02 


a. A histogram of the probability distribution is shown. Describe its shape. 


b. Calculate and interpret the expected value of Y. 


0.7 4 
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O42 3 4 5 6 
Y = Number of days working out 
11. pg 369 |») Get on the boat! A small ferry runs every half hour from one side of a large 


river to the other. The probability distribution for the random variable Y = money collected 
on a randomly selected ferry trip is shown here. From Exercise 7, pY=$19.35.4Y =$19.35. 


Money collected 0 5 10 15 20 25 
Probability 0.02 0.05 0.08 0.16 0.27 0.42 


a. Find the median of Y. 


b. Compare the mean and median. Explain why this relationship makes sense based on the 
probability distribution. 


12. Skee Ball Ana is a dedicated Skee Ball player (see photo in Exercise 4) who always rolls 
for the 50-point slot. The probability distribution of Ana’s score X on a randomly selected 
roll of the ball is shown here. From Exercise 8, uX=23.84X = 23.8 


Score 10 20 30 40 50 
Probability 0.32 O27 0.19 0.15 0.07 


a. Find the median of X. 


b. Compare the mean and median. Explain why this relationship makes sense based on the 
probability distribution. 

13. pg 369 |») Get on the boat! A small ferry runs every half hour from one side of a large 

river to the other. The probability distribution for the random variable Y = money collected 


on a randomly selected ferry trip is shown here. From Exercise 7, pY=$19.35.4Y =$19.35. 
Calculate and interpret the standard deviation of Y. 


Money collected 0 5 10 15 20 25 
Probability 0.02 0.05 0.08 0.16 0.27 0.42 


14. Skee Ball Ana is a dedicated Skee Ball player (see photo in Exercise 4) who always rolls 


15. 


16. 


17. 


for the 50-point slot. The probability distribution of Ana’s score X on a randomly selected 
roll of the ball is shown here. From Exercise 8, pX=23.84X = 23-8. Calculate and 
interpret the standard deviation of X. 


Score 10 20 30 40 50 
Probability 0.32 0.27 0.19 0.15 0.07 


Benford’s law Exercise 9 described how the first digits of numbers in legitimate records 
often follow a model known as Benford’s law. Call the first digit of a randomly chosen 
legitimate record X for short. The probability distribution for X is shown here (note that a 
first digit can’t be 0). From Exercise 9, E(X) = 3.441. Find the standard deviation of X. 
Interpret this value. 


First digit x; 1 Z 3 4 5 6 7 8 9 
Probability p; 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 


Working out Exercise 10 described a large sample survey that asked a sample of people 
aged 19 to 25 years, “In the past seven days, how many times did you go to an exercise or 
fitness center or work out?” The response Y for a randomly selected survey respondent has 
the probability distribution shown here. From Exercise 10, E(Y) = 1.03. Find the standard 
deviation of Y. Interpret this value. 


Days y; 0 1 Z 3 4 5 6 7 
Probability p; 0.68 0.05 0.07 0.08 0.05 0.04 0.01 0.02 


Life insurance A life insurance company sells a term insurance policy to 21-year-old 
males that pays $100,000 if the insured dies within the next 5 years. The probability that a 
randomly chosen male will die each year can be found in mortality tables. The company 
collects a premium of $250 each year as payment for the insurance. The amount Y that the 
company earns on a randomly selected policy of this type is $250 per year, less the 
$100,000 that it must pay if the insured dies. Here is the probability distribution of Y: 


Age at death 21 22 23 
Profit y; -$99,750 -$99,500 -$99,250 
Probability p; 0.00183 0.00186 0.00189 
Age at death 24 25 26 or more 
Profit y; -$99,000 -$98,750 $1250 
Probability p; 0.00191 0.00193 0.99058 


a. Explain why the company suffers a loss of $98,750 on such a policy if a client dies at 


age 25. 


b. Calculate the expected value of Y. Explain what this result means for the insurance 


company. 


18. 


19. 


c. Calculate the standard deviation of Y. Explain what this result means for the insurance 
company. 

Fire insurance Suppose a homeowner spends $300 for a home insurance policy that will 

pay out $200,000 if the home is destroyed by fire in a given year. Let P = the profit made 

by the company on a single policy. From previous data, the probability that a home in this 

area will be destroyed by fire is 0.0002. 


a. Make a table that shows the probability distribution of P. 


b. Calculate the expected value of P. Explain what this result means for the insurance 
company. 


c. Calculate the standard deviation of P. Explain what this result means for the insurance 
company. 


Size of American households In government data, a household consists of all occupants 
of a dwelling unit, while a family consists of two or more persons who live together and 
are related by blood or marriage. So all families form households, but some households 
are not families. Here are the distributions of household size and family size in the United 
States: 


Number of oT 


7 
Family probability 0.42 0.23 0.21 0.09 0.03 0.02 


Let H = the number of people in a randomly selected U.S. household and F = the number 
of people in a randomly chosen U.S. family. 


a. Here are histograms comparing the probability distributions of H and F. Describe any 
differences that you observe. 


b. Find the expected value of each random variable. Explain why this difference makes 
sense. 


c. The standard deviations of the two random variables are oH=1.4217H = 1-421 and 
oF=1.2497F = 1.249. Explain why this difference makes sense. 


Probability 
Probability 


0 12 3 4 5 6 7 0 12 3 4 5 6 7 
H = Number of persons F = Number of persons 
in household in family 


20. Housing in San José How do rented housing units differ from units occupied by their 
owners? Here are the distributions of the number of rooms for owner-occupied units and 
renter-occupied units in San José, California:® 


Number of rooms 
10 


1 2 3 4 5 6 7 8 9 


Rented | 0.008 | 0.027 | 0.287 | 0.363 | 0.164 | 0.093 | 0.039 | 0.013 | 0.003 | 0.003 


Let X = the number of rooms in a randomly selected owner-occupied unit and Y = the 
number of rooms in a randomly chosen renter-occupied unit. 


a. Here are histograms comparing the probability distributions of X and Y. Describe any 
differences you observe. 


b. Find the expected number of rooms for both types of housing unit. Explain why this 
difference makes sense. 


c. The standard deviations of the two random variables are oX=1.6407x = 1-640 and 
oY=1.3087Y = 1.308. Explain why this difference makes sense. 


21. 


Probability 


Probability 


0.0 
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X = Number of rooms in 
owner-occupied units 
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Y = Number of rooms in 
renter-occupied units 


Exercises 21 and 22 examine how Benford’s law (Exercise 9) can be used to detect fraud. 


Benford’s law and fraud A not-so-clever employee decided to fake his monthly expense 
report. He believed that the first digits of his expense amounts should be equally likely to 
be any of the numbers from 1 to 9. In that case, the first digit Y of a randomly selected 
expense amount would have the probability distribution shown in the histogram. 


a. What’s P(y<6)P (Y <6) According to Benford’s law (see Exercise 9), what 
proportion of first digits in the employee’s expense amounts should be greater than 6? 
How could this information be used to detect a fake expense report? 


figure. 


. Explain why the mean of the random variable Y is located at the solid red line in the 


According to Benford’s law, the expected value of the first digit is pX=3.441 


bx = 3.441. Explain how this information could be used to detect a fake expense 


report. 


Probability for 
each value = 1/9 


Outcomes 


22. 


23. 


24. 


2m 


Benford’s law and fraud 


a. Using the graph from Exercise 21, calculate the standard deviation oY? . This gives us 
an idea of how much variation we’d expect in the employee’s expense records if he 
assumed that first digits from 1 to 9 were equally likely. 


b. The standard deviation of the first digits of randomly selected expense amounts that 


follow Benford’s law is oX=2.467x = 2-46, Would using standard deviations be a 
good way to detect fraud? Explain your answer. 


pg 372 |») Still waiting for the server? How does your web browser get a file from 


the Internet? Your computer sends a request for the file to a web server, and the web 
server sends back a response. Let Y = the amount of time (in seconds) after the start of an 
hour at which a randomly selected request is received by a particular web server. The 
probability distribution of Y can be modeled by a uniform density curve on the interval 
from 0 to 3600 seconds. Find the probability that the request is received by this server 
within the first 5 minutes (300 seconds) after the hour. 


Where’s the bus? Sally takes the same bus to work every morning. Let X = the amount of 
time (in minutes) that she has to wait for the bus on a randomly selected day. The 
probability distribution of X can be modeled by a uniform density curve on the interval 
from 0 minutes to 8 minutes. Find the probability that Sally has to wait between 2 and 5 
minutes for the bus. 


Class is over! Mr. Shrager does not always let his statistics class out on time. In fact, he 
seems to end class according to his own “internal clock.” The density curve here models 
the distribution of Y, the amount of time after class ends (in minutes) when Mr. Shrager 
dismisses the class on a randomly selected day. (A negative value indicates he ended class 
early.) 

a. Find and interpret P(-1<Y<1)P (-1sY<1) 

b. What is pY#Y ? Explain your answer. 


c. Find the value of k that makes this statement true: P(Y>k)=0.25? (Y > k) = 0.25. 


—] 1) 4 


Y = Dismissal time (min. after class ends) 


26. Quick, click! An Internet reaction time test asks subjects to click their mouse button as 


soon as a light flashes on the screen. The light is programmed to go on at a randomly 
selected time after the subject clicks “Start.” The density curve models the amount of time 


Y (in seconds) that the subject has to wait for the light to flash. 
a. Find and interpret P(Y>3.75)/(¥ > 3-75), 
b. What is pY#Y ? Explain your answer. 


c. Find the value of k that makes this statement true: P(Y<k)=0.38" (Y < k) = 0.38, 


0 1 2 3 4 § 
Y = Time until light 
flashes (sec) 
27. pQ 373 |») Running a mile A study of 12,000 able-bodied male students at the 


University of Illinois found that their times for the mile run were approximately Normal 
with mean 7.11 minutes and standard deviation 0.74 minute.2 Choose a student at random 


from this group and call his time for the mile Y. Find P(Y<6)P (Y <6). Interpret this 
value. 


28. Give me some sugar! Machines that fill bags with powdered sugar are supposed to 
dispense 32 ounces of powdered sugar into each bag. Let X = the weight (in ounces) of the 
powdered sugar dispensed into a randomly selected bag. Suppose that X can be modeled 
by a Normal distribution with mean 32 ounces and standard deviation 0.6 ounce. Find 
P(X<31)P (X < 31), Interpret this value. 


29. Horse pregnancies Bigger animals tend to carry their young longer before birth. The 
length of horse pregnancies from conception to birth varies according to a roughly Normal 
distribution with mean 336 days and standard deviation 6 days. Let X = the length of a 
randomly selected horse pregnancy. 


a. Write the event “pregnancy lasts between 325 and 345 days” in terms of X. Then find 
this probability. 


b. Find the value of c such that P(X>c)=0.20P (X > c) = 0.20. 


30. Ace! Professional tennis player Novak Djokovic hits the ball extremely hard. His first- 
serve speeds follow an approximately Normal distribution with mean 115 miles per hour 
(mph) and standard deviation 6 mph. Choose one of Djokovic’s first serves at random. Let 
Y = its speed, measured in miles per hour. 


a. Write the event “speed is between 100 and 120 miles per hour” in terms of Y. Then find 
this probability. 


b. Find the value of c such that P(Y<c)=0.15P(¥ Sc) = 0.15, 


Multiple Choice Select the best answer for Exercises 31-34. 


Exercises 31—33 refer to the following setting. Choose an American household at random and 
let the random variable X be the number of cars (including SUVs and light trucks) they own. 
Here is the probability distribution if we ignore the few households that own more than 5 cars: 


Number of cars 0 1 2 3 4 5 
Probability 0.09 0.36 0.35 0.13 0.05 0.02 


31. What’s the expected number of cars in a randomly selected American household? 
a. 1.00 
b. 1.75 
c. 1.84 
d. 2.00 
e. 2.50 


32. The standard deviation of X is oX=1.087X = 1-08. If many households were selected at 
random, which of the following would be the best interpretation of the value 1.08? 


a. The mean number of cars would be about 1.08. 

b. The number of cars would typically be about 1.08 from the mean. 

c. The number of cars would be at most 1.08 from the mean. 

d. The number of cars would be within 1.08 from the mean about 68% of the time. 


e. The mean number of cars would be about 1.08 from the expected value. 


33. About what percentage of households have a number of cars within 2 standard deviations 
of the mean? 


a. 68% 
b. 71% 
c. 93% 
d. 95% 
e. 98% 


34. A deck of cards contains 52 cards, of which 4 are aces. You are offered the following 
wager: Draw one card at random from the deck. You win $10 if the card drawn is an ace. 
Otherwise, you lose $1. If you make this wager very many times, what will be the mean 
amount you win? 


a. About —$1, because you will lose most of the time. 


b. About $9, because you win $10 but lose only $1. 
c. About -$0.15; that is, on average, you lose about 15 cents. 
d. About $0.77; that is, on average, you win about 77 cents. 


e. About $0, because the random draw gives you a fair bet. 


Recycle and Review 


Exercises 35 and 36 refer to the following setting. Many chess masters and chess advocates 
believe that chess play develops general intelligence, analytical skill, and the ability to 
concentrate. According to such beliefs, improved reading skills should result from study to 
improve chess-playing skills. To investigate this belief, researchers conducted a study. All the 
subjects in the study participated in a comprehensive chess program, and their reading 
performances were measured before and after the program. The graphs and numerical 
summaries that follow provide information on the subjects’ pretest scores, posttest scores, and 
the difference (Post — Pre) between these two scores. 


Descriptive Statistics: Pretest, Posttest, Post - Pre 


Variable N Mean Median StDev Min Max Q1 Q3 
Pretest 53 57.70 58.00 17.84 23.00 99.00 44.50 70.50 
Posttest 53 63.08 64.00 18.70 28.00 99.00 48.00 76.00 
Post - Pre 53 5.38 3.00 13.02 -19.00 42.00 =3.00 14.00 
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35. Better readers? (1.3, 4.3) 


a. Did students tend to have higher reading scores after participating in the chess 
program? Justify your answer. 


b. If the study found a statistically significant improvement in the average reading score, 
could you conclude that playing chess causes an increase in reading skills? Justify your 
answer. 


Some graphical and numerical information about the relationship between pretest and 


posttest scores is provided here. 


Posttest 


20 30 40 50 60 70 80 90 =©100 
Pretest 


Regression Analysis: Posttest Versus Pretest 


Predictor Coef SE Coef Ti P 

Constant 17.897 5.889 3.04 0.004 
Pretest 0.78301 0.09758 8.02 0.000 
S = 12.55 R-Sq = 55.8% R-Sq(adj)= 54.9% 


Residual 


Fitted value 


36. Predicting posttest scores (3.2) 


a. What is the equation of the least-squares regression line relating posttest and pretest 
scores? Define any variables used. 


b. Is a linear model appropriate for describing this relationship? Justify your answer. 


c. If we use the least-squares regression line to predict students’ posttest scores from their 
pretest scores, how far off will our predictions typically be? 


SECTION 6.2 Transforming and Combining Random 
Variables 


LEARNING TARGETS By the end of the section, you should be able to: 


e Describe the effect of adding or subtracting a constant or multiplying or dividing 
by a constant on the probability distribution of a random variable. 


e Calculate the mean and standard deviation of the sum or difference of random 
variables. 


e Find probabilities involving the sum or difference of independent Normal random 
variables. 


In Section 6.1, we looked at several examples of random variables and their probability 
distributions. We also saw that the mean ply and standard deviation oy give us important 


information about a random variable. 


sjharmon/Getty Images 


Consider this new setting. An American roulette wheel has 38 slots numbered 1 through 36, 
plus 0 and 00. Half of the slots from 1 to 36 are red; the other half are black. Both the 0 and 00 
slots are green. Suppose that a player places a $1 bet on red. If the ball lands in a red slot, the 
player gets the original dollar back, plus an extra dollar for winning the bet. If the ball lands in 
a different-colored slot, the player loses the $1 bet. Let X = the net gain on a single $1 bet on 
red. Because there is an 18/38 chance that the ball lands in a red slot, the probability 
distribution of X is as shown in the table. 


Value x; -$1 $1 
Probability p; 20/38 18/38 


The mean of X is 


20 
tx = (—1) | = 
uX=(~1)(2038)+(1)(1838)=-$0.05. a ie ( =) 


That is, a player can expect to lose an average of 5 cents per $1 bet if he plays many, many 
games. You can verify that the standard deviation is oy = $1.00. If the player only plays a few 
games, his actual net gain could be much better or worse than this expected value. 

Would the player be better off playing one game of roulette with a $2 bet on red or playing 
two games and betting $1 on red each time? To find out, we need to compare the probability 
distributions of the random variables Y = gain from a $2 bet and T = total gain from two $1 
bets. Which random variable (if either) has the higher expected gain in the long run? Which has 
the larger variability? By the end of this section, you’ ll be able to answer questions like these. 


Transforming a Random Variable 


In Chapter 2, we studied the effects of transformations on the shape, center, and variability of a 
distribution of quantitative data. Here’s what we discovered: 


.. Adding (or subtracting) a constant: Adding the same positive number a to (subtracting a 
from) each observation: 


e Adds a to (subtracts a from) measures of center and location (mean, median, quartiles, 
percentiles). 


e Does not change measures of variability (range, IQR, standard deviation). 
e Does not change the shape of the distribution. 


’, Multiplying or dividing by a constant: Multiplying (or dividing) each observation by the 
same positive number b: 


e Multiplies (divides) measures of center and location (mean, median, quartiles, percentiles) 
by b. 


e Multiplies (divides) measures of variability (range, JQR, standard deviation) by b. 


e Does not change the shape of the distribution. 


How are the probability distributions of random variables affected by similar 
transformations? 


EFFECT OF ADDING OR SUBTRACTING A CONSTANT Let’s return to a familiar 
setting from Section 6.1. Pete’s Jeep Tours offers a popular day trip in a tourist area. There 
must be at least 2 passengers for the trip to run, and the vehicle will hold up to 6 passengers. 
Pete charges $150 per passenger. Let C = the total amount of money that Pete collects on a 
randomly selected trip. The probability distribution of C is shown in the table and the 
histogram. 


0 150 300 450 600 750 900 
C = Money collected ($) 


Total collected c; 300 450 600 750 900 
Probability p; 0.15 0.25 0.35 0.20 0.05 


Earlier, we calculated the mean of C as pC=$562.50He = $562.50 and the standard deviation of 
C as oC=$163.46% = $163.46, We can describe the probability distribution of C as follows: 


Shape: Roughly symmetric with a single peak 
Center: .C=$562.50He = $562.50 
Variability: oC=$163.467¢C = $163.46 


It costs Pete $100 to buy permits, gas, and a ferry pass for each day trip. The amount of 
profit V that Pete makes on a randomly selected trip is the total amount of money C that he 
collects from passengers minus $100. That is, V= C — 100. The probability distribution of V is 


Profit v; 200 350 500 650 800 
Probabilityp; 0.15 0.25 0.35 0.20 0.05 


A histogram of this probability distribution is shown here. 
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0 200 350 500 650 800 
V = Profit ($) 


We can see that the probability distribution of V has the same shape as the probability 
distribution of C. The mean of Vis 


uv=(200)(0.15) + (350)(0.25) + (500)(0.35) + (650)(0.20) + (800)(0.05)=$462.50 
jty = (200)(0.15) + (350)(0.25) + (500)(0.35) + (650)(0.20) + (800)(0.05) 
= $462.50 


On average, Pete will make a profit of $462.50 from the trip. That’s $100 less than pCHC, 
his mean amount of money collected per trip. The standard deviation of V is 


oC=(200-462.50)2(0.15)+(350-462.50)2(0.25)+--:+(800-462.50)2(0.05)=$163.46 
oc = 4/(200 — 462.50)? (0.15) + (350 — 462.50)? (0.25) + --- + (800 — 462.50)? (0.05) 
= $163.46 


That’s the same as the standard deviation of C. 

It’s fairly clear that subtracting 100 from the values of the random variable C just shifts the 
probability distribution to the left by 100. This transformation decreases the mean by 100 (from 
$562.50 to $462.50), but it doesn’t change the standard deviation ($163.46) or the shape. These 
results can be generalized for any random variable. 


THE EFFECT OF ADDING OR SUBTRACTING A CONSTANT ON A 
PROBABILITY DISTRIBUTION 


Adding the same positive number a to (subtracting a from) each value of a random variable: 


e Adds a to (subtracts a from) measures of center and location (mean, median, quartiles, 
percentiles). 


e Does not change measures of variability (range, IQR, standard deviation). 


e Does not change the shape of the probability distribution. 


Note that adding or subtracting a constant affects the distribution of a quantitative variable 
and the probability distribution of a random variable in exactly the same way. 


EXAMPLE | Scaling test scores (>) 


| Effect of adding/subtracting a constant 


monkeybusinessimages/Getty Images 


PROBLEM: In a large introductory statistics class, the score X of a randomly selected 
student on a test worth 50 points can be modeled by a Normal distribution with mean 35 and 
standard deviation 5. Due to a difficult question on the test, the professor decides to add 5 
points to each student’s score. Let Y be the scaled test score of the randomly selected student. 
Describe the shape, center, and variability of the probability distribution of Y. 


SOLUTION: 


S hape: Approximately Normal 
Center: pY = uX+5 = 35+5 = goHkY = Hx +5=35+5 = 40 
Variability: oY =0X =5%Y = 9x =5 


Notice that Y = X + 5. Adding a constant doesn’t affect the shape or the standard 


deviation of the probability distribution. 


FOR PRACTICE, TRY EXERCISE 37 


EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT The professor in the 
preceding example decides to convert his students’ scaled test scores Y to percentages. Because 
the test was scored out of 50 points, the professor multiplies each student’s scaled score by 2 to 
convert to a percent score W. That is, W = 2Y. Figure 6.3 displays the probability distributions 
of the random variables Y and W. From the graphs, we can see that the measures of center, 
location, and variability have all doubled—just like the individual students’ scores. But the 
shape of the two distributions is the same. 


20 30 40 50 60 70 80 90 100 110 
Y = scaled test score (out of 50) W = test score as a percent 
Shape: Approximately Normal Shape: Approximately Normal 
Center: y = 40 Center: w = 80 
(€) Variability: y = 5 (0) Variability: w = 10 


FIGURE 6.3 Probability distribution of (a) Y = a randomly selected statistics student’s scaled test 
score out of 50 and (b) W = the student’s scaled test score as a percent. 


THE EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT ON A 
PROBABILITY DISTRIBUTION 


Multiplying (or dividing) each value of a random variable by the same positive number b: 


Multiplies (divides) measures of center and location (mean, median, quartiles, percentiles) 
by b. 
Multiplies (divides) measures of variability (range, IQR, standard deviation) by b. 


Does not change the shape of the distribution. 


Once again, multiplying or dividing by a constant has the same effect on the probability 
distribution of a random variable as it does on a distribution of quantitative data. 


EXAMPLE |How much does college cost? (>) 
Effect of multiplying/dividing by a constant 


PROBLEM: El Dorado Community College considers a student to be full-time if he or she 
is taking between 12 and 18 units. The number of units X that a randomly selected El Dorado 
Community College full-time student is taking in the fall semester has the following 
distribution. 


Number of 12 13 414715 16-5 17 “18 
units 


Probability 0.25 0.10 0.05 0.30 0.10 0.05 0.15 


At right is a histogram of the probability distribution. The mean is pX=14.65/4% = 14-65 and 


the standard deviation is oX=2.0567X = 2-056, 


At El Dorado Community College, the tuition for full-time students is $50 per unit. That is, 
if T = tuition charge for a randomly selected full-time student, T = 50X. 


a. What shape does the probability distribution of T have? 


Multiplying by a constant doesn’t change the shape. 


b. Find the mean of T. 
c. Calculate the standard deviation of T. 


0.30 


Probability 


1 13 14 15 16 17 18 
X = Number of units 


SOLUTION: 


a. The same shape as the probability distribution of x: roughly symmetric with three 
peaks. 
6. wT= sonX = 50(14.6s) = $732.50M7 = 50x = 50 (14.65) = $732.50 


¢. oT = 500X = 50(2.056) = $102.80°T = 507x = 50 (2.056) = $102.80 
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It is not common to multiply (or divide) a random variable by a negative number b. Doing 
so would multiply (or divide) the measures of variability by |b]. We can’t have a negative 
amount of variability! Multiplying or dividing by a negative number would also affect the 
shape of the probability distribution, as all values would be reflected over the y axis. 


Think About It 


HOW DOES MULTIPLYING BY A CONSTANT AFFECT THE VARIANCE? For El 
Dorado Community College, the variance of the number of units that a randomly selected 


full-time student takes is 6X2=4.22757x = 42275. The variance of the tuition charge for 

2 
such a student is oT2=10,568.7577 = 10, 568-75. That’s (2500)(4.2275). So oT 2=25000X2 
oy = 25000 Where did 2500 come from? It’s just (50)2. In other words, oT2=(50)20X2 


a 22 
Op = (50)'o% Multiplying a random variable by a constant b multiplies the variance by b?. 


CHECK YOUR UNDERSTANDING 


. 


v A large auto dealership keeps track of sales made during each hour of the day. Let X = 
the number of cars sold during the first hour of business on a randomly selected Friday. 
Based on previous records, the probability distribution of X is as follows: 


Cars sold 0 1 2 3 
Probability 0.3 0.4 0.2 0.1 


The random variable X has mean pX=1.1/-X = 1-1 and standard deviation oX=0.943 
ox = 0.943 


Suppose the dealership’s manager receives a $500 bonus from the company for each car 
sold. Let Y = the bonus received from car sales during the first hour on a randomly selected 
Friday. 

1. Sketch a graph of the probability distribution of X and a separate graph of the probability 
distribution of Y. How do their shapes compare? 
2. Find the mean of Y. 
3. Calculate and interpret the standard deviation of Y. 
The manager spends $75 to provide coffee and doughnuts to prospective customers 


each morning. So the manager’s net profit T during the first hour on a randomly selected 
Friday is $75 less than the bonus earned. 


4. Describe the shape, center, and variability of the probability distribution of T. 


PUTTING IT ALL TOGETHER: ADDING/SUBTRACTING AND 
MULTIPLYING/DIVIDING What happens if we transform a random variable by both adding 
or subtracting a constant and multiplying or dividing by a constant? We just use the order of 
operations and the facts about transforming data that we’ve already established. 


EXAMPLE |The baby and the bathwater (>) 
Analyzing the effect of transformations 


Michael DeLeon/Getty Images 


PROBLEM: One brand of baby bathtub comes with a dial to set the water temperature. 
When the “babysafe” setting is selected and the tub is filled, the temperature X of the water 
in a randomly selected bath follows a Normal distribution with a mean of 34°C and a 
standard deviation of 2°C. Let Y be the water temperature in degrees Fahrenheit for the 


= ee 
randomly selected bath. Recall that F=95 al as hc 32) 
a. Find the mean of Y. 


b. Calculate and interpret the standard deviation of Y. 
SOLUTION: 


= 2ny +32 = 2 (34) +32 = 93.2° 
fw = o5uxes> < os(4)ig0 = og 0FtY = pax +32= 5 (34) +32 = 93.2°F 


a ~ 9 y1429 
Note that Y=95 X+32Note that ¥ = 5 X +32 


—_ ¥ re i 0 
b. oY = 950X = 95(2) = 3,6Fo} —5 ox 5 (2) 3.6°F 


When the dial is set on “babysafe, ” the temperature of a randomly selected bath 
typically varies about 3.6°F from the mean of O22 2. 


Multiplying each value of X by 9/5 multiplies the standard deviation by 9/5. 


However, adding 32 to each value doesn’t affect measures of variability. 
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The probability distribution of Y = water temperature on a randomly selected day when the 
“babysafe” setting is used is Normal because the original distribution is Normal, and adding a 
constant and multiplying by a constant don’t affect shape. We can use this Normal distribution 


to find probabilities as we did in Section 6.1. For instance, according to Babies R Us, the 
temperature of a baby’s bathwater should be between 90°F and 100°F. What’s P(90<Y<100) 
P(90< Y < 100), Figure 6.4 shows the desired probability as an area under a Normal curve. 


Y = Water temperature (°F) 
FIGURE 6.4 The Normal probability distribution of the random variable Y = the temperature (in 
degrees Fahrenheit) of the bathwater when the dial is set on “babysafe.” The shaded area is the 
probability that the water temperature is between 90°F and 100°F. 


To find the probability, we can either (i) standardize the boundary values and use Table A 
or technology; or (ii) use technology without standardizing. 


(i) z=90—-93.23.6=—0.89 z=100-93.23.6=1.89 
z= 1 = _0.89 z= = 1.89 


Using Table A: 0.9706-0.1867=0.78390-9706 — 0.1867 = 0.7839 
Using technology: normalcdf(lower:—0.89, upper:1.89, mean:0, SD:1) = 0.7839 
(ii) normalcdf(lower:90, upper:100, mean:93.2, SD:3.6) = 0.7835 


When set on “babysafe” mode, there’s about a 78% probability that the water temperature 
meets the recommendation for a randomly selected bath. 


Combining Random Variables 


So far, we have looked at settings that involved a single random variable. Many interesting 
Statistics problems require us to combine two or more random variables. 

Let’s return to the familiar setting of Pete’s Jeep Tours. Earlier, we focused on the amount 
of money C that Pete collects on a randomly selected day trip. This time we’ ll consider a 
different but related random variable: X = the number of passengers on a randomly selected 
trip. Here is its probability distribution: 


Number of passengers xj 2 3 4 5 6 
Probability p; 0.15 0.25 0.35 0.20 0.05 


You can use what you learned earlier to confirm that pX=3.754X 3.75 passengers and 
oX=1.08970x = 1.0897 passengers. 


Pete’s sister Erin runs jeep tours in another part of the country on the same days as Pete in 
her slightly smaller vehicle, under the name Erin’s Adventures. The number of passengers Y on 
a randomly selected trip has the following probability distribution. You can confirm that 
wY=3.10HY = 3.10 passengers and oY=0.9437Y = 0.943 passengers. 


Number of passengers y; 2 3 4 5 
Probability pj 0.3 0.4 0.2 O.1 


Here are two questions that we would like to answer based on this scenario: 


e What is the sum S = X + Y of the number of passengers Pete and Erin will have on their tours 
on a randomly selected day? 


e What is the difference D = X — Y in the number of passengers Pete and Erin will have on a 
randomly selected day? 


As this setting suggests, we want to investigate what happens when we add or subtract random 
variables. 


MEAN (EXPECTED VALUE) OF THE SUM OR DIFFERENCE OF TWO RANDOM 
VARIABLES How many total passengers S can Pete and Erin expect to have on their tours on 
a randomly selected day? Because Pete averages UX=3.754X ~ 3.75 passengers per trip and 
Erin averages pY=3.10HY 3.10 passengers per trip, they will average a total of 
US=3.75+3.10=6.8545 = 3.75 + 3.10 = 6.85 passengers per day. We can generalize this result 
for any two random variables. 


MEAN (EXPECTED VALUE) OF A SUM OF RANDOM VARIABLES 


For any two random variables X and Y, if S = X Y, the mean (expected value) of S is 
uS=pX+Y=pX+pYHs = x+y = Mx + My 


In other words, the mean of the sum of two random variables is equal to the sum of their 
means. 


What’s the mean of the difference D = X — Y in the number of passengers that Pete and Erin 
have on their tours on a randomly selected day? Because Pete averages pX=3.754X — 3.75 
passengers per trip and Erin averages p Y=3.10HY — 3.10 passengers per trip, the mean 
difference is pD=3.75-3.10=0.654) ~ 3.75 — 3.10 = 0.65 passengers. That is, Pete averages 
0.65 more passengers per day than Erin does. Once again, we can generalize this result for any 
two random variables. 


MEAN (EXPECTED VALUE) OF A DIFFERENCE OF RANDOM 
VARIABLES 


For any two random variables X and Y, if D = X — Y, the mean (expected value) of D is 


uD=pX-Y=pX-pYHD ~ HxX-y = Bx ~ Hy 


In other words, the mean of the difference of two random variables is equal to the difference 
of their means. 


Oo The order of subtraction is important. If we had defined D = Y — X, then 
uD=pY-pX=3.10-3.75=-0.65HD = Hy — Hx = 3.10 — 3.75 = —0.65= 3 19 — 3.75 = -0.65. In 
other words, Erin averages 0.65 fewer passengers than Pete does on a randomly chosen day. 


EXAMPLE | How much do Pete and Erin make? (>) 


Mean of a sum or difference of random variables 


moodboard/Superstock 


PROBLEM: Pete charges $150 per passenger and Erin charges $175 per passenger for a 
jeep tour. Let C = the amount of money that Pete collects and E = the amount of money that 
Erin collects on a randomly selected day. From our earlier work, we know that pC=562.50 
tic = 562.50 and it is easy to show that pE=542.50He = 542.50. Define S = C + E. Calculate 
and interpret the mean of S. 


SOLUTION: 
Us=pc+yE=562.50+542.50=$1105.00Hs = He + He = 562.50 + 542.50 = $1105.00 
Pete and Erin expect to collect a total of $1105 per day, on average, over many randomly 


selected days. 


FOR PRACTICE, TRY EXERCISE 49 


How did we calculate pCMC and pE“ in the example? Earlier, we defined X = the number 
of passengers that Pete has and Y = the number of passengers that Erin has on a randomly 
selected day trip. Recall that pX=3.754X ~ 3.75 and wY=3.10HY = 3.10 Because Pete charges 
$150 per passenger, the amount of money that he collects on a randomly selected day is C 


=150X. Multiplying a random variable by a constant multiplies the value of the mean by the 
same constant: 


yC=150pX=150(3.75)=$562,50HC = 1504x = 150 (3.75) = $562.50 


Because Erin charges $175 per passenger, E = 175Y and 
pE=175pY=175(3.10)=$542.50HE = 175p4y = 175 (3.10) = $542.50 


What’s the mean of the difference D = C — E in the amounts that Pete and Erin collect on a 
randomly chosen day? It’s 


uD=pC-pE=562.50-542.50=$20.00HD = Lic — Pr = 562.50 — 542.50 = $20.00 


On average, Pete collects $20 more per day than Erin does. 


Standard Deviation of the Sum or Difference of Two Random 
Variables 


How much variation is there in the total number of passengers S = X + Y who go on Pete’s and 
Erin’s tours on a randomly chosen day? Here are the probability distributions of X and Y once 
again. Let’s think about the possible values of S. The number of passengers X on Pete’s tour is 
between 2 and 6, and the number of passengers Y on Erin’s tour is between 2 and 5. So the total 
number of passengers S is between 4 and 11. That is, there’s more variability in the values of S 
than in the values of X or Y alone. This makes sense, because the variation in X and the 
variation in Y both contribute to the variation in S. 


Pete’s Jeeps 
Number of passengers xj 2 3 4 5 6 
Probability p; 0.15 0.25 0.35 0.20 0.05 


px=3.75  ox=1.0897He = 3.75 oy = 1.0897 


Erin’s Adventures 
Number of passengers y; 2 3 4 5 
Probability pj 0.3 04 0.2 0.1 


uY=3.10 oY=0.943HY = 3.10 oy = 0.943 


What’s the standard deviation of S = X + Y? If we had the probability distribution of S, then 
we could calculate S75. Let’s try to construct this probability distribution starting with the 
smallest possible value, S = 4. The only way to get a total of 4 passengers is if Pete has X = 2 
passengers and Erin has Y = 2 passengers. We know that P(X = 2) = 0.15 and that P(Y = 2) = 
0.3. If the events X = 2 and Y = 2 are independent, we can use the multiplication rule for 
independent events to find P(X = 2 and Y = 2). Otherwise, we’re stuck. In fact, we can’t 


calculate the probability for any value of S unless X and Y are independent random variables. 


DEFINITION Independent random variables 


If Knowing the value of X does not help us predict the value of Y, then X and Y are 
independent random variables. 


It’s reasonable to treat the random variables X = number of passengers on Pete’s trip and Y 
= number of passengers on Erin’s trip on a randomly chosen day as independent, because the 
siblings operate their trips in different parts of the country. Because X and Y are independent, 


P(S =4) = P(X =2andY = 2) 
P(S=4)=P(X=2 and Y=2)=(0.15)(0.3)=0.045 = (0.15)(0.3) = 0.045 
There are two ways to get a total of S = 5 passengers on a randomly selected day: X = 3, Y= 
2orX=2, Y=3. So 


P(S=5)=P(X=2 and Y=3)+P(X=3 and Y=2)=(0.15)(0.4)+(0.25)(0.3)=0.06+0.075=0.135 
PiS=5) = P(X —2andY —3) + P(X — Band Y = 2) 
= (0.15)(0.4) + (0.25)(0.3) 
= 0.06 + 0.075 = 0.135 


We can construct the probability distribution by listing all combinations of X and Y that yield 
each possible value of S and adding the corresponding probabilities. Here is the result: 


Sum sj 4 5 6 ri 8 9 10 11 
Probability pj; 0.045 0.135 0.235 0.265 0.190 0.095 0.030 0.005 


The mean of S is 


uS=¥ sipi=(4)(0.045)+(5)(0.135)+---+(11)(0.005)=6.85 
us = 5- sip; = (4) (0.045) + (5) (0.135) +--+ + (11) (0.005) = 6.85 


Our calculation confirms that 


US=pX+pY=3.75+3.10=6.85Hs = Hx + Hy = 3.75 + 3.10 = 6.85 


The variance of S is 
0S2=Y (si-pS)2 pi=(4—-6.85)2(0.045)+(5-6.85)2(0.135)+---+(11-6.85)2(0.005)=2.0775 
a = > (8 — us)’ pi 
= (4— 6.85)? (0.045) + (5 — 6.85)? (0.135) + --- + (11 — 6.85)? (0.005) 
= 2.0775 


The variances of X and Y are oX2=(1.0897)2=1.18757x = (1.0897)" = 1.1875 ang oy2= 


9 
(0.943)2=0,897r = (0-943)" = 0.89 Notice that 


an - = = 
0X2+0Y2=1.1875+0.89=2.0775=0827x + Fy = 1.1875 + 0.89 = 2.075 = of 


In other words, the variance of a sum of two independent random variables is the sum of their 
variances. To find the standard deviation of S, take the square root of the variance: 


0S=2.0775=1.4419%s = V 2.0775 = 1.441 


The total number of passengers on Pete’s and Erin’s trips on a randomly selected day typically 
varies by about 1.441 passengers from the mean of 6.85 passengers. 


STANDARD DEVIATION OF THE SUM OF TWO INDEPENDENT 
RANDOM VARIABLES 


For any two independent random variables X and Y, if S = X + Y, the variance of S is 


a mma gal 2 
oS2=0X+Y2=0X2+0¥278 = Oxy =I toy 


To get the standard deviation of S, take the square root of the variance: 
— 


ome ore echt 
oS=oX+Y=0X2+0Y2 9 EY TOR 


ll 2 
The formula oS2=0X2+cY27s — 7x * 7Y is sometimes referred to as the “Pythagorean theorem 
dios : se —_— 2 
of statistics.” It certainly looks similar to c2=a2+b2¢ = @ +> b*! Just as the real Pythagorean 
=o. +0%, 


2 
theorem only applies to right triangles, the formula oS2=0X2+0Y27S only applies if X 


and Y are independent random variables. 


You might be wondering whether there’s a formula for computing the variance or standard 
deviation of the sum of two random variables that are not independent. There is, but it’s beyond 
the scope of this course. 


When we add two independent random variables, their variances add. @ Standard 


deviations do not add. For Pete’s and Erin’s passenger totals, 


oX+oY=1.0897+0.943=2.03277x + oy = 1.0897 + 0.943 = 2.0327 


This is very different from oS=1.441.7s = 1.441. 

Can you guess what the variance of the difference of two independent random variables 
will be? If you were thinking something like “the difference of their variances,” think again! 
Here are the probability distributions of X and Y from the jeep tours scenario once again: 


Pete’s Jeeps 
Number of passengers x; 2 3 4 5 6 


Probability p; 0.15 0.25 0.35 0.20 0.05 


uX = 3,754#X = 3.75 
oX = 1.08977x = 1.0897 
Erin’s Adventures 


Number of passengers y; 2 3 4 5 
Probability pj; 0.3 0.4 0.2 0.1 


pY = 3.104y = 3.10 
oY =0.9437r = 0.943 
By following the process we used earlier with the random variable S = X + Y, you can build 


the probability distribution of D = X — Y. 


Value d; =3 =2 =] 0 1 2 3 4 
Probability p; 0.015 0.055 0.145 0.235 0.260 0.195 0.080 0.015 


You can use the probability distribution to confirm that: 

|. 1D=0.65=3.75-3.10=pX-pYHp = 9.65 = 3.75 — 3.10 = wx — py 

1, GD2=2.0775=1.1875+0.89=0X2+0Y 27D = 2-0775 = 1.1875 + 0.89 = a + oF 
8, oD=2.0775=1.4419D = V2.0775 = 1.441 


Result 2 shows that, just as with addition, when we subtract two independent random variables, 
variances add. There’s more variability in the values of the difference D than in the values of X 
or Y alone. This should make sense, because the variation in X and the variation in Y both 
contribute to the variation in D. 


STANDARD DEVIATION OF THE DIFFERENCE OF TWO INDEPENDENT 
RANDOM VARIABLES 


For any two independent random variables X and Y, if D = X — Y, the variance of D is 
oD2=0X-Y2=0X2+GY27) = Tx-¥ = Fx + FF 


To get the standard deviation of D, take the square root of the variance: 


> 
= 0x-y = ox + or 


oD 
oD=0X-Y=o0X2+0Y2 


Let’s put this new rule to use in a familiar setting. 


EXAMPLE |How much do Pete’s and Erin’s earnings vary? » 
SD of asum or difference of random variables 


PROBLEM: Pete charges $150 per passenger and Erin charges $175 per passenger for a 
jeep tour. Let C = the amount of money that Pete collects and E = the amount of money that 
Erin collects on a randomly selected day. From our earlier work, it is easy to show that 
oC=$163.467c = $163.46 and oE=$165.037@ = $165.03, you may assume that these two 
random variables are independent. Define D = C - E. Earlier, we found that p»D=$20 

Hp = $20 Calculate and interpret the standard deviation of D. 


SOLUTION: 


D=C-E£E. Because C and E are independent random variables, 
oD2 = oC2+0£2 = (163.46)2+(165.03)2 = 53,954.07 
o?, = o?, + 03, = (163.46) + (165.03)° = 53, 954.07 


oD = 53,954.07 = $232.287D = V33, 954.07 = $232.28 


The difference (Pete — Erin) in the amount collected on a randomly selected day typically 
vartes by about Reo s from the mean difference of $20. 


Note that variances add when you are dealing with the sum or difference of 


independent random variables. 
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How did we calculate oC7¢€ and oE?# in the example? Earlier, we defined X = the number 
of passengers that Pete has and Y = the number of passengers that Erin has on a randomly 
selected day trip. Recall that oX=1.08977% = 1.0897 and cY=0.9437Y = 9-943. Because Pete 
charges $150 per passenger, the amount of money that he collects on a randomly selected day 
is C =150X. Multiplying a random variable by a constant multiplies the value of the standard 
deviation by the same constant: 


oC=1506X=150(1.0897)=$163.467¢ = 1500x = 150 (1.0897) = $163.46 


Because Erin charges $175 per passenger, E = 175Y and 
OE=1750Y=175(0.943)=$165.0378 = 175ey = 175 (0.943) = $165.03 


COMBINING VERSUS TRANSFORMING RANDOM VARIABLES We can extend our 

rules for combining random variables to situations involving repeated observations of the same 
chance process. Let’s return to the gambler we met at the beginning of this section. Suppose he 
plays two games of roulette, each time placing a $1 bet on red. What can we say about his total 


gain (or loss) from playing two games? Earlier, we showed that if X = the amount gained on a 
single $1 bet on red, then pX=—-$0.054X = ~$0.05 and oX=-$1.007x = —$1.00, Because 
we’re interested in the player’s total gain over two games, we’|I define X as the amount he 


gains from the first game and X> as the amount he gains from the second game. Then his total 
gain T= X, + X5. Both X; and X> have the same probability distribution as X and, therefore, 


the same mean (—$0.05) and standard deviation ($1.00). The player’s expected gain in two 
games is 


uT=pX1+X2=(-$0.05)+(-$0.05)=-$0.10H7 = HX, + HX, = (—80.05) + (—$0.05) = —$0.10 


Exactostock-1527/Superstock 


Because knowing the result of one game tells the player nothing about the result of the other 
game, X, and X> are independent random variables. As a result, 


a eo : = 
o6T2=6X12+0X22=(1.00)2+(1.00)2=2.00° = Om. Fxg = 00)" + L.00)" = 2.00 


and the standard deviation of the player’s total gain is 
oT=.00=$1.419r = v-00 = $1.41 


At the beginning of the section, we asked whether a roulette player would be better off 
placing two separate $1 bets on red or a single $2 bet on red. We just showed that the expected 
total gain from two $1 bets is pT=-$0.10/T = ~$0.10 with a standard deviation of oT=$1.41 
or =$1.41_ Now think about what happens if the gambler places a $2 bet on red in a single 
game of roulette. Because the random variable X represents a player’s gain from a $1 bet, the 
random variable Y = 2X represents his gain from a $2 bet. 

What’s the player’s expected gain from a single $2 bet on red? It’s 


pY=2'X=2(-$0.05)=-$0.10#¥ = 2#x = 2(—$0.05) = —$0.10 


That’s the same as his expected gain from playing two games of roulette with a $1 bet each 
time. But the standard deviation of the player’s gain from a single $2 bet is 
oY =20X=2($1.00)=$2.007Y = 2x = 2 ($1.00) = $2.00 

Compare this result to oT=$1.419r =1-41, There’s more variability in the gain from a 
single $2 bet than in the total gain from two $1 bets. og Bottom line: Xj + X> is not the same 
as 2X. 

Let’s take this one step further. Would it be better for the player to place a single $100 bet 
on red or to play 100 games and bet $1 each time on red? With the single $100 bet, the player 
will either win big or lose big (lots of variation). But with 100 $1 bets, there will be some wins 
and some losses, and the end result will be much less variable. 


CHECK YOUR UNDERSTANDING 


. 


”, A large auto dealership keeps track of sales and lease agreements made during each hour 
of the day. Let X = the number of cars sold and Y = the number of cars leased during the first 
hour of business on a randomly selected Friday. Based on previous records, the probability 


distributions of X and Y are as follows: 


Cars sold x; 0 1 2 3 
Probability pj; 03 04 O2 #4=90.1 


Mean: pX=1.1Standard deviation: oX=0.943 
Mean: zx = 1.1 Standard deviation: ox = 0.943 


Cars leased y; 0 1 2 
Probability pj 0.4 0.5 0.1 


Mean: p Y=0.7Standard deviation: cY=0.64 Mean: py = 0.7 Standard deviation: oy = 0.64 
Define T = X + Y. Assume that X and Y are independent. 
1. Find and interpret pT? . 


2. Calculate and interpret oT?7. 


3. The dealership’s manager receives a $500 bonus for each car sold and a $300 bonus for 
each car leased. Find the mean and standard deviation of the manager’s total bonus B. 


Combining Normal Random Variables 


So far, we have concentrated on developing rules for means and variances of random variables. 
If a random variable is Normally distributed, we can use its mean and standard deviation to 
compute probabilities. What happens if we combine two independent Normal random 
variables? 


We used software to simulate separate random samples of size 1000 for each of two 


independent, Normally distributed random variables, X and Y. Their means and standard 
deviations are as follows: 


uX=3HX = 3 gX=0,97x = 0.9 wY=1ey =1) gy=1.20y = 1.2 


Figure 6.5(a) shows the results. What do we know about the sum and difference of these two 
random variables? The histograms in Figure 6.5(b) came from adding and subtracting the 
corresponding values of X and Y for the 1000 randomly generated observations from each 
probability distribution. 
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FIGURE 6.5 (a) Histograms showing the results of randomly selecting 1000 values of two 
independent, Normal random variables X and Y. (b) Histograms of the sum and difference of the 
1000 randomly selected values of X and Y. 


As the simulation illustrates, any sum or difference of independent Normal random 
variables is also Normally distributed. The mean and standard deviation of the resulting 
Normal distribution can be found using the appropriate rules for means and standard 
deviations: 


Sum X + Y Difference X - Y 
Mean pX+Y=pX+pY=3+1=4 wX-Y=pX-pY=3-1=2 
x+y = ex + py =3+1=4 ex-y = fx py =3—-lL=2 
SD 0X-Y2=0X2+0Y2=0.92+1.22=2.250X-Y=2.25=1.5 


OX+Y2=0X2+0Y2=0.9241.22=2.250X+Y=2.25=1.5 oy y = aX { oF — 0.97 + 1.2? = 2.25 


OxLY = ox + oF a 0.9° r 1.2° = 2.25 Ox-y —v 220 1:5 
Oxiy —v 220 15 


EXAMPLE | Will the lid fit? >) 


Combining normal random variables 


PROBLEM: The diameter C of the top of a randomly selected large drink cup at a fast-food 
restaurant follows a Normal distribution with a mean of 3.96 inches and a standard deviation 
of 0.01 inch. The diameter L of a randomly selected large lid at this restaurant follows a 
Normal distribution with mean 3.98 inches and standard deviation 0.02 inch. Assume that L 
and C are independent random variables. Let the random variable D = L - C be the 
difference between the lid’s diameter and the cup’s diameter. 


a. Describe the distribution of D. 


b. For a lid to fit on a cup, the value of L has to be bigger than the value of C, but not by 
more than 0.06 inch. Find the probability that a randomly selected lid will fit on a 
randomly selected cup. Interpret this value. 


karandaev/ 
Getty Images 


SOLUTION: 


a Shape: Normal 
Center:w'D=3.98-3.96=0.02 inchCenter : up = 3.98 — 3.96 = 0.02 inch 
Variability:cD=(0.02)2+(0.01)2=0.0224 inch 


Variability : ap = \/ (0.02) + (0.01)? = 0.0224 inch 


D is the difference of two independent Normal random variables. 


UD=y"L-yCHP — fiE-— Be 


{9 2 
jo, +96 


oD = 
oD=0L2+0C2 \ 


b. The lid will fit if 0<£-Cso.069 <L—C< 0.06 that is, if 0<Dso.060 < D< 0.06, 
\ 


0.0472 -0.0243 -0.0024 0.02 0.0424 0.0648 0.0872 
D = Difference (Lid — Cup) in diameter (in.) 


1. Draw a Normal distribution. 
2. Perform calculations—show your work! 


(i) Standardize and use Table A or technology; or 
(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


2 0,06—0.02 = 1.79 
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1. Z=0-0.020.0224=-0.89 Z=0.06—0.020.0224=1.79 0.0224 0.0224 


Using Table A: 0.9633 - 0.1867 = 0.7766 


Using technology: normalcdf(lower: -0.89, Upper1.79, mean:o, SD:1) = 0.7765 


i. normalcdf(lower:o, upper:0.06, mean:0.02, SD:0.0224) = O17 70 


There’s about a 78% chance that a randomly selected lid will fit on a randomly 
selected cup. 
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We can extend what we have learned about combining independent Normal random 
variables to settings that involve repeated observations from the same probability distribution. 
Consider this scenario. Mr. Starnes likes sugar in his hot tea. From experience, he needs 
between 8.5 and 9 grams of sugar in a cup of tea for the drink to taste right. While making his 
tea one morning, Mr. Starnes adds four randomly selected packets of sugar. Suppose the 
amount of sugar in these packets follows a Normal distribution with mean 2.17 grams and 
standard deviation 0.08 gram. What’s the probability that Mr. Starnes’s tea tastes right? 

Let X = the amount of sugar in a randomly selected packet. Then X; = amount of sugar in 
Packet 1, X> = amount of sugar in Packet 2, X3 = amount of sugar in Packet 3, and X4 = amount 
of sugar in Packet 4. Each of these random variables has a Normal distribution with mean 2.17 
grams and standard deviation 0.08 grams. We’re interested in the total amount of sugar that Mr. 
Starnes puts in his tea: T= X1 + Xy + X3+ Xz. 

The random variable T is a sum of four independent Normal random variables. So T 
follows a Normal distribution with mean 


WT=pX1+pX2+yX3+pX4=.17+.17+.17+.17=8.68 grams 
ber = Bx, + Ux, + ex, + ux, = 174+ .17+ .17+ .17 = 8.68 grams 


and variance 


6T2=0X12+0X22+0X32+0X42=(0.08)2+(0.08)2+(0.08)2+(0.08)2=0.0256 
Op = 7%, + 0%, + 0%, + 0%, = (0.08)° + (0.08)” + (0.08) + (0.08)” = 0.0256 
The standard deviation of T is 
oT=0.0256=0.16 gram?r = V 9.0256 = 0.16 gram 


We want to find the probability that the total amount of sugar in Mr. Starnes’s tea is between 
8.5 and 9 grams. Figure 6.6 shows this probability as the area under a Normal curve. 


To find this area, we can use either of our two familiar methods: 


8.2 8.36 8.52 8.68 8.84 9 9.16 
8.50 


T = Total amount of sugar (g) 


FIGURE 6.6 Normal distribution of the total amount of sugar in Mr. Starnes’s tea. 


(i) Standardize the boundary values and use Table A or technology: 
_ 8.5-8.68 _ 9—8.68 


eT and ee 
2=8.5-8.680.16=—1.13andz=9-8.680.16=2.00° 0.16 ET She 


Using Table A: P(-1.13 < Z < 2.00) = 0.9772 — 0.1292 = 0.8480 
Using technology: normalcdf(lower:—1.13, upper:2.00, mean:0, SD:1) = 0.8480 


= 2.00 


(ii) Use technology to find the desired area without standardizing. 


normalcdf(lower:8.5, upper:9, mean:8.68, SD:0.16) = 0.8470 


There’s about an 85% probability that Mr. Starnes’s tea will taste right. 


Section 6.2 Summary 


e Adding a positive constant a to (subtracting a from) a random variable increases (decreases) 
measures of center and location by a, but does not affect measures of variability (range, IQR, 
standard deviation) or the shape of its probability distribution. 

e Multiplying (dividing) a random variable by a positive constant b multiplies (divides) 
measures of center and location by b and multiplies (divides) measures of variability (range, 
IQR, standard deviation) by b, but does not change the shape of its probability distribution. 


e If X and Y are any two random variables, 


uX+Y=pX+pVYexX+Y = Hx + HY: The mean of the sum of two random variables is the 
sum of their means. 


uxX-Y=pX-pyex-y = Hx ~ HY: The mean of the difference of two random variables is 
the difference of their means. 


e If X and Y are independent random variables, then knowing the value of one variable tells 
you nothing about the value of the other. In that case, variances add: 


Be seed 2 
oX+Y2=0X2+0Y27xX+Y ~ 7x + °Y: The variance of the sum of two independent random 


variables is the sum of their variances. 


2 ae 2 
oX-Y2=0X2+0Y27X-Y ~ 7x + %Y: The variance of the difference of two independent 
random variables is the sum of their variances. 


e To get the standard deviation of the sum or difference of two independent random variables, 
calculate the variance and then take the square root: 
f <9 2 
OX+Y = Ox-Y = 4/Cxy + OIy 
OX+Y=o0X-Y=0X2+0Y2 — 
e The sum or difference of independent Normal random variables is a Normal random 
variable. 


Section 6.2 Exercises 


37. PQ 384 |») Driving to work The time X it takes Hattan to drive to work on a randomly 


selected day follows a distribution that is approximately Normal with mean 15 minutes 
and standard deviation 6.5 minutes. Once he parks his car in his reserved space, it takes 5 
more minutes for him to walk to his office. Let T = the total time it takes Hattan to reach 
his office on a randomly selected day, so T= X + 5. Describe the shape, center, and 
variability of the probability distribution of T. 


38. Toy shop sales Total gross profits G on a randomly selected day at Tim’s Toys follow a 
distribution that is approximately Normal with mean $560 and standard deviation $185. 
The cost of renting and maintaining the shop is $65 per day. Let P = profit on a randomly 
selected day, so P = G — 65. Describe the shape, center, and variability of the probability 
distribution of P. 


39. Airline overbooking Airlines typically accept more reservations for a flight than the 
number of seats on the plane. Suppose that for a certain route, an airline accepts 40 
reservations on a plane that carries 38 passengers. Based on experience, the probability 
distribution of Y = the number of passengers who actually show up for a randomly 
selected flight is given in the following table. You can check that pYY = 37.4 and oY7Y 


= 1.24. 
Number of passengers y; 35 36 37 38 39 40 
Probability p; 0.10 0.10 0.30 0.35 0.10 0.05 


There is also a crew of two flight attendants and two pilots on each flight. Let X = the total 
number of people (passengers plus crew) on a randomly selected flight. 


a. Make a graph of the probability distribution of X. Describe its shape. 


b. Find and interpret pX*. 


40. 


41. 


42. 


c. Calculate and interpret oX?% . 


City parking Victoria parks her car at the same garage every time she goes to work. 
Because she stays at work for different lengths of time each day, the fee the parking 
garage charges on a randomly selected day is a random variable, G. The table gives the 
probability distribution of G. You can check that pG4G= $14 and oG?%6 = $2.74. 


Garage fee g; $10 $13 $15 $20 
Probability p; 0.20 0.25 0.45 0.10 


In addition to the garage’s fee, the city charges a $3 use tax each time Victoria parks her 
car. Let T = the total amount of money she pays on a randomly selected day. 

a. Make a graph of the probability distribution of T. Describe its shape. 

b. Find and interpret pTM?. 

c. Calculate and interpret oT?7. 


pg 385 |») Get on the boat! A small ferry runs every half hour from one side of a large 


river to the other. The number of cars X on a randomly chosen ferry trip has the 
probability distribution shown here with mean pX=3.87HX — 3.87 and standard deviation 
OX=1.297X = 1.29. 


o 412 3 *& 
X = Number of cars 


The cost for the ferry trip is $5. Define M = money collected on a randomly selected ferry 
trip. 


a. What shape does the probability distribution of M have? 
b. Find the mean of M. 


c. Calculate the standard deviation of M. 


Skee Ball Ana is a dedicated Skee Ball player who always rolls for the 50-point slot. 
Ana’s score X on a randomly selected roll of the ball has the probability distribution 
shown here with mean pX#X = 23.8 and standard deviation oX7X = 12.63. 


43. 


44. 
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A player receives one ticket from the game for every 10 points scored. Define T = number 
of tickets Ana gets on a randomly selected roll. 


a. What shape does the probability distribution of T have? 
b. Find the mean of T. 


c. Calculate the standard deviation of T. 


Still waiting for the server? How does your web browser get a file from the Internet? 
Your computer sends a request for the file to a web server, and the web server sends back 
a response. Let Y = the amount of time (in seconds) after the start of an hour at which a 
randomly selected request is received by a particular web server. The probability 
distribution of Y can be modeled by a uniform density curve on the interval from 0 to 3600 
seconds. Define the random variable W = Y/60. 


a. Explain what W represents. 


b. What probability distribution does W have? 


Where’s the bus? Sally takes the same bus to work every morning. Let X = the amount of 
time (in minutes) that she has to wait for the bus on a randomly selected day. The 
probability distribution of X can be modeled by a uniform density curve on the interval 
from 0 minutes to 8 minutes. Define the random variable V = 60X. 


a. Explain what V represents. 


b. What probability distribution does V have? 


Exercises 45 and 46 refer to the following setting. Ms. Hall gave her class a 10-question 
multiple-choice quiz. Let X = the number of questions that a randomly selected student in 
the class answered correctly. The computer output gives information about the probability 
distribution of X. To determine each student’s grade on the quiz (out of 100), Ms. Hall will 
multiply his or her number of correct answers by 5 and then add 50. Let G = the grade of a 
randomly chosen student in the class. 


“Mean Median StDev Min Max QQ, Q3 
7.6 8.5 1.32 4 10 8 9 


45. 


46. 


47. 


48. 


49. 


50. 


Easy quiz 
a. Find the median of G. 


b. Find the interquartile range (IQR) of G. 


More easy quiz 
a. Find the mean of G. 


b. Find the range of G. 


387 Too cool at the cabin? During the winter months, the temperatures at the 
pg. ey g p 


Starneses’ Colorado cabin can stay well below freezing (32°F or 0°C) for weeks at a time. 
To prevent the pipes from freezing, Mrs. Starnes sets the thermostat at 50°F. She also buys 
a digital thermometer that records the indoor temperature each night at midnight. 
Unfortunately, the thermometer is programmed to measure the temperature in degrees 
Celsius. Based on several years’ worth of data, the temperature T in the cabin at midnight 
on a randomly selected night can be modeled by a Normal distribution with mean 8.5°C 
and standard deviation 2.25°C. Let Y = the temperature in the cabin at midnight on a 
randomly selected night in degrees Fahrenheit (recall that F = (9/5)C + 32). 


a. Find the mean of Y. 
b. Calculate and interpret the standard deviation of Y. 


c. Find the probability that the midnight temperature in the cabin is less than 40°F. 


How much cereal? A company’s single-serving cereal boxes advertise 1.63 ounces of 
cereal. In fact, the amount of cereal X in a randomly selected box can be modeled by a 
Normal distribution with a mean of 1.70 ounces and a standard deviation of 0.03 ounce. 
Let Y = the excess amount of cereal beyond what’s advertised in a randomly selected box, 
measured in grams (1 ounce = 28.35 grams). 


a. Find the mean of Y. 
b. Calculate and interpret the standard deviation of Y. 


c. Find the probability of getting at least 1 gram more cereal than advertised. 


pg 389 |») Community College costs EF] Dorado Community College has a main 


campus in the suburbs and a downtown campus. The amount X spent on tuition by a 
randomly selected student at the main campus has mean $732.50 and standard deviation 
$103. The amount Y spent on tuition by a randomly selected student at the downtown 
campus has mean $825 and standard deviation $126.50. Suppose we randomly select one 
full-time student from each of the two campuses. Calculate and interpret the mean of the 
sumS=X+Y. 


Essay errors Typographical and spelling errors can be either “nonword errors” or “word 


51. 


52. 


53. 


54. 


50. 


56. 


errors.” A nonword error is not a real word, as when “the” is typed as “teh.” A word error 
is areal word, but not the right word, as when “lose” is typed as “loose.” When students 
are asked to write a 250-word essay (without spell-checking), the number of nonword 
errors X in a randomly selected essay has mean 2.1 and standard deviation 1.136. The 
number of word errors Y in the essay has mean 1.0 and standard deviation 1.0. Calculate 
and interpret the mean of the sum S = X + Y. 


Study habits The Survey of Study Habits and Attitudes (SSHA) is a psychological test 
that measures academic motivation and study habits. The SSHA score F of a randomly 
selected female student at a large university has mean 120 and standard deviation 28, and 
the SSHA score M of a randomly selected male student at the university has mean 105 and 
standard deviation 35. Suppose we select one male student and one female student at 
random from this university and give them the SSHA test. Calculate and interpret the 
mean of the difference D = F — M in their scores. 


Commuting to work Sulé’s job is just a few bus stops away from his house. While it can 
be faster to take the bus to work than to walk, the travel time is more variable due to 
traffic. The commute time B if Sulé takes the bus to work on a randomly selected day has 
mean 12 minutes and standard deviation 4 minutes. The commute time W if Sulé walks to 
work on a randomly selected day has mean 16 minutes and standard deviation 1 minute. 
Calculate and interpret the mean of the difference D = B —- W in the time it would take 
Sulé to get to work on a randomly selected day. 


Community college costs Refer to Exercise 49. At the main campus, full-time students 
pay $50 per unit. At the downtown campus, full-time students pay $55 per unit. Find the 
mean of the difference D (Main — Downtown) in the number of units that the two 
randomly selected students take. 


Essay scores Refer to Exercise 50. An English professor deducts 3 points from a student’s 
essay score for each nonword error and 2 points for each word error. Find the mean of the 
total score deductions T for a randomly selected essay. 


Rainy days Imagine that we randomly select a day from the past 10 years. Let X be the 
recorded rainfall on this date at the airport in Orlando, Florida, and Y be the recorded 
rainfall on this date at Disney World just outside Orlando. Suppose that you know the 


means XX and pYY and the variances oX27x and oY2°¥ of both variables. 


a. Can we calculate the mean of the total rainfall X + Y to be pwX+p YX * HY ? Explain 


your answer. 


b. Can we calculate the variance of the total rainfall to be oX2+0Y27x * 7? Explain 


your answer. 


His and her earnings Researchers randomly select a married couple in which both 
spouses are employed. Let X be the income of the husband and Y be the income of the 


wife. Suppose that you know the means pX/X and p YY and the variances oX27% and 


oY27¥ of both variables. 


a. Can we calculate the mean of the total income X + Y to be ty + fy? Explain your 


answer. 


2 2 
b. Can we calculate the variance of the total income to be oX2+oY2°X * °Y? Explain 
your answer. 


57. PQ 392 |») Community college costs Refer to Exercise 49. Note that X and Y are 


58. 


59. 


60. 


61. 


62. 


independent random variables because the two students are randomly selected from each 
of the campuses. Calculate and interpret the standard deviation of the sum S = X + Y. 


Essay errors Refer to Exercise 50. Assume that the number of non-word errors X and 
word errors Y in a randomly selected essay are independent random variables. Calculate 
and interpret the standard deviation of the sum S = X + Y. 


Study habits Refer to Exercise 51. 


a. Assume that F and M are independent random variables. Explain what this means in 
context. 


b. Calculate and interpret the standard deviation of the difference D = F — M in their 


scores. 


c. From the information given, can you find the probability that the randomly selected 
female student has a higher SSHA score than the randomly selected male student? 
Explain why or why not. 


Commuting to work Refer to Exercise 52. 


a. Assume that B and W are independent random variables. Explain what this means in 
context. 


b. Calculate and interpret the standard deviation of the difference D (Bus — Walk) in the 


time it would take Sulé to get to work on a randomly selected day. 


c. From the information given, can you find the probability that it will take Sulé longer to 
get to work on the bus than if he walks on a randomly selected day? Explain why or 
why not. 


Community college costs Refer to Exercise 49. Note that X and Y are independent 
random variables because the two students are randomly selected from each of the 
campuses. At the main campus, full-time students pay $50 per unit. At the downtown 
campus, full-time students pay $55 per unit. Suppose we randomly select one full-time 
student from each of the two campuses. Find the standard deviation of the difference D 
(Main — Downtown) in the number of units that the two randomly selected students take. 


Essay scores Refer to Exercise 50. Assume that the number of nonword errors X and word 
errors Y in a randomly selected essay are independent random variables. An English 


professor deducts 3 points from a student’s essay score for each nonword error and 2 
points for each word error. Find the standard deviation of the total score deductions T for a 
randomly selected essay. 


Exercises 63 and 64 refer to the following setting. In Exercise 17 of Section 6.1, we examined 
the probability distribution of the random variable X = the amount a life insurance company 
ears on a randomly chosen 5-year term life policy. Calculations reveal that uX=$303.35 

Hx =$303.35 and oX=$9707.57.7x =$9707.57. 


63. 


64. 


65. 


66. 


Life insurance The risk of insuring one person’s life is reduced if we insure many people. 
Suppose that we randomly select two insured 21-year-old males, and that their ages at 
death are independent. If X; and Xp are the insurer’s income from the two insurance 
policies, the insurer’s average income W on the two policies is 
, %X1+X9 

W=X1+X22" 2 
Find the mean and standard deviation of W. (You see that the mean income is the same as 
for a single policy, but the standard deviation is less.) 


Life insurance If we randomly select four insured 21-year-old men, the insurer’s average 
income is 
X1 +XQ+X3+Xq4 
V=X1+X2+X3+X44 4 
where X; is the income from insuring one man. Assuming that the amount of income 
earned on individual policies is independent, find the mean and standard deviation of V. 


(If you compare with the results of Exercise 63, you should see that averaging over more 
insured individuals reduces risk.) 


pg 395 |») Time and motion A time-and-motion study measures the time required for 


an assembly-line worker to perform a repetitive task. The data show that the time X 
required to bring a part from a bin to its position on an automobile chassis follows a 
Normal distribution with mean 11 seconds and standard deviation 2 seconds. The time Y 
required to attach the part to the chassis follows a Normal distribution with mean 20 
seconds and standard deviation 4 seconds. The study finds that the times required for the 
two steps are independent. 


a. Describe the distribution of the total time required for the entire operation of 
positioning and attaching a randomly selected part. 


b. Management’s goal is for the entire process to take less than 30 seconds. Find the 


probability that this goal will be met for a randomly selected part. 


Ohm-my! The design of an electronic circuit for a toaster calls for a 100-ohm resistor and 
a 250-ohm resistor connected in series so that their resistances add. The resistance X of a 
100-ohm resistor in a randomly selected toaster follows a Normal distribution with mean 


67. 


68. 


69. 


70. 


100 ohms and standard deviation 2.5 ohms. The resistance Y of a 250-ohm resistor in a 
randomly selected toaster follows a Normal distribution with mean 250 ohms and standard 
deviation 2.8 ohms. The resistances X and Y are independent. 


a. Describe the distribution of the total resistance of the two components in series for a 
randomly selected toaster. 


b. Find the probability that the total resistance for a randomly selected toaster lies between 


345 and 355 ohms. 


Yard work Lamar and Hareesh run a two-person lawn-care service. They have been 
caring for Mr. Johnson’s very large lawn for several years, and they have found that the 
time L it takes Lamar to mow the lawn on a randomly selected day is approximately 
Normally distributed with a mean of 105 minutes and a standard deviation of 10 minutes. 
The time H it takes Hareesh to use the edger and string trimmer on a randomly selected 
day is approximately Normally distributed with a mean of 98 minutes and a standard 
deviation of 15 minutes. Assume that L and H are independent random variables. Find the 
probability that Lamar and Hareesh finish their jobs within 5 minutes of each other on a 
randomly selected day. 


Hit the track Andrea and Barsha are middle-distance runners for their school’s track 
team. Andrea’s time A in the 400-meter race on a randomly selected day is approximately 
Normally distributed with a mean of 62 seconds and a standard deviation of 0.8 second. 
Barsha’s time B in the 400-meter race on a randomly selected day is approximately 
Normally distributed with a mean of 62.8 seconds and a standard deviation of 1 second. 
Assume that A and B are independent random variables. Find the probability that Barsha 
beats Ashley in the 400-meter race on a randomly selected day. 


Swim team Hanover High School has the best women’s swimming team in the region. 
The 400-meter freestyle relay team is undefeated this year. In the 400-meter freestyle 
relay, each swimmer swims 100 meters. The times, in seconds, for the four swimmers this 
season are approximately Normally distributed with means and standard deviations as 
shown. Assume that the swimmer’s individual times are independent. Find the probability 
that the total team time in the 400-meter freestyle relay for a randomly selected race is less 
than 220 seconds. 


Swimmer Mean StDev 
Wendy 55.2 2.8 
Jill 58.0 3.0 
Carmen 56.3 2.6 


Latrice 54.7 2.f 


Toothpaste Ken is traveling for his business. He has a new 0.85-ounce tube of toothpaste 
that’s supposed to last him the whole trip. The amount of toothpaste Ken squeezes out of 
the tube each time he brushes is independent, and can be modeled by a Normal 
distribution with mean 0.13 ounce and standard deviation 0.02 ounce. If Ken brushes his 


teeth six times on a randomly selected trip, what’s the probability that he’ ll use all the 
toothpaste in the tube? 


71. Auto emissions The amount of nitrogen oxides (NOX) present in the exhaust of a 
particular model of old car varies from car to car according to a Normal distribution with 
mean 1.4 grams per mile (g/mi) and standard deviation 0.3 g/mi. Two randomly selected 
cars of this model are tested. One has 1.1 g/mi of NOX; the other has 1.9 g/mi. The test 
station attendant finds this difference in emissions between two similar cars surprising. If 
the NOX levels for two randomly chosen cars of this type are independent, find the 
probability that the difference is greater than 0.8 or less than —0.8. 


72. Loser buys the pizza Leona and Fred are friendly competitors in high school. Both are 
about to take the ACT college entrance examination. They agree that if one of them scores 
5 or more points better than the other, the loser will buy the winner a pizza. Suppose that, 
in fact, Fred and Leona have equal ability so that each score on a randomly selected test 
varies Normally with mean 24 and standard deviation 2. (The variation is due to luck in 
guessing and the accident of the specific questions being familiar to the student.) The two 
scores are independent. What is the probability that the scores differ by 5 or more points in 
either direction? 


Multiple Choice Select the best answer for Exercises 73 and 74, which refer to the following 
setting. 

The number of calories in a 1-ounce serving of a certain breakfast cereal is a random variable 
with mean 110 and standard deviation 10. The number of calories in a cup of whole milk is a 
random variable with mean 140 and standard deviation 12. For breakfast, you eat 1 ounce of 
the cereal with 1/2 cup of whole milk. Let T be the random variable that represents the total 
number of calories in this breakfast. 


73. The mean of T is 
a. 110. 
b. 140. 
c. 180. 
d. 195. 
e. 250. 


74. The standard deviation of T is 
a. 22. 
b. 16. 
ce 15.62. 
d. 11.66. 


e. 4. 


Recycle and Review 


rien 


76. 


Fluoride varnish (4.2) In an experiment to measure the effect of fluoride “varnish” on the 
incidence of tooth cavities, thirty-four 10-year-old girls whose parents volunteered them 
for the study were randomly assigned to two groups. One group was given fluoride 
varnish annually for 4 years, along with standard dental hygiene; the other group followed 
only the standard dental hygiene regimen. The mean number of cavities in the two groups 
was compared at the end of the 4 years. 


a. Are the participants in this experiment subject to the placebo effect? Explain. 
b. Describe how you could alter this experiment to make it double-blind. 


c. Explain the purpose of the random assignment in this experiment. 


Buying stock (5.3, 6.1) You purchase a hot stock for $1000. The stock either gains 30% 
or loses 25% each day, each with probability 0.5. Its returns on consecutive days are 
independent of each other. You plan to sell the stock after two days. 


a. What are the possible values of the stock after two days, and what is the probability for 
each value? What is the probability that the stock is worth more after two days than the 
$1000 you paid for it? 


b. What is the mean value of the stock after two days? 


Comment: You see that these two criteria give different answers to the question “Should I 
invest?” 


SECTION 6.3 Binomial and Geometric Random Variables 


LEARNING TARGETS By the end of the section, you should be able to: 
Determine whether the conditions for a binomial setting are met. 
Calculate and interpret probabilities involving binomial distributions. 


Calculate the mean and standard deviation of a binomial random variable. 
Interpret these values. 


When appropriate, use the Normal approximation to the binomial distribution to 
calculate probabilities.* 


Find probabilities involving geometric random variables. 


When the same chance process is repeated several times, we are often interested in whether a 
particular outcome does or doesn’t happen on each trial. Here are some examples: 


e To test whether someone has extrasensory perception (ESP), choose one of four cards at 
random—a star, wave, cross, or circle. Ask the person to identify the card without seeing it. 
Do this a total of 50 times and see how many cards the person identifies correctly. Chance 
process: choose a card at random. Outcome of interest: person identifies card correctly. 
Random variable: X = number of correct identifications. 


e A shipping company claims that 90% of its shipments arrive on time. To test this claim, take 
a random sample of 100 shipments made by the company last month and see how many 
arrived on time. Chance process: randomly select a shipment and check when it arrived. 
Outcome of interest: arrived on time. Random variable: Y = number of on-time shipments. 


e Inthe game of Pass the Pigs, a player rolls a pair of pig-shaped dice. On each roll, the player 
earns points according to how the pigs land. If the player gets a “pig out,” in which the two 
pigs land on opposite sides, she loses all points earned in that round and must pass the pigs to 
the next player. A player can choose to stop rolling at any point during her turn and to keep 
the points that she has earned before passing the pigs. Chance process: roll the pig dice. 
Outcome of interest: pig out. Random variable: T = number of rolls it takes the player to pig 
out. 


Alex Clark/Alamy 


Some random variables, like X and Y in the first two bullets, count the number of times the 
outcome of interest occurs in a fixed number of trials. They are called binomial random 
variables. Other random variables, like T in the Pass the Pigs setting, count the number of trials 
of the chance process it takes for the outcome of interest to occur. They are known as 
geometric random variables. These two special types of discrete random variables are the focus 
of this section. 


Binomial Settings and Binomial Random Variables 


Let’s start with an activity that involves repeating a chance process several times. 


ACTIVITY Pop quiz! 


It’s time for a pop quiz! We hope you are ready. The quiz consists of 10 multiple-choice 
questions. Each question has five answer choices, labeled A through E. Now for the bad 
news: you will not get to see the questions. You just have to guess the answer for each one! 


1. Get out a blank sheet of paper. Write your name at the top. Number your paper from 1 to 
10. Then guess the answer to each question: A, B, C, D, or E. Do not look at anyone else’s 
paper! You have 2 minutes. 


2. Now it’s time to grade the quizzes. Exchange papers with a classmate. Your teacher will 
display the answer key. The correct answer for each of the 10 questions was determined 
randomly so that A, B, C, D, or E was equally likely to be chosen. 


3. How did you do on your quiz? Make a class dotplot that shows the number of correct 
answers for each student in your class. As a class, describe what you see. 


In the “Pop quiz” activity, each student is performing repeated trials of the same chance 
process: guessing the answer to a multiple-choice question. We’re interested in the number of 
times that a specific event occurs: getting a correct answer (which we’ll call a “success”). 
Knowing the outcome of one question (right or wrong guess) tells us nothing about the 
outcome of any other question. That is, the trials are independent. The number of trials is fixed 
in advance: n = 10. And a student’s probability of getting a “success” is the same on each trial: 
p = 0.2. When these conditions are met, we have a binomial setting. 


DEFINITION Binomial setting 


A binomial setting arises when we perform n independent trials of the same 
chance process and count the number of times that a particular outcome (called a 
“suCcCess”) OCCurs. 


The four conditions for a binomial setting are 


e Binary? The possible outcomes of each trial can be classified as “success” or 
“failure.” 


e Independent? Trials must be independent. That is, knowing the outcome of one 
trial must not tell us anything about the outcome of any other trial. 


e Number? The number of trials n of the chance process must be fixed in 
advance. 


e Same probability? There is the same probability of success p on each trial. 


The boldface letters in the definition box give you a helpful way to remember the 
conditions for a binomial setting: just check the BINS! 

When checking the binary condition, note that there can be more than two possible 
outcomes per trial—in the “Pop Quiz” Activity, each question (trial) had five possible answer 
choices: A, B, C, D, or E. If we define “success” as guessing the correct answer to a question, 
then “failure” occurs when the student guesses any of the four incorrect answer choices. 


EXAMPLE | From blood types to aces (>) 
Identifying binomial settings 


PROBLEM: Determine whether the given scenario describes a binomial setting. Justify 
your answer. 


a. Genetics says that the genes children receive from their parents are independent from one 


child to another. Each child of a particular set of parents has probability 0.25 of having 
type O blood. Suppose these parents have 5 children. Count the number of children with 
type O blood. 

b. Shuffle a standard deck of 52 playing cards. Turn over the first 10 cards, one at a time. 


Record the number of aces you observe. 


c. Shuffle a deck of cards. Turn over the top card. Put the card back in the deck, and shuffle 
again. Repeat this process until you get an ace. Count the number of cards you had to turn 
over. 


<> 


tam_odin/Shutterstock.com 


SOLUTION: 


a. ¢ Binary? “Success” = has type O blood. “Failure” = doesn’t have type O blood. 


. Independent? ‘Knowing one child’s blood type tells you nothing about another child’s 
because they inherit genes independently from their parents. 


* Number? n = 5 


* Same probability? = OS 
This ts a binomial setting. 


Check the BINS! A trial consists of observing the blood type for one of these 
parents’ children. 


All the conditions are met and we are counting the number of successes (children 
with type O blood). 


b. ¢ Binary? "Suecess = get an ace. “Failure” = don’t get an ace. 


° Independent? No. If the first card you turn over is an ace, then the WeEXt card is less 
likely to be an ace because you're not replacing the top card in the deck. If the first 


card isn’t an ace, the second card is more likely to be an ace. 


This ts not a binomial setting because the independent condition is not met. 


Check the BINS! A trial consists of turning over a card from the deck and 
observing what’s on the card. 


To check for independence, you could also write P(2nd card ace | 1st card ace) = 
3/51 and P(2nd card ace | ist card not ace) = 4/51 Because the two probabilities 
are not equal, the trials are not independent. 


Cc ¢ Binary? ESticecss = get an ace. “Failure” = don’t get an ace. 


° Independent? Yes. Because you are replacing the card in the deck and shuffling each 
time, the result of one trial doesn’t tell you anything about the outcome of any other 
trial. 


* Number? No. The number of trials is not ‘fixed in advance. 


Because there is no fixed number of trials, this is not a binomial setting. 


Check the BINS! A trial consists of turning over a card from the shuffled deck of 52 
cards and observing what’s on the card. 


There’s another clue that this is not a binomial setting: you’re counting the number 
of trials to get a Success and not the number of successes in a fixed number of 
trials. 


FOR PRACTICE, TRY EXERCISE 77 


The Independent condition involves conditional probabilities. In part (b) of the example, 
P(2nd card ace | 1st card ace) = 3/51 # P(2nd card ace | 1st card not ace) = 4/51 


so the trials are not independent. The Same probability of success condition is about 
unconditional probabilities. Because 


P(kth card in a shuffled deck is an ace)=4 /52P(kth card in a shuffled deck is an ace) = 4/52 
this condition is met in part (b) of the example. Be sure you understand the difference between 
these two conditions. When sampling is done without replacement, the Independent condition 
is violated. 

The blood type scenario in part (a) of the example is a binomial setting. If we let X = the 


number of children with type O blood, then X is a binomial random variable. The probability 
distribution of X is called a binomial distribution. 


DEFINITION Binomial random variable, Binomial distribution 


The count of successes X in a binomial setting is a binomial random variable. 
The possible values of X are 0, 1, 2, ..., n. 

The probability distribution of X is a binomial distribution. Any binomial 
distribution is completely specified by two numbers: the number of trials n of the 
chance process and the probability p of success on each trial. 


In the “Pop quiz” activity at the beginning of the lesson, X = the number of correct answers 
is a binomial random variable with n = 10 and p = 0.2. 


CHECK YOUR UNDERSTANDING 


( 


V For each of the following situations, determine whether or not the given random variable 
has a binomial distribution. Justify your answer. 


1. Shuffle a deck of cards. Turn over the top card. Put the card back in the deck, and shuffle 
again. Repeat this process 10 times. Let X = the number of aces you observe. 


2. Choose 5 students at random from your class. Let Y = the number who are over 6 feet tall. 


3. Rolla fair die 100 times. Sometime during the 100 rolls, one corner of the die chips off. 
Let W = the number of 5s you roll. 


Calculating Binomial Probabilities 


How can we calculate probabilities involving binomial random variables? Let’s return to the 
scenario from part (a) of the preceding example: 


Genetics says that the genes children receive from their parents are independent from one 
child to another. Each child of a particular set of parents has probability 0.25 of having type 
O blood. Suppose these parents have 5 children. Count the number of children with type O 
blood. 


In this binomial setting, a child with type O blood is a “success” (S) and a child with another 
blood type is a “failure” (F). The count X of children with type O blood is a binomial random 
variable with n = 5 trials and probability p = 0.25 of success on each trial. 


e What’s P(X = 0)? That is, what’s the probability that none of the 5 children has type O 
blood? The probability that any one of this couple’s children doesn’t have type O blood is 1 
— 0.25 = 0.75 (complement rule). By the multiplication rule for independent events (Section 
5.3), 


P(X = 0) = P(FFFFF) = (0.75)(0.75)(0.75)(0.75)(0.75) = (0.75)5 = 0.2373 


P(X = 0) = P(FFFFF) = (0.75) (0.75) (0.75) (0.75) (0.75) = (0.75)° = 0.2373 


e How about P(X = 1)? There are several different ways in which exactly 1 of the 5 children 
could have type O blood. For instance, the first child born might have type O blood, while 
the remaining 4 children don’t have type O blood. The probability that this happens is 


P(SFFFF)=(0.25)(0.75)(0.75)(0.75)(0.75)=(0.25)1(0.75)4 
P (SFFFF) = (0.25) (0.75) (0.75) (0.75) (0.75) = (0.25)'(0.75)* 


Alternatively, Child 2 could be the one that has type O blood. The corresponding probability is 


P(FSFFF)=(0.75)(0.25)(0.75)(0.75)(0.75)=(0.25)1(0.75)4 
P (FSFFF) = (0.75) (0.25) (0.75) (0.75) (0.75) = (0.25)' (0.75)* 


There are three more possibilities to consider—those in which Child 3, Child 4, and Child 5 are 
the only ones to inherit type O blood. Of course, the probability will be the same for each of 
those cases. In all, there are five different ways in which exactly 1 child would have type O 
blood, each with the same probability of occurring. As a result, 


P(X=1)=P(exactly 1 child with type O blood)?(* = 1) = P(exactly 1 child with type O blood) 
__ = 5(0.25)1(0.75)4 = 0.3955 
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The pattern of this calculation works for any binomial probability: 


P(X = k) = ( of ways to get k successes in n trials) (success probability) (failure 
probability)” * 


To use this formula, we must count the number of arrangements of k successes in n trials. This 
number is called the binomial coefficient. We use the following fact to do the counting 
without actually listing all the arrangements. 


DEFINITION Binomial coefficient 


The number of ways to arrange k successes among n trials is given by the 
binomial coefficient 


n n! 
(nk)=n!k!(n-k)! ( k ) ~ Ri(n—B)! 
fork =0,1,2,...,n where n! (read as "n factorial") is given by 
n !=n(n-1)(n-2)--+- -(3)(2)(1)"! = (7m — 1)(m — 2) +++ + (3) (2) 1) 
and O! = 1. 


The larger of the two factorials in the denominator of a binomial coefficient will cancel 


much of the n! in the numerator. For example, the binomial coefficient we need to find the 
probability that exactly 2 of the couple’s 5 children inherit type O blood is 


(62)=51213!=(6)(4)(8)(2VDAUB)2(1)=G)(4)2)(1)=10 
(5) 8 (M(Z)(Z)(Z) wa) 


= = 


8! aan(Z)(zZ)(Z) A) 


3) 

Oo The binomial coefficient (52) ( >) is not related to the fraction 522, A helpful way 
to remember its meaning is to read it as “S choose 2”—as in, how many ways are there to 
choose which 2 children have type O blood in a family with 5 children? Binomial coefficients 
have many uses, but we are interested in them only as an aid to finding binomial probabilities. 
If you need to compute a binomial coefficient, use your calculator. 


>) 
Some people prefer the notation ;C> instead of (52) ( 2/ for the binomial 


13. Technology Corner |CALCULATING BINOMIAL 
COEFFICIENTS (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


>) 
To calculate a binomial coefficient like (52) ( 2/ on the TI-83/84, proceed as follows: 


e Type 5, press |MATH\, arrow over to PROB, choose nCr, and press |ENTER|. Then type 2 


and press again to execute the command 5 nCr 2 (which displays as 5C> on 


devices with pretty print). 


NORMAL FLOAT AUTO REAL RADIAN MP fl 
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The binomial coefficient inh k counts the number of different ways in which k 
successes can be arranged among n trials. The binomial probability P(X = k) is this count 
multiplied by the probability of any one specific arrangement of the k successes. 


BINOMIAL PROBABILITY FORMULA 


Suppose that X is a binomial random variable with n trials and probability p of success on 
each trial. The probability of getting exactly k successes in n trials (k = 0, 1, 2, ..., n) is 


P(x=8)= (7) paz)" 
P(X=k)=(nk)pk(1-p)n-k k 


(nk)=n!k!(n-k)! ( k 


With our formula in hand, we can now calculate any binomial probability. 


EXAMPLE | Inheriting blood type (>) 
Calculating a binomial probability 


PROBLEM: Genetics says that the genes children receive from their parents are 
independent from one child to another. Each child of a particular set of parents has 
probability 0.25 of having type O blood. Suppose these parents have 5 children. Let X = the 
number of children with type O blood. Find P(X = 3). Interpret this value. 


SOLUTION: 
X is a binomial random variable with n = 5 and p = 0.25. 
P(X=3) = (8) (0.25)*(0.75)’ 
= 10 (0.25)*(0.75)? 
P(X = 3)=(35) (0.25)3(0.75)2=10 (0.25)3(0.75)2=0.08789 = 0.08789 
There is about a 9% probability that exactly 2 of the 5 children have type O blood. 


P(X 
P(X=k)=(nk)pk(1-p)n-k 


FOR PRACTICE, TRY EXERCISE 81 


There are times when we want to calculate a probability involving more than one value of a 
binomial random variable. As the following example illustrates, we can just use the binomial 
probability formula for each value of interest. 


EXAMPLE | Inheriting blood type (>) 
Calculating a cumulative binomial probability 


PROBLEM: The preceding example tells us that each child of a particular set of parents has 
probability 0.25 of having type O blood. Suppose these parents have 5 children. Should the 
parents be surprised if more than 3 of their children have type O blood? Calculate an 
appropriate probability to support your answer. 


SOLUTION: 


Let X = the number of children with type O blood. X has a binomial distribution with n = 
5 and p = 0.25. 
P(X>3)=P(X=4)+P(X=5)=(45)(0.25)4(0.75)1+(55) 
(0.25)5(0.75)0=5 (0.25)4 (0.75)1 + 1(0.25)5 (0.75)0=0.01465+0.00098=0.01563 
P(X >3) = P(X =4)+ P(X =5) 

= (5) (0.25)*(0.75)' + (2) (0.25)° (0.75)° 

= 5 (0.25)* (0.75)' + 1(0.25)° (0.75)° 

= 0.01465 + 0.00098 

= 0.01563 


Because there’s only about a 1.5% probability of having more than 3 children with type O 
blood, the parents should definitely be surprised if this happens. 


FOR PRACTICE, TRY EXERCISE 85 
We can also use the calculator’s binompdf and binomcdf commands to perform the 


calculations in the previous two examples. The following Technology Corner shows how to do 
it. 


14. Technology Corner |CALCULATING BINOMIAL 


PROBABILITIES \») 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


There are two handy commands on the TI-83/84 for finding binomial probabilities: 
binompdf and binomcdf. The inputs for both commands are the number of trials n, the 
success probability p, and the values of interest for the binomial random variable X. 
binompdf (n,p,k) computesP(X=k)binomcdf (n,p,k) computesP(X<k) 
binompdf (n,p,k) computesP(X=k) 
binomedf (n,p,k) computesP(X < k) 


Let’s use these commands to confirm our answers in the previous two examples. 
1. Find P(X = 3). 


e Press (DISTR) and choose binompdf(. 
OS 2.55 or later: In the dialog box, enter these values: trials:5, p:0.25, x value:3, 
choose Paste, and then press [ENTER], 


Older OS: Complete the command binompdf (5,0.25,3) and press [ENTER]. 


These results agree with our previous answer using the binomial probability formula: 


NORMAL FLOAT AUTO REAL RADIAN MP nl 


binompdf(S,.25,3) 
@.987890625 


0.08789. 


2. Should the parents be surprised if more than 3 of their children have type O blood? To 
find P(X = 3), use the complement rule: 


P(X>3)=1-P(X<3)=1-binomcdf(5,0.25,3) 
P(X > 3) =1-— P(X < 3) = 1 — binomcdf (5,0.25,3) 


e Press (DISTR) and choose binomcdf(. 


OS 2.55 or later: In the dialog box, enter these values: trials:5, p:0.25, x value:3, 
choose Paste, and then press [ENTER|. Subtract this result from 1 to get the answer. 
Older OS: Complete the command binomcdf(5,0.25,3) and press ENTER]. Subtract 
this result from 1 to get the answer. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


binomcdf(5,.25,3) 
@.984375 


This result agrees with our previous answer using the binomial probability formula: 
0.01563. 


We could also have done the calculation for part (b) as P(X = 3) = P(X = 4) + P(X =5)= 
binompdf(5,0.25,4) + binompdf(5,0.25,5) = 0.01465 + 0.00098 = 0.01563. 


Note the use of the complement rule to find P(X = 3) in the Technology Corner: P(X = 3) = 
1 - P(X < 3). This is necessary because the calculator’s binomcdf(n,p,k) command only 
computes the probability of getting k or fewer successes in n trials. Students often have trouble 
identifying the correct third input for the binomcdf command when a question asks them to find 
the probability of getting less than, more than, or at least so many successes. 

Here’s a helpful tip to avoid making such a mistake: write out the possible values of the 
variable, circle the ones you want to find the probability of, and cross out the rest. In the 
preceding example, X can take values from 0 to 5 and we want to find P(X = 3): 


OA2 24 5 


Crossing out the values from 0 to 3 shows why the correct calculation is 1—P(X<3). 
1— P(X < 3) 


AP® EXAM TIP 


Don’t rely on “calculator speak” when showing your work on free response 
questions. Writing binompdf(5,0.25,3) = 0.08789 will not earn you full credit for a 
binomial probability calculation. At the very least, you must indicate what each of 
those calculator inputs represents. For example, “binompdf(trials:5,p:0.25,x value:3) 
= 0.08789.” 


Take another look at the solutions in the two blood-type examples. The structure is much 
like the one we used when doing Normal calculations. Here is a summary box that describes 
the process. 


HOW TO FIND BINOMIAL PROBABILITIES 


Step 1: State the distribution and the values of interest. Specify a binomial distribution 
with the number of trials n, success probability p, and the values of the variable clearly 
identified. 

Step 2: Perform calculations—show your work! Do one of the following: 

i. Use the binomial probability formula to find the desired probability; or 
ii. Use the binompdf or binomcdf command and label each of the inputs. 


Be sure to answer the question that was asked. 


Here’s an example that shows the method at work. 


EXAMPLE |Free lunch? (>) 
Calculating binomial probabilities 


< 


a: 


eldadcarin/Getty Images 


PROBLEM: A local fast-food restaurant is running a “Draw a three, get it free” lunch 

promotion. After each customer orders, a touchscreen display shows the message “Press here 

to win a free lunch.” A computer program then simulates one card being drawn from a 

standard deck. If the chosen card is a 3, the customer’s order is free. Otherwise, the customer 

must pay the bill. 

a. On the first day of the promotion, 250 customers place lunch orders. Find the probability 
that fewer than 10 of them win a free lunch. 

b. In fact, only 9 customers won a free lunch. Does this result give convincing evidence that 


the computer program is flawed? 


SOLUTION: 


Let Y = the number of customers who win a free lunch. Y has a binomial distribution 
with n = 250 and p = 4/52. 
P (Y < 10)=P (Ys9)= binomcdf (trials: 250, p:4/52, x value: 9)= 0.00613 
P(Y <10)=P(Y <9) 
= binomedf (trials: 250, p:4/52, x value: 9) 
= 0.00613 


a. 


b. There ts only a 0.006 probability that fewer than 10 customers would win a free funch 
if the computer program 1S working properly. Because only 9 customers won a free 
funch on this day, we have convincing evidence that the computer program 1S flawed. 


Step 1: State the distribution and the values of interest. 


The values of Y that interest us are 
On 62 9845 67'S 19 a0 ea 250 


Step 2: Perform calculations—show your work! 


Use the binomial probability formula to find the desired probability; or 
use the binompdf or binomcdf command and label each of the inputs. 
To use the binomial formula, you would have to add the probabilities for Y = 0, 1, ... , 


9. That’s too much work! 
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CHECK YOUR UNDERSTANDING 


Ny 


J 

To introduce his class to binomial distributions, Mr. Miller does the “Pop quiz” activity 

at the beginning of this section (page 403). Each student in the class guesses an answer from 

A through E on each of the 10 multiple-choice questions. Mr. Miller determines the 

“correct” answer for each of the 10 questions randomly so that A, B, C, D, or E was equally 

likely to be chosen. Hannah is one of the students in this class. Let X = the number of 

questions that Hannah answers correctly. 

1. What probability distribution does X have? Justify your answer. 

2. Use the binomial probability formula to find P(X = 3). Interpret this result. 

3. To get a passing score on the quiz, a student must answer at least 6 questions correctly. 
Would you be surprised if Hannah earned a passing score? Calculate an appropriate 


probability to support your answer. 


Describing a Binomial Distribution: Shape, Center, and Variability 


What does the probability distribution of a binomial random variable look like? The table 
shows the possible values and corresponding probabilities for X = the number of children with 
type O blood from two previous examples. This is a binomial random variable with n = 5 and p 


= 0.25. 


Value x ; 0 1 2 3 4 5 
Probability p; 0.23730 0.39551 0.26367 0.08789 0.01465 0.00098 


Figure 6.7 shows a histogram of the probability distribution. This binomial distribution with 
n=5 and p = 0.25 has a clear right-skewed shape. Why? Because the probability that any one 
of the couple’s children inherits type O blood is 0.25, it’s quite likely that 0, 1, or 2 of the 
children will have type O blood. Larger values of X are much less likely. 

0.45 
0.40 
0.35 
0.30 
0.25 
0.20 
0.15 
0.10 
0.05 


0.00 
0 1 2 3 a 5 


X = Number of children with type O blood 
FIGURE 6.7 Histogram showing the probability distribution of the binomial random variable X = 
number of children with type O blood in a family with 5 children. 


Probability 


You can use technology to graph a binomial probability distribution like the one shown in 
Figure 6.7. 


15. Technology Corner |GRAPHING BINOMIAL PROBABILITY 
DISTRIBUTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


To graph the binomial probability distribution for n = 5 and p = 0.25: 


e Type the possible values of the random variable X into list Lj: 0, 1, 2, 3, 4, and 5. 

e Highlight L> with your cursor. Enter the command binompdf(5,0.25) and press [ENTER|. 

e Make a histogram of the probability distribution using the method shown in Technology 
Comer 12 (page 370). 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Plotl:LasL2 


Lav=, 2373046875 


The binomial distribution with n = 5 and p = 0.25 is skewed to the right. Figure 6.8 shows 
two more binomial distributions with different shapes. The binomial distribution with n = 5 and 
p = 0.51 in Figure 6.8(a) is roughly symmetric. The binomial distribution with n = 5 and p = 
0.8 in Figure 6.8(b) is skewed to the left. In general, when n is small, the probability 
distribution of a binomial random variable will be roughly symmetric if p is close to 0.5, right- 
skewed if p is much less than 0.5, and left-skewed if p is much greater than 0.5. 

n=5,p=0.51 n=5,p=0.8 
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FIGURE 6.8 (a) Probability histogram for the binomial random variable X with n = 5 and p = 
0.51. This binomial distribution is roughly symmetric. (b) Probability histogram for the binomial 
random variable X with n = 5 and p = 0.8. This binomial distribution has a left-skewed shape. 


EXAMPLE | Bottled water versus tap water (>) 
Describing a binomial distribution 


PROBLEM: Mr. Hogarth’s AP® Statistics class did the activity on page 360. There were 21 
students in the class. If we assume that the students in his class could not tell tap water from 
bottled water, then each one is guessing, with a 1/3 probability of being correct. Let X = the 
number of students who correctly identify the cup containing bottled water. Here is a 
histogram of the probability distribution of X: 


a. What probability distribution does X have? Justify your answer. 


b. Describe the shape of the probability distribution. 
0.20 


o 
_ 
a) 


Probability 
oS 
S 


> 
i) 
a 


0.00 
0 2 4 6 8 10 12 14 16 18 20 
R = Number of passengers who get a red light 


SOLUTION: 


a. A trial consists of a student in the class trying to guess which of three cups contained 
bottled water. 


Check the BINS! 


. Binary? Success = correct guess; failure = incorrect guess 


< Independent? ‘Knowing whether one student guessed. correctly does not help us predict 
whether another student guessed. correctly. 


¢ Number? n = 21 


* Same probability? p= 1/3 


xX has a binomial distribution with n = 21 and pe VES 


b. The probability distribution of X looks roughly symmetric with a single peak Atk = 


You could also say that the graph is slightly right-skewed due to the long tail that 


extends out to X = 21. 
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MEAN AND STANDARD DEVIATION OF A BINOMIAL RANDOM VARIABLE The 
random variable X = the number of children with type O blood from the previous two examples 
has a binomial distribution with n = 5 and p = 0.25. Its probability distribution is shown in the 
table. 


Value x; 0 At 2 3 4 5 
Probability p; 0.23730 0.39551 0.26367 0.08789 0.01465 0.00098 


Because X is a discrete random variable, we can calculate its mean using the formula 
UX=E(X)=>; xipi= x1p1+x2p2+x3p3+...4* SE = dL aP SR TEE TEBE Te 


from Section 6.1. We get 


UX =(0)(0.23730)+(1)(0.39551)+---+(5)(0.00098)=1.25 
ux = (0) (0.23730) + (1) (0.39551) + - + (5) (0.00098) = 1.25 


So the expected number of children with type O blood in families like this one with 5 children 
is 1.25. 
Did you think about why the mean is pty = 1.25? Because each child has a 0.25 chance of 


inheriting type O blood, we’d expect one-fourth of the 5 children to have this blood type. In 
other words, 


pX=5(0.25)=1.254x = 5(0.25) = 1.25 
This method can be used to find the mean of any binomial random variable. 
MEAN (EXPEXCTED VALUE) OF A BINOMIAL RANDOM VARIABLE 


If a count X of successes has a binomial distribution with n\umber of trials n and probability 
of success p, the mean (expected value) of X is 


UX=E(X)=np#x = 2 (X) = np 


To calculate the standard deviation of X, we start by finding the variance. 


oX2=Y (xi-.X)2pi=(0-1.25)2(0.23730)+(1-1.25)2(0.39551)+--:+(5-1.25)2(0.00098)=0.9375 


7 = (aj — ex) Pi 
= (0 — 1.25)? (0.23730) + (1 — 1.25)? (0.39551) + --- + (5 — 1.25)? (0.00098) 
= 0.9375 


So the standard deviation of X is 


oX=0.9375=0.9687x = V0.9375 = 0.968 


The number of children with type O blood will typically vary by about 0.968 from the mean of 
1.25 in families like this one with 5 children. 

There is a simple formula for the standard deviation of a binomial random variable, but it 
isn’t easy to explain (see the Think About It on page 417). For our family with n = 5 children 
and p = 0.25 of type O blood, the variance of X is 


5(0.25)(0.75)=0.9375° (0.25) (0.75) = 0.9375 


To get the standard deviation, we just take the square root: 


o6X=5(0.25)(0.75)=0.9375=0.9687% = v5 (0-25) (0.75) = v0.9375 = 0.968 


This method works for any binomial random variable. 


STANDARD DEVIATION OF A BINOMIAL RANDOM VARIABLE 


If a count X of successes has a binomial distribution with number of trials n and probability 
of success p, the standard deviation of X is 


oX=np(1—-p)?* — rp (1 7 p) 


og Remember that these formulas for the mean and standard deviation work only for 


binomial distributions. 


EXAMPLE | Bottled water versus tap water |») 
Describing a binomial distribution 


PROBLEM: Assume that each of the 21 students in Mr. Hogarth’s AP® Statistics class who 
did the bottled water versus tap water activity was just guessing, so there was a 1/3 chance of 
each student identifying the cup containing bottled water correctly. Let X = the number of 
students who make a correct identification. At right is a histogram of the probability 
distribution of X. 


a. Calculate and interpret the mean of X. 


b. Calculate and interpret the standard deviation of X. 


- _ 
en oe 
ae 
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0,2 


Probability 
— 


0.0 
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X = Number of correct guesses 


SOLUTION: 
a. Py = np = MV) = 7 


af all the students in Mr. Hogarth’s class were just guessing and repeated the activity 
many times, the average number of students who guess correctly would be about re 


oX = np(1 -p) = 21(1/3)(2/3) = 2.167X = /np (1 — p) = ./21 (1/3) (2/3) = 2.16 


b. afr all the students in Mr. Hogarth’s class were just guessing and repeated the activity 


many times, the number of students who guess correctly would typically vary by about 
2.16 from the mean of Ta 


FOR PRACTICE, TRY EXERCISE 95 


Of the 21 students in Mr. Hogarth’s class, 13 made correct identifications. Are you 
convinced that some of Mr. Hogarth’s students could tell bottled water from tap water? The 
class’s result corresponds to X = 13, a value that’s nearly 3 standard deviations above the mean. 
How likely is it that 13 or more of Mr. Hogarth’s students would guess correctly? It’s 

P(X213)=1-P(X<12)=1-binomcdf(trials:21, p:1/3, xvalue:12)=1—0.9932=0.0068 
P(X >13) = 1-—P(X < 12) 
= 1-—binomedf(trials:21, p:1/3, xvalue:12) 
= 1-— 0.9932 
= 0.0068 


The students had less than a 1% chance of getting so many correct identifications if they were 


all just guessing. This result gives convincing evidence that some of the students in the class 
could tell bottled water from tap water. 


Think About It 


WHERE DO THE BINOMIAL MEAN AND VARIANCE FORMULAS COME FROM? 
We can derive the formulas for the mean and variance of a binomial distribution using what 
we learned about combining random variables in Section 6.2. Let’s start with the random 
variable B that’s described by the following probability distribution. 


Value b; 0 
Probability p; 1-p p 


You can think of B as representing the result of a single trial of some chance process. If a 
success occurs (probability p), then B = 1. If a failure occurs, then B = 0. Notice that the 
mean of B is 


uB=y, bipi=(0)(1-p)+(1)(p)=p"” = >? bip; = (0) (1—p)+ (1) (p) =p 
and that the variance of B is 


oB2=), (bi pB)2pi-(0-p)2(1-p)+(1-p)2p=p2(1-p)+(1-p)2p=p(-p)l pt(1-p) I-p(-p) 
o% = >> (bi —He)*pi = (0—p)’ (1—p) + (1—p)"p 
= p’(1—p)+(1—p)p 
p(1—p) [p+ (1—p)| 
= p(1—p) 


Now consider the random variable X = B; + By +... + B,. We can think of X as counting 


the number of successes in n independent trials of this chance process, with each trial having 
success probability p. In other words, X is a binomial random variable. By the rules from 
Section 6.2, the mean of X is 


uX=B1+pB2+...+pBn=ptp+...+p=nphX = Bap F Aa hse TMB = BOP hoe B= Dp 
and the variance of X is 


0X2=0B12+0B22+...+oBn2=p(1-p)+p(1-p)+...+p(1-p)=np(1-p) 


% = om, tom, +... +05 
= p(l—p)+p(1—p)+...+p(1—p) 
= np(1—p) 


The standard deviation of X is therefore 


oX=np(1-p)?* = Jf np (1 a 4) 


CHECK YOUR UNDERSTANDING 


. 


”, To introduce his class to binomial distributions, Mr. Miller does the “Pop quiz” activity 
at the beginning of this section (page 403). Each student in the class guesses an answer from 
A through E on each of the 10 multiple-choice questions. Mr. Miller determines the 
“correct” answer for each of the 10 questions randomly so that A, B, C, D, or E was equally 
likely to be chosen. Hannah is one of the students in this class. Let Y = the number of 
questions that Hannah answers incorrectly. 


1. Use technology to make a histogram of the probability distribution of Y. Describe its 
shape. 

2. Calculate and interpret the mean of Y. 

3. Calculate and interpret the standard deviation of Y. 


4. On page 412, we defined X = the number of correct answers that Hannah got on the quiz. 
How do the shape, center, and variability of the probability distribution of X compare to 
your answers for Questions 1 to 3? 


Binomial Distributions in Statistical Sampling 


The binomial distributions are important in statistics when we wish to make inferences about 
the proportion p of successes in a population. For instance, suppose that a supplier inspects a 
random sample of 10 flash drives from a shipment of 10,000 flash drives in which 200 are 
defective (bad). Let X = the number of bad flash drives in the sample. 


fatamorgana-999/Shutterstock.com 


This is not quite a binomial setting. Because we are sampling without replacement, the 
independence condition is violated. The conditional probability that the second flash drive 
chosen is bad changes when we know whether the first is good or bad: P(second is bad | first is 
good) = 200/9999 = 0.0200 but P(second is bad | first is bad) = 199/9999 = 0.0199. These 
probabilities are very close because removing 1 flash drive from a shipment of 10,000 changes 


the makeup of the remaining 9999 flash drives very little. The distribution of X is very close to 
the binomial distribution with n = 10 and p = 0.02. 


To illustrate this, let’s compute the probability that none of the 10 flash drives is defective. 
Using the binomial distribution, it’s 


P(X =0)= ie (0.02)°(0.98)'° = 0.8171 
P(X=0)=(100)(0.02)0(0.98)10=0.8171 0 


The actual probability of getting no defective flash drives is 


P(no defectives)=980010,000x97999999x97989798x: --x97919991=0.8170 


| 9800 9799 9798 9791 
P tn Gaiters ees ge ee EI 
(no defectives) => 99 * 9999 “ 708 < “9901 


Those two probabilities are quite close! 


Almost all real-world sampling, such as taking an SRS from a population of interest, is 
done without replacement. As the preceding example illustrates, sampling without replacement 
leads to a violation of the Independent condition. 


However, the flash drives context shows how we can use binomial distributions in the 
Statistical setting of selecting a random sample. When the population is much larger than the 
sample, a count of successes in an SRS of size n has approximately the binomial distribution 
with n equal to the sample size and p equal to the proportion of successes in the population. 
What counts as “much larger’? In practice, the binomial distribution gives a good 
approximation as long as we sample less than 10% of the population. We refer to this as the 
10% condition. 


DEFINITION 10% condition 


When taking a random sample of size n from a population of size N, we can use a 
binomial distribution to model the count of successes in the sample as long as n < 
0.10N. 


Here’s a scenario that shows why it’s important to check the 10% condition before 
calculating a binomial probability. You might recognize the setting from the first activity in the 
book (page 6). 


An airline has just finished training 25 pilots—15 male and 10 female—to become 
captains. Unfortunately, only 8 captain positions are available right now. Airline 
managers announce that they will use a lottery to determine which pilots will fill the 
available positions. One day later, managers reveal the results of the lottery: Of the 8 
captains chosen, 5 are female and 3 are male. Some of the male pilots who weren’t 
selected suspect that the lottery was not carried out fairly. 


What’s the probability of choosing 5 female pilots in a fair lottery? Let X = the number of 
female pilots selected in a random sample of size n = 8 from the population of N = 25 pilots. 


Notice that the sample size is almost 1/3 of the population size. If we ignore this fact and use a 
binomial probability calculation, we get 


P(X 
P(X=5)=(85)(0.40)5(0.60)3=0.124 


= 


5) = CO (0.40)° (0.60)* = 0.124 


The correct probability, however, is 0.106. You can see that the binomial probability is off by 
about 17% (0.018/0.106) from the correct answer. 


EXAMPLE | Teens and debit cards (>) 
Binomial distributions and sampling 


PROBLEM: Ina survey of 500 U.S. teenagers aged 14 to 18, subjects were asked a variety 
of questions about personal finance.2 One question asked whether teens had a debit card. 
Suppose that exactly 12% of teens aged 14 to 18 have debit cards. Let X = the number of 
teens in a random sample of size 500 who have a debit card. 


a. Explain why X can be modeled by a binomial distribution even though the sample was 
selected without replacement. 


b. Use a binomial distribution to estimate the probability that 50 or fewer teens in the sample 
have debit cards. 


SOLUTION: 


a. 500 ts less than 10% of all U.S. teenagers aged 14 to 18. 


Check the 10% condition: n < 0.10N 


b. X is approximately binomial with n = 500 and a, 


p(X<50) = binomcdf(trials:500, p:0.12, x value: 50)= 0.0932 
p(X <50) = binomcedf(trials : 500, p: 0.12, 2 value: 50) 


= 0.0932 
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The Normal Approximation to Binomial Distributions! 
prPLe> 


9 As you Saw earlier, the shape of a binomial distribution can be skewed to the right, 
skewed to the left, or roughly symmetric. Something interesting happens to the shape as the 


number of trials n increases. You can investigate the relationship between n and p yourself 
using the Normal Approximation to Binomial Distributions applet at the book’s website, 
highschool.bfwpub.com/tps6e. 

Figure 6.9 shows histograms of binomial distributions for different values of n and p. As 
the number of observations n becomes larger, the binomial distribution gets close to a Normal 
distribution. 
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FIGURE 6.9 Histograms of binomial distributions with (a) n = 10 and p = 0.8, (b) n = 20 and p = 
0.8, and (c) n= 50 and p = 0.8. As rn increases, the shape of the probability distribution gets 
closer and closer to Normal. 


When n is large, we can use Normal probability calculations to approximate binomial 
probabilities. To see if n is large enough, check the Large Counts condition. 


DEFINITION Large Counts condition 


Suppose that a count X of successes has the binomial distribution with n trials and 
success probability p. The Large Counts condition says that the probability 
distribution of X is approximately Normal if 


np>10andn(1-p)>10"? >10 and n(l—p)>10 


That is, the expected numbers (counts) of successes and failures are both at least 
10. 


This condition is called “large counts” because np is the expected (mean) count of 
successes and n(1 — p) is the expected (mean) count of failures in a binomial setting. Why do 
we require that both these values be at least 10? Look back at Figure 6.9. It is clear that a 
Normal distribution does not approximate the probability distributions in parts (a) or (b) very 
well. This isn’t surprising because the Large Counts condition is not met in either case. For 
graph (a), both np = 10(0.8) = 8 and n(1 — p) = 10(0.2) = 2 are less than 10. For graph (b), np = 
20(0.8) = 16 = 10, but n(1 — p) = 20(0.2) = 4 is less than 10. The Normal curve in graph (c) 
appears to model the binomial probability distribution well. This time, the Large Counts 
condition is met: np = 50(0.8) = 40 = 10 and n(1 — p) = 50(0.2) = 10 = 10. 

The accuracy of the Normal approximation improves as the sample size n increases. It is 
most accurate for any fixed n when p is close to 1/2 and least accurate when p is near 0 or 1. 
This is why the Large Counts condition depends on p as well as n. 


EXAMPLE | Teens and debit cards (>) 
Normal approximation to a binomial distribution 


PROBLEM: Ina survey of 500 U.S. teenagers aged 14 to 18, subjects were asked a variety 
of questions about personal finance. One question asked whether teens had a debit card. 
Suppose that exactly 12% of teens aged 14 to 18 have debit cards. Let X = the number of 
teens in a random sample of size 500 who have a debit card. 


a. Justify why X can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 50 or fewer teens in the sample 
have debit cards. 


SOLUTION: 


a. X is approximately binomial with n = 500 and p = 0.12. Because np = 500(0.12) = 60 = 
10 and n(1 - p = 500(0.88) = 440 = 10, we can approximate X with a Normal 
distribution. 

b. wy = np = 500(0.12) = 60 and oX = np(1 - p) = 500(0.12)(0.88) = 7.266 
ox = \/np(1 — p) = ./500 (0.12) (0.88) = 7.266 


Start by calculating the mean and standard deviation of the binomial random variable 


Xx. 


| \ 
38202 45,468 P 52.734 60 67.266 4,532 81,758 


X = Number of teens with a debit card 


1. Draw a Normal distribution. 
2. Perform calculations—show your work! 


(i) Standardize and use Table A or technology; or 
(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


50—60 
4 z= —— = —- 1.38 
1. Z=50 — 607.266 = —1.38 7.266 


Using Table A: 0.0838 
Using technology: normalcdf(lower: —1000, upper: —1.38, mean:o, SD:1) = 0.0838 


i. normalcdf(lower:o, upper:50, mean:60, S‘D:7.2 66) = 0.0844 


P(X <50)5P (z<-1.38)P (X < 50)5P(Z <- 1.38) 
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The probability from the Normal approximation in this example, 0.0844, misses the exact 
binomial probability of 0.0932 from the example by about 0.0088. 


Geometric Random Variables 


In a binomial setting, the number of trials n is fixed in advance, and the binomial random 
variable X counts the number of successes. The possible values of X are 0, 1, 2, ... , n. In other 
situations, the goal is to repeat a chance process until a success occurs: 


e Roll a pair of dice until you get doubles. 
e In basketball, attempt a 3-point shot until you make one. 


e Keep placing a $1 bet on the number 15 in roulette until you win. 


These are all examples of a geometric setting. 


DEFINITION Geometric setting 


A geometric setting arises when we perform independent trials of the same 
chance process and record the number of trials it takes to get one success. On 
each trial, the probability p of success must be the same. 


Here’s an activity your class can try that involves a geometric setting. 


ACTIVITY Is this your lucky day? 


Your teacher is planning to give you 10 problems for homework. As an alternative, you can 
agree to play the Lucky Day game. Here’s how it works. A student will be selected at 
random from your class and asked to pick a day of the week (e.g., Thursday). Then your 
teacher will use technology to randomly choose a day of the week as the “lucky day.” If the 
student picks the correct day, the class will have only one homework problem. If the student 


picks the wrong day, your teacher will select another student from the class at random. The 
chosen student will pick a day of the week and your teacher will use technology to choose a 
“lucky day.” If this student gets it right, the class will have two homework problems. The 
game continues until a student correctly picks the lucky day. Your teacher will assign a 
number of homework problems that is equal to the total number of picks made by members 
of your class. Are you ready to play the Lucky Day game? 


1. Decide as a class whether to “gamble” on the number of homework problems you will 
receive. You have 30 seconds. 


2. Play the Lucky Day game and see what happens! 


In a geometric setting, if we define the random variable Y to be the number of trials needed 
to get the first success, then Y is called a geometric random variable. The probability 
distribution of Y is a geometric distribution. 


DEFINITION Geometric random variable, Geometric distribution 


The number of trials Y that it takes to get a success in a geometric setting isa 
geometric random variable. The probability distribution of Y is a geometric 
distribution with probability p of success on any trial. The possible values of Y 
are 1, 2, 3, .... 


As with binomial random variables, it’s important to be able to distinguish situations in 
which a geometric distribution does and doesn’t apply. Let’s consider the Lucky Day game. 
The random variable of interest in this game is Y = the number of picks it takes to correctly 
match the lucky day. Each pick is one trial of the chance process. Knowing the result of one 
student’s pick tells us nothing about the result of any other pick. On each trial, the probability 
of a correct pick is 1/7. This is a geometric setting. Because Y counts the number of trials to get 
the first success, it is a geometric random variable with p = 1/7. 

What is the probability that the first student picks correctly and wins the Lucky Day game? 
It’s P(Y = 1) = 1/7. That’s also the class’s chance of having only one homework problem 
assigned. For the class to have two homework problems assigned, the first student selected 
must pick an incorrect day of the week and the second student must pick the lucky day 
correctly. The probability that this happens is 


P(Y=2)=(6/7)(1/7)=0.1224P(¥ = 2) = (6/7) (1/7) = 0.1224 
Likewise, 
P(Y=3)=(6/7)(6/7)(1/7)=0.1050P(¥ = 3) = (6/7) (6/7) (1/7) = 0.1050 


In general, the probability that the first correct pick comes on the kth trial is 


P(Y=k)=(6/7)k-1(1/7)P(Y = k) = (6/7)"* (2/7) 


Let’s summarize what we’ve learned about calculating a geometric probability. 


GEOMETRIC PROBABILITY FORMULA 


If Y has the geometric distribution with probability p of success on each trial, the possible 


values of Y are 1, 2, 3, ... . If k is any one of these values, 


P(Y=k)=(1-p)k-1p? (Y = &) = (1—p)*'p 


With the geometric probability formula in hand, we can now compute any geometric 
probability. 


EXAMPLE |The Lucky Day game (>) 
Calculating geometric probabilities 


PROBLEM: Mr. Lochel’s class decides to play the Lucky Day game. Let Y = the number of 
homework problems that the class receives. 


a. Find the probability that the class receives exactly 10 homework problems as a result of 
playing the Lucky Day game. 


b. Find P(Y < 10) and interpret this value. 


SOLUTION: 
Y has a geometric distribution with (b= 1/7. 


CONE 1) G0? 77) = O57 
(6) P(Y < 10) = P(Y =1) + P(Y = 2) + P(Y =3) +... + P(Y =9) 


= 1/7 + (6/7)(1/7) + (6/7)7(1/7) + ... + (6/78 (1/7) 

=O 7503 
There’s about a 75% probability that the class will get fewer than 10 homework problems 
by playing the Lucky Day game. 


FOR PRACTICE, TRY EXERCISE 107 


There’s a clever alternative approach to finding the probability in part (b) of the example. 
By the complement rule, P(Y < 10) = 1 — P(Y= 10). What’s the probability that it will take at 
least 10 picks for Mr. Lochel’s class to win the Lucky Day game? It’s the chance that the first 9 
6\° 
picks are all incorrect: eng-0250be) 7 iad So the probability that the class will win the 
Lucky Day game in fewer than 10 picks (and therefore have fewer than 10 homework problems 
assigned) is 


P(Y<10)=1-P(Y210)=1-0.250=0.750P(¥ < 10) = 1— P(Y > 10) = 1 — 0.250 = 0.750 


As you probably guessed, we can use technology to calculate geometric probabilities. The 
following Technology Corner shows how to do it. 


16. Technology Corner | CALCULATING GEOMETRIC 
PROBABILITIES (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


There are two handy commands on the TI-83/84 for finding geometric probabilities: 
geometpdf and geometcdf. The inputs for both commands are the success probability p and 
the value(s) of interest for the geometric random variable Y. 


geometpdf (p,k) computes P(Y = k) 
geometcdf (p,k) computes P(Y < k) 


Let’s use these commands to confirm our answers in the previous example. 


a. Find the probability that the class receives exactly 10 homework problems as a result of 
playing the Lucky Day game. 


e Press (DISTR) and choose geometpdf(. 


OS 2.55 or later: In the dialog box, enter these values: p:1/ 7, x value:10, choose 


Paste, and then press |ENTER|. 


Older OS: Complete the command geometpdf(1/ 7,10) and press [ENTER], 


These results agree with our previous answer using the geometric probability formula: 
0.0357. 


NORMAL FLOAT AUTO REAL RADIAN MP | 


geometpdf(1i/7,18) 
@.9356763859 


b. Find P(Y<< 10) and interpret this value. To find P(Y<10), use the geometcdf command: 


P(Y<10)=P(Y<9)=geometcdf(1/7,9)? (¥ < 10) = P(¥ < 9) = geometedf(1/7,9) 


e Press (DISTR) and choose geometcdf. 


OS 2.55 or later: In the dialog box, enter these values: p:1/ 7, x value:9, choose Paste, 


and then press [ENTER|. 


Older OS: Complete the command geometcdf(1/ 7,9) and press [ENTER]. 


These results agree with our previous answer using the geometric probability formula: 
0.7503. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


geometcdf(1/7,9) 
@. 7502652985 


The table shows part of the probability distribution of Y = the number of picks it takes to 
match the lucky day. We can’t show the entire distribution because the number of trials it takes 
to get the first success could be a very large number. 


Value y ; 1 2 3 4 5 6 7 8 9 
Probability p ; 0.143 0.122 0.105 0.090 0.077 0.066 0.057 0.049 0.042 


Figure 6.10 is a histogram of the probability distribution for values of Y from 1 to 26. Let’s 
describe what we see. 
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Y = Number of picks required 
FIGURE 6.10 Histogram showing the probability distribution of the geometric random variable Y 
= number of trials needed for students to pick correctly in the Lucky Day game. 


Shape: Skewed to the right. Every geometric distribution has this shape. That’s because the 
most likely value of a geometric random variable is 1. The probability of each successive value 


decreases by a factor of (1 — p). 


Center: The mean (expected value) of Y is py = 7. (Due to the infinite number of possible 


values of Y, the calculation of the mean is beyond the scope of this text.) If the class played the 
Lucky Day game many times, they would receive an average of 7 homework problems. It’s no 
coincidence that p = 1/7 and pry = 7. With probability of success 1/7 on each trial, we’d expect 


it to take an average of 7 trials to get the first success. 


Variability: The standard deviation of Y is oy = 6.48. (Due to the infinite number of possible 


values of Y, the calculation of the standard deviation is beyond the scope of this text.) If the 
class played the Lucky Day game many times, the number of homework problems they receive 
would typically vary by about 6.5 problems from the mean of 7. That could mean a lot of 
homework! 

We can generalize the result for the mean of a geometric random variable. 


MEAN (EXPECTED VALUE) OF A GEOMETRIC RANDOM VARIABLE 


If Y is a geometric random variable with probability of success p on each trial, then its mean 


(expected value) is UY=E(Y)=1p"" = E(Y)= 
required to get the first success is 1/p. 


1 
p, That is, the expected number of trials 


CHECK YOUR UNDERSTANDING 


i. 
v 


Suppose you roll a pair of fair, six-sided dice until you get doubles. Let T = the number of 
rolls it takes. Note that the probability of getting doubles on any roll is 6/36 = 1/6. 


1. Show that T is a geometric random variable. 
2. Find P(T = 3). Interpret this result. 


3. In the game of Monopoly, a player can get out of jail free by rolling doubles within 3 
turns. Find the probability that this happens. 


Section 6.3 Summary 


e A binomial setting arises when we perform n independent trials of the same chance process 
and count the number of times that a particular outcome (a “success”) occurs. The conditions 
for a binomial setting are: 


« Binary? The possible outcomes of each trial can be classified as “success” or “failure.” 


= Independent? Trials must be independent. That is, knowing the result of one trial must not 
tell us anything about the result of any other trial. 


= Number? The number of trials n of the chance process must be fixed in advance. 


« Same probability? There is the same probability of success p on each trial. 
Remember to check the BINS! 


The count of successes X in a binomial setting is a special type of discrete random variable 
known as a binomial random variable. Its probability distribution is a binomial 
distribution. Any binomial distribution is completely specified by two numbers: the number 
of trials n of the chance process and the probability of success p on any trial. The possible 
values of X are the whole numbers 0, 1, 2, ... , n. 


Use the binomial probability formula to calculate the probability of getting exactly k 
successes in n trials: 


P{(X=) = (7) (1 p)* 
P(X=k)=(nk)pk(1-p)n-k k 


» The binomial coefficient 


n n! 
(nk)=n!k!(n—-k)! ( ‘) . ki(n—k)! 
counts the number of ways k successes can be arranged among n trials. 
« The factorial of n is 
n!=n(n-1)(n-2):--+ (3)(2)(1)™! = 7 (m — 1) (m= 2) +++ (3) (2) (1) 
for positive whole numbers n, and 0! = 1. 


You can also use technology to calculate binomial probabilities. The TI-83/84 command 
binompdf(n,p,k) computes P(X = k). The TI-83/84 command binomcdf(n,p,k) computes P(X 
<k). 


A binomial distribution can have a shape that is roughly symmetric, skewed to the right, or 
skewed to the left. 


The mean and standard deviation of a binomial random variable X are 
uX=npandoX=np(1-p) #* =" and ox = ,/np(1—p) 


The binomial distribution with n trials and probability p of success gives a good 
approximation to the count of successes in a random sample of size n from a large 
population containing proportion p of successes. This is true as long as the sample size n is 
less than 10% of the population size N (the 10% condition). 


The Normal approximation to the binomial distribution* says that if X is a count of successes 
having the binomial distribution with n trials and success probability p, then when n is large, 


X is approximately Normally distributed. You can use this approximation when np = 10 and 
n(1 — p) = 10 (the Large Counts condition). 


e A geometric setting consists of repeated trials of the same chance process in which the 
probability p of success is the same on each trial, and the goal is to count the number of trials 
it takes to get one success. If Y = the number of trials required to obtain the first success, then 
Y is a geometric random variable. Its probability distribution is called a geometric 
distribution. 


e If Y has the geometric distribution with probability of success p, the possible values of Y are 
the positive integers 1, 2, 3, ... . The probability that it takes exactly k trials to get the first 
success is given by 


P(Y=k)=(1-p)k-1p? (¥ =) = (1p) 'p 


e The mean (expected value) of a geometric random variable Y is pry = 1/p. 


6.3 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 

highschool. bfwpub.com/tps6e. 

13. Calculating binomial coefficients Page 408 
14. Calculating binomial probabilities Page 410 
15. Graphing binomial probability distributions Page 413 
16. Calculating geometric probabilities Page 424 


Section 6.3 Exercises 


In Exercises 77—80, determine whether the given scenario describes a binomial setting. Justify 
your answer. 


77. pg 405 Baby elk Biologists estimate that a randomly selected baby elk has a 44% 
Ne] 


chance of surviving to adulthood. Assume this estimate is correct. Suppose researchers 
choose 7 baby elk at random to monitor. Let X = the number that survive to adulthood. 


78. Long or short? Put the names of all the students in your statistics class in a hat. Mix up 
the names, and draw 4 without looking. Let X = the number whose last names have more 
than six letters. 


79. Bull’s-eye! Lawrence likes to shoot a bow and arrow in his free time. On any shot, he has 
about a 10% chance of hitting the bull’s-eye. As a challenge one day, Lawrence decides to 
keep shooting until he gets a bull’s-eye. Let Y = the number of shots he takes. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 
88. 


Taking the train According to New Jersey Transit, the 8:00 A.M. weekday train from 
Princeton to New York City has a 90% chance of arriving on time on a randomly selected 
day. Suppose this claim is true. Choose 6 days at random. Let Y = the number of days on 
which the train arrives on time. 

pg 409 > Baby elk Refer to Exercise 77. Use the binomial probability formula to find 
P(X = 4). Interpret this value. 


Taking the train Refer to Exercise 80. Use the binomial probability formula to find P(Y = 
4). Interpret this value. 


Take a spin An online spinner has two colored regions—blue and yellow. According to 
the website, the probability that the spinner lands in the blue region on any spin is 0.80. 
Assume for now that this claim is correct. Suppose we spin the spinner 12 times and let X 
= the number of times it lands in the blue region. 


a. Explain why X is a binomial random variable. 


b. Find the probability that exactly 8 spins land in the blue region. 


Red light! Pedro drives the same route to work on Monday through Friday. His route 
includes one traffic light. According to the local traffic department, there is a 55% chance 
that the light will be red on a randomly selected work day. Suppose we choose 10 of 
Pedro’s work days at random and let Y = the number of times that the light is red. 


a. Explain why Y is a binomial random variable. 
b. Find the probability that the light is red on exactly 7 days. 
pg 409 » Baby elk Refer to Exercise 77. How surprising would it be for more than 4 


elk in the sample to survive to adulthood? Calculate an appropriate probability to support 
your answer. 


Taking the train Refer to Exercise 80. Would you be surprised if the train arrived on time 
on fewer than 4 days? Calculate an appropriate probability to support your answer. 


Take a spin Refer to Exercise 83. Calculate and interpret P(X < 7). 
Red light! Refer to Exercise 84. Calculate and interpret P(Y > 7). 


89. 


90. 


91. 


pg 411 |») The last kiss Do people have a preference for the last thing they taste? 


Researchers at the University of Michigan designed a study to find out. The researchers 
gave 22 students five different Hershey’s Kisses (milk chocolate, dark chocolate, créme, 
caramel, and almond) in random order and asked the student to rate each one. Participants 
were not told how many Kisses they would be tasting. However, when the 5th and final 
Kiss was presented, participants were told that it would be their last one.2 Assume that the 
participants in the study don’t have a special preference for the last thing they taste. That 
is, assume that the probability a person would prefer the last Kiss tasted is p = 0.20. 


a. Find the probability that 14 or more students would prefer the last Kiss tasted. 


b. Of the 22 students, 14 gave the final Kiss the highest rating. Does this give convincing 
evidence that the participants have a preference for the last thing they taste? 


1 in 6 wins As a special promotion for its 20-ounce bottles of soda, a soft drink company 
printed a message on the inside of each bottle cap. Some of the caps said, “Please try 
again!” while others said, “You’re a winner!” The company advertised the promotion with 
the slogan “1 in 6 wins a prize.” Grayson’s statistics class wonders if the company’s claim 
holds true at a nearby convenience store. To find out, all 30 students in the class go to the 
store and each buys one 20-ounce bottle of the soda. 


a. Find the probability that two or fewer students would win a prize if the company’s 
claim is true. 


b. Two of the students in Grayson’s class got caps that say, “You’re a winner!” Does this 
result give convincing evidence that the company’s 1-in-6 claim is false? 


pg 414 |») Bag check Thousands of travelers pass through the airport in Guadalajara, 


Mexico, each day. Before leaving the airport, each passenger must go through the customs 
inspection area. Customs agents want to be sure that passengers do not bring illegal items 
into the country. But they do not have time to search every traveler’s luggage. Instead, 
they require each person to press a button. Either a red or a green bulb lights up. If the red 
light flashes, the passenger will be searched by customs agents. A green light means “go 
ahead.” Customs agents claim that the light has probability 0.30 of showing red on any 
push of the button. Assume for now that this claim is true. Suppose we watch 20 
passengers press the button. Let R = the number who get a red light. Here is a histogram of 
the probability distribution of R: 


a. What probability distribution does R have? Justify your answer. 


b. Describe the shape of the probability distribution. 
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R = Number of passengers who get a red light 


92. Easy-start mower? A company has developed an “easy-start” mower that cranks the 
engine with the push of a button. The company claims that the probability the mower will 
start on any push of the button is 0.9. Assume for now that this claim is true. On the next 
30 uses of the mower, let T = the number of times it starts on the first push of the button. 
Here is a histogram of the probability distribution of T: 


a. What probability distribution does T have? Justify your answer. 
b. Describe the shape of the probability distribution. 


0.25 


0.20 


0.05 
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R = Number of passengers who get a red light 


93. Take a spin An online spinner has two colored regions—blue and yellow. According to 
the website, the probability that the spinner lands in the blue region on any spin is 0.80. 
Assume for now that this claim is correct. Suppose we spin the spinner 12 times and let X 
= the number of times it lands in the blue region. Make a graph of the probability 
distribution of X. Describe its shape. 


94. Red light! Pedro drives the same route to work on Monday through Friday. His route 
includes one traffic light. According to the local traffic department, there is a 55% chance 


95. 


96. 


O7. 


98. 


99. 


that the light will be red on a randomly selected work day. Suppose we choose 10 of 
Pedro’s work days at random and let Y = the number of times that the light is red. Make a 
graph of the probability distribution of Y. Describe its shape. 


pg 416 a) Bag check Refer to Exercise 91. 


a. Calculate and interpret the mean of R. 


b. Calculate and interpret the standard deviation of R. 


Easy-start mower Refer to Exercise 92. 
a. Calculate and interpret the mean of T. 


b. Calculate and interpret the standard deviation of T. 


Random digit dialing When a polling company calls a telephone number at random, 
there is only a 9% chance that the call reaches a live person and the survey is successfully 
completed.£2 Suppose the random digit dialing machine makes 15 calls. Let X = the 
number of calls that result in a completed survey. 


a. Find the probability that more than 12 calls are not completed. 
b. Calculate and interpret pX* . 


c. Calculate and interpret oX?% . 


Lie detectors A federal report finds that lie detector tests given to truthful persons have 
probability 0.2 of suggesting that the person is deceptive.t+ A company asks 12 job 
applicants about thefts from previous employers, using a lie detector to assess their 
truthfulness. Suppose that all 12 answer truthfully. Let Y = the number of people whom 
the lie detector indicates are being deceptive. 


a. Find the probability that the lie detector indicates that at least 10 of the people are being 
honest. 


b. Calculate and interpret p Y/Y . 
c. Calculate and interpret oY7Y . 


pg 419 |») Lefties A total of 11% of students at a large high school are left-handed. A 
Statistics teacher selects a random sample of 100 students and records L = the number of 
left-handed students in the sample. 


a. Explain why L can be modeled by a binomial distribution even though the sample was 
selected without replacement. 


b. Use a binomial distribution to estimate the probability that 15 or more students in the 
sample are left-handed. 


100. In debt? According to financial records, 24% of U.S. adults have more debt on their 
credit cards than they have money in their savings accounts. Suppose that we take a 
random sample of 100 U.S. adults. Let D = the number of adults in the sample with more 
debt than savings. 


a. Explain why D can be modeled by a binomial distribution even though the sample 
was selected without replacement. 


b. Use a binomial distribution to estimate the probability that 30 or more adults in the 
sample have more debt than savings. 


101. Airport security The Transportation Security Administration (TSA) is responsible for 
airport safety. On some flights, TSA officers randomly select passengers for an extra 
security check before boarding. One such flight had 76 passengers—12 in first class and 
64 in coach class. Some passengers were surprised when none of the 10 passengers 
chosen for screening were seated in first class. Should we use a binomial distribution to 
approximate this probability? Justify your answer. 


102. Scrabble In the game of Scrabble, each player begins by drawing 7 tiles from a bag 
containing 100 tiles. There are 42 vowels, 56 consonants, and 2 blank tiles in the bag. 
Cait chooses her 7 tiles and is surprised to discover that all of them are vowels. Should 
we use a binomial distribution to approximate this probability? Justify your answer. 


103. *pg 421 \p) Lefties Refer to Exercise 99. 


a. Justify why L can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 15 or more students in the 
sample are left-handed. 


104. *In debt? Refer to Exercise 100. 
a. Justify why D can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 30 or more adults in the 
sample have more debt than savings. 


105. 10% condition To use a binomial distribution to approximate the count of successes in 
an SRS, why do we require that the sample size n be less than 10% of the population size 
N? 


106. * Large Counts condition To use a Normal distribution to approximate binomial 
probabilities, why do we require that both np and n(1 — p) be at least 10? 


107. pg 423 |») Cranky mower To start her old lawn mower, Rita has to pull a cord and 


hope for some luck. On any particular pull, the mower has a 20% chance of starting. 
a. Find the probability that it takes her exactly 3 pulls to start the mower. 


b. Find the probability that it takes her more than 6 pulls to start the mower. 


108. 


109. 


110. 


111. 


112. 


1-in-6 wins Alan decides to use a different strategy for the 1-in-6 wins game of Exercise 
90. He keeps buying one 20-ounce bottle of the soda at a time until he gets a winner. 


a. Find the probability that he buys exactly 5 bottles. 


b. Find the probability that he buys at most 6 bottles. Show your work. 


Geometric or not? Determine whether each of the following scenarios describes a 
geometric setting. If so, define an appropriate geometric random variable. 


a. A popular brand of cereal puts a card bearing the image of 1 of 5 famous NASCAR 
drivers in each box. There is a 1/5 chance that any particular driver’s card ends up in 
any box of cereal. Buy boxes of the cereal until you have all 5 drivers’ cards. 


b. In a game of 4-Spot Keno, Lola picks 4 numbers from 1 to 80. The casino randomly 
selects 20 winning numbers from 1 to 80. Lola wins money if she picks 2 or more of 
the winning numbers. The probability that this happens is 0.259. Lola decides to keep 
playing games of 4-Spot Keno until she wins some money. 


Geometric or not? Determine whether each of the following scenarios describes a 
geometric setting. If so, define an appropriate geometric random variable. 


a. Shuffle a standard deck of playing cards well. Then turn over one card at a time from 
the top of the deck until you get an ace. 


b. Billy likes to play comhole in his free time. On any toss, he has about a 20% chance 
of getting a bag into the hole. As a challenge one day, Billy decides to keep tossing 
bags until he gets one in the hole. 


Using Benford’s law According to Benford’s law (Exercise 15, page 377), the 
probability that the first digit of the amount of a randomly chosen invoice is an 8 or a 9 is 
0.097. Suppose you examine randomly selected invoices from a vendor until you find 
one whose amount begins with an 8 or a 9. 


a. How many invoices do you expect to examine before finding one that begins with an 
8 or 9? 


b. In fact, the first invoice you find with an amount that starts with an 8 or 9 is the 40th 
invoice. Does this result provide convincing evidence that the invoice amounts are not 
genuine? Calculate an appropriate probability to support your answer. 


Roulette Marti decides to keep placing a $1 bet on number 15 in consecutive spins of a 
roulette wheel until she wins. On any spin, there’s a 1-in-38 chance that the ball will land 
in the 15 slot. 


a. How many spins do you expect it to take for Marti to win? 


b. Would you be surprised if Marti won in 3 or fewer spins? Compute an appropriate 
probability to support your answer. 


Multiple Choice: Select the best answer for Exercises 113-117. 


113. Joe reads that 1 out of 4 eggs contains salmonella bacteria. So he never uses more than 3 


eggs in cooking. If eggs do or don’t contain salmonella independently of each other, the 
number of contaminated eggs when Joe uses 3 eggs chosen at random has the following 
distribution: 


a. binomial; n = 4 and p = 1/4 
b. binomial; n = 3 and p = 1/4 
c. binomial; n = 3 and p = 1/3 
d. geometric; p = 1/4 


e. geometric; p = 1/3 


Exercises 114 and 115 refer to the following setting. A fast-food restaurant runs a promotion in 
which certain food items come with game pieces. According to the restaurant, 1 in 4 game 
pieces is a winner. 


114. 


115. 


If Jeff gets 4 game pieces, what is the probability that he wins exactly 1 prize? 
a. 0.25 


b. 1.00 
4 
C nee: 
4 
d. aiynosiorodt 


e. (0.75)3(0.75)! 


If Jeff keeps playing until he wins a prize, what is the probability that he has to play the 
game exactly 5 times? 


a. (0.25)? 
b. (0.75)* 
c. (0.75)? 


d. (0.75)4(0.25) 


5 
0.7 
e. sioseuiaaeh 1 


116. Each entry in a table of random digits like Table D has probability 0.1 of being a 0, and 


the digits are independent of one another. Each line of ‘Table D contains 40 random 


digits. The mean and standard deviation of the number of Os in a randomly selected line 
will be approximately 


a. mean = 0.1, standard deviation = 0.05. 
b. mean = 0.1, standard deviation = 0.1. 
c. mean = 4, standard deviation = 0.05. 
d. mean = 4, standard deviation = 1.90. 


e. mean = 4, standard deviation = 3.60. 


117. * In which of the following situations would it be appropriate to use a Normal distribution 


to approximate probabilities for a binomial distribution with the given values of n and p? 
a. n=10,p=0.5 

b. n= 40, p = 0.88 

c. n= 100, p=0.2 

d. n= 100, p = 0.99 

e. n= 1000, p = 0.003 


Recycle and Review 


118. 


119. 


Spoofing (4.2) To collect information such as passwords, online criminals use 
“spoofing” to direct Internet users to fraudulent websites. In one study of Internet fraud, 
students were warmed about spoofing and then asked to log into their university account 
starting from the university’s home page. In some cases, the log-in link led to the 
genuine dialog box. In others, the box looked genuine but, in fact, was linked to a 
different site that recorded the ID and password the student entered. The box that 
appeared for each student was determined at random. An alert student could detect the 
fraud by looking at the true Internet address displayed in the browser status bar, but most 
just entered their ID and password. 


a. Is this an observational study or an experiment? Justify your answer. 


b. What are the explanatory and response variables? Identify each variable as categorical 
or quantitative. 

Standard deviations (6.1) Continuous random variables A, B, and C all take values 

between 0 and 10. Their density curves, drawn on the same horizontal scales, are shown 

here. Rank the standard deviations of the three random variables from smallest to largest. 

Justify your answer. 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Buckley Farms produces homemade potato chips that it sells in bags labeled “16 
ounces.” The total weight of each bag follows an approximately Normal distribution with 
a mean of 16.15 ounces and a standard deviation of 0.12 ounce. 


a. If you randomly selected 1 bag of these chips, what is the probability that the total 
weight is less than 16 ounces? 


If you randomly selected 10 bags of these chips, what is the probability that 
exactly 2 of the bags will have a total weight less than 16 ounces? 


Buckley Farms ships its chips in boxes that contain 6 bags. The empty boxes have 
a mean weight of 10 ounces and a standard deviation of 0.05 ounce. Calculate the 
mean and standard deviation of the total weight of a box containing 6 bags of 
chips. 


Buckley Farms decides to increase the mean weight of each bag of chips so that 
only 5% of the bags have weights that are less than 16 ounces. Assuming that the 
standard deviation remains 0.12 ounce, what mean weight should Buckley Farms 
use? 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 6 Review 


Section 6.1: Discrete and Continuous Random Variables 


A random variable assigns numerical values to the outcomes of a chance process. The 
probability distribution of a random variable describes its possible values and their 
probabilities. There are two types of random variables: discrete and continuous. Discrete 
random variables take on a fixed set of values with gaps in between. Continuous random 
variables take on all values in an interval of numbers. 


As in Chapter 1, we are often interested in the shape, center, and variability of a probability 
distribution. The shape of a discrete probability distribution can be identified by graphing a 
probability histogram, with the height of each bar representing the probability of a single value. 
The center is usually identified by the mean (expected value) of the random variable. The mean 
(expected value) is the average value of the random variable if the chance process is repeated 
many times. The variability of a probability distribution is usually identified by the standard 
deviation, which describes how much the values of a random variable typically vary from the 
mean value, in many repetitions of the chance process. 


Continuous probability distributions, such as the Normal distribution, describe the 
distribution of continuous random variables. A density curve is used to display a continuous 
probability distribution. Probabilities for continuous random variables are determined by 
finding the area under the density curve and above the values of interest. 


Section 6.2: Transforming and Combining Random Variables 
In this section, you learned how linear transformations of a random variable affect the shape, 
center, and variability of its probability distribution. Similar to what you learned in Chapter 2, 
adding a positive constant to (or subtracting it from) each value of a random variable changes 
the measures of center and location, but not the shape or variability of the probability 
distribution. Multiplying or dividing each value of a random variable by a positive constant 
changes the measures of center and location and measures of variability, but not the shape of 
the probability distribution. 

You also learned how to calculate the mean and standard deviation for a combination of 
two or more random variables. The mean of a sum or difference of any two random variables X 
and Y is given by 


uX+Y=pX+pVYandpX-Y=pX-py Hx+yY = ex + Hy and px_y = Hx — By 


If X and Y are any two independent random variables, variances add: 


) 


2 —e 2 2 ere | 2 
oX+V¥2=0X24+0Y2 and oX-Y2=0X2+0Y27x+Y =% +%y and oy y =7y + oy 


To find the standard deviation, just take the square root of the variance. Recall that X and Y 
are independent if knowing the value of one variable doesn’t help you predict the value of the 


other variable. Also, if independent random variables X and Y are both Normally distributed, 
then their sum X + Y and difference X — Y are both Normally distributed as well. 


Section 6.3: Binomial and Geometric Random Variables 


In this section, you learned about two common types of discrete random variables, binomial 
random variables and geometric random variables. Binomial random variables count the 
number of successes in a fixed number of trials (n), whereas geometric random variables count 
the number of trials needed to get one success. Otherwise, the binomial and geometric settings 
have the same conditions: there must be two possible outcomes for each trial (success or 
failure), the trials must be independent, and the probability of success p must stay the same 
throughout all trials. 

To calculate probabilities for a binomial distribution with n trials and probability of success 
p on each trial, use technology or the binomial probability formula 


P(X =k) = ( ) Xa —pyr"" 
P(X=k)=(nk)pk(1-p)n-k k 


The mean and standard deviation of a binomial random variable X are 


— 
uX=npandoX=np(1-p) ix =np and ox = ,/np(1— p) 


The shape of a binomial distribution depends on both the number of trials n and the 
probability of success p. When the number of trials is large enough that both np and n(1 — p) 
are at least 10, the distribution of the binomial random variable X has an approximately Normal 
distribution. Be sure to check the Large Counts condition before using a Normal approximation 
to a binomial distribution. 

A common application of the binomial distribution is when we count the number of times a 
particular outcome occurs in a random sample from some population. Because sampling is 
almost always done without replacement, the Independent condition is violated. However, if 
the sample size is a small fraction of the population size (less than 10%), the lack of 
independence isn’t a concern. Be sure to check the 10% condition when sampling is done 
without replacement before using a binomial distribution. 

Finally, to calculate probabilities for a geometric distribution with probability of success p 
on each trial, use technology or the geometric probability formula 


P(Y=k)=(1-p)k-1p? (¥ = *) = (1—p)*"p 


The mean (expected value) of a geometric random variable Y is pY=1/p¥Y¥ — 1/p_ 


What Did You Learn? 


Related 
Example on Relevant Chapter Review 


Learning Target 


Use the probability 
distribution of a discrete 
random variable to 
calculate the probability of 
an event. 


Make a histogram to 
display the probability 
distribution of a discrete 
random variable and 
describe its shape. 


Calculate and interpret the 
mean (expected value) of 
a discrete random 
variable. 


Calculate and interpret the 
standard deviation of a 
discrete random variable. 


Use the probability 
distribution of a continuous 
random variable (uniform 
or Normal) to calculate the 
probability of an event. 


Describe the effect of 
adding or subtracting a 
constant or multiplying or 
dividing by a constant on 
the probability distribution 
of a random variable. 


Calculate the mean and 
standard deviation of the 
sum or difference of 
random variables. 


Find probabilities involving 
the sum or difference of 
independent Normal 
random variables. 


Determine whether the 
conditions for a binomial 
setting are met. 


Calculate and interpret 
probabilities involving 
binomial distributions. 


Calculate the mean and 
standard deviation of a 
binomial random variable. 
Interpret these values. 
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Chapter 6 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R6.1 _») Knees Patients receiving artificial knees often experience pain after surgery. The 


pain is measured on a subjective scale with possible values of 1 (low) to 5 (high). Let Y 
be the pain score for a randomly selected patient. The following table gives the 
probability distribution for Y. 


Value 1 2 3 4 5 
Probability 0.1 0.2 0.3 0.3 2? 


a. Find P(Y = 5). Interpret this value. 
b. Find the probability that a randomly selected patient has a pain score of at most 2. 
c. Calculate the expected pain score and the standard deviation of the pain score. 


R6.2 ») A glass act In a process for manufacturing glassware, glass stems are sealed by 


heating them in a flame. Let X be the temperature (in degrees Celsius) for a randomly 
chosen glass. The mean and standard deviation of X are [fy = 550°C and oy = 5.7°C. 


a. Is temperature a discrete or continuous random variable? Explain your answer. 


b. The target temperature is 550°C. What are the mean and standard deviation of the 
number of degrees off target, D = X — 550°? 


c. A manager asks for results in degrees Fahrenheit. The conversion of X into degrees 
_9 
Fahrenheit is given by Y=95X+32> =e 32 What are the mean ty and the 


standard deviation oy of the temperature of the flame in the Fahrenheit scale? 


R6.3 _p) Keno In a game of 4-Spot Keno, the player picks 4 numbers from 1 to 80. The 


casino randomly selects 20 winning numbers from 1 to 80. The table shows the possible 
outcomes of the game and their probabilities, along with the amount of money (Payout) 
that the player wins for a $1 bet. If X = the payout for a single $1 bet, you can check that 
Ly = $0.70 and oy = $6.58. 


Matches 0 1 Z 3 4 
Payout x; $0 $0 $1 $3 $120 
Probability p; 0.308 0.433 0.213 0.043 0.003 


a. Make a graph of the probability distribution. Describe what you see. 


. Interpret the values of pty and oy. 


Jerry places a single $5 bet on 4-Spot Keno. Find the expected value and the standard 
deviation of his winnings. 


. Marla plays five games of 4-Spot Keno, betting $1 each time. Find the expected value 


and the standard deviation of her total winnings. 


R6.4 ») Applying torque A machine fastens plastic screw-on caps onto containers of motor 


oil. If the machine applies more torque than the cap can withstand, the cap will break. 
Both the torque applied and the strength of the caps vary. The capping-machine torque T 
follows a Normal distribution with mean 7 inch-pounds and standard deviation 0.9 inch- 
pound. The cap strength C (the torque that would break the cap) follows a Normal 
distribution with mean 10 inch-pounds and standard deviation 1.2 inch-pounds. 


a. 


Find the probability that a randomly selected cap has a strength greater than 11 inch- 
pounds. 


. Explain why it is reasonable to assume that the cap strength and the torque applied by 


the machine are independent. 


c. Let the random variable D = C — T. Find its mean and standard deviation. 


. What is the probability that a randomly selected cap will break while being fastened 


by the machine? 


Exercises R6.5 and R6.6 refer to the following setting. 

According to Mars, Incorporated, 20.5% of the M&M’S® Milk Chocolate Candies made at its 
Cleveland factory are orange. Assume that the company’s claim is true. Suppose you take a 
random sample of 8 candies from a large bag of M&M’S. Let X = the number of orange 
candies you get. 


R6.5 ») Orange M&M’S 


a. 


Explain why it is reasonable to use the binomial distribution for probability 
calculations involving X. 


. What’s the probability that you get 3 orange M&M’S? 


c. Calculate P(X = 4). Interpret this result. 


. Suppose that you get 4 orange M&M’S in your sample. Does this result provide 


convincing evidence that Mars’s claim about its M&M’S is false? Justify your 
answer. 


R6.6 ») Orange M&M’S 


a. Find and interpret the expected value of X. 


b. Find and interpret the standard deviation of X. 


R6.7 \») Sushi Roulette In the Japanese game show Sushi Roulette, the contestant spins a 
large wheel that’s divided into 12 equal sections. Nine of the sections show a sushi roll, 
and three have a “wasabi bomb.” When the wheel stops, the contestant must eat whatever 
food applies to that section. Then the game show host replaces the item of food on the 
wheel. To win the game, the contestant must eat one wasabi bomb. Find the probability 
that it takes 3 or fewer spins for the contestant to get a wasabi bomb. 

R6.8* _») Public transportation In a large city, 34% of residents use public transportation at 
least once per week. Suppose the city’s mayor selects a random sample of 200 residents. 
Let T = the number who use public transportation at least once per week. 

a. What type of probability distribution does T have? Justify your answer. 
b. Explain why T can be approximated by a Normal distribution. 


c. Calculate the probability that at most 60 residents in the sample use public 
transportation at least once per week. 


Chapter 6 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


Questions T6.1-T6.3 refer to the following setting. A psychologist studied the number of 
puzzles that subjects were able to solve in a 5-minute period while listening to soothing 
music. Let X be the number of puzzles completed successfully by a randomly chosen 
subject. The psychologist found that X had the following probability distribution. 


Value 1 2 3 4 
Probability 0.2 0.4 0.3 0.1 


T6.1 What is the probability that a randomly chosen subject completes more than the 
expected number of puzzles in the 5-minute period while listening to soothing music? 


a. 0.1 
b. 0.4 
c. 0.8 
d. 1 


e. Cannot be determined 


T6.2 The standard deviation of X is 0.9. Which of the following is the best interpretation of 
this value? 


a. About 90% of subjects solved 3 or fewer puzzles. 


b. About 68% of subjects solved between 0.9 puzzles less and 0.9 puzzles more than 
the mean. 


c. The typical subject solved an average of 0.9 puzzles. 


d. The number of puzzles solved by subjects typically differed from the mean by about 
0.9 puzzles. 


e. The number of puzzles solved by subjects typically differed from one another by 
about 0.9 puzzles. 


T6.3 Let D be the difference in the number of puzzles solved by two randomly selected 
subjects in a 5-minute period. What is the standard deviation of D? 


a. O 
b. 0.81 
c. 0.9 


a. 27 
e. 1.8 


T6.4 Suppose a student is randomly selected from your school. Which of the following pairs 
of random variables are most likely independent? 


a. X = student’s height; Y = student’s weight 

b. X = student’s IQ; Y = student’s GPA 

c. X =student’s PSAT Math score; Y = student’s PSAT Verbal score 

d. X = average amount of homework the student does per night; Y = student’s GPA 


e. X = average amount of homework the student does per night; Y = student’s height 


T6.5 A certain vending machine offers 20-ounce bottles of soda for $1.50. The number of 
bottles X bought from the machine on any day is a random variable with mean 50 and 
standard deviation 15. Let the random variable Y equal the total revenue from this 
machine on a randomly selected day. Assume that the machine works properly and that 
no sodas are stolen from the machine. What are the mean and standard deviation of Y? 


a. py = $1.50, oy = $22.50 
b. py = $1.50, oy = $33.75 
C. py = $75, oy = $18.37 
d. py = $75, oy = $22.50 
e. Hy = $75, dy = $33.75 


T6.6 The weight of tomatoes chosen at random from a bin at the farmer’s market follows a 
Normal distribution with mean p = 10 ounces and standard deviation o = 1 ounce. 
Suppose we pick four tomatoes at random from the bin and find their total weight T. 
The random variable T is 


a. Normal, with mean 10 ounces and standard deviation 1 ounce. 
b. Normal, with mean 40 ounces and standard deviation 2 ounces. 
c. Normal, with mean 40 ounces and standard deviation 4 ounces. 
d. binomial, with mean 40 ounces and standard deviation 2 ounces. 


e. binomial, with mean 40 ounces and standard deviation 4 ounces. 


T6.7 Which of the following random variables is geometric? 
a. The number of times I have to roll a single die to get two 6s 
b. The number of cards I deal from a well-shuffled deck of 52 cards to get a heart 


c. The number of digits I read in a randomly selected row of the random digits table to 


geta 7 
d. The number of 7s in a row of 40 random digits 


e. The number of 6s I get if I roll a die 10 times 


T6.8 Seventeen people have been exposed to a particular disease. Each one independently 
has a 40% chance of contracting the disease. A hospital has the capacity to handle 10 
cases of the disease. What is the probability that the hospital’s capacity will be 
exceeded? 


a. 0.011 
b. 0.035 
c. 0.092 
d. 0.965 
e. 0.989 


T6.9 The figure shows the probability distribution of a discrete random variable X. Which of 

the following best describes this random variable? 

a. Binomial with n = 8, p = 0.1 

b. Binomial with n = 8, p = 0.3 

c. Binomial with n = 8, p = 0.8 

d. Geometric with p = 0.1 

e. Geometric with p = 0.2 
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T6.10 A test for extrasensory perception (ESP) involves asking a person to tell which of 5 
shapes—a circle, star, triangle, diamond, or heart—appears on a hidden computer 
screen. On each trial, the computer is equally likely to select any of the 5 shapes. 
Suppose researchers are testing a person who does not have ESP and so is just 
guessing on each trial. What is the probability that the person guesses the first 4 
shapes incorrectly but gets the fifth one correct? 


a. 1/5 


b apa 


4\4 
c. (4sya-cas)\3) (5) 
3) 
1 


d. ere | )-@) (3) 


e. 4/5 


Section II: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T6.11 Let Y denote the number of broken eggs in a randomly selected carton of one dozen 
“store brand” eggs at a local supermarket. Suppose that the probability distribution of 
Y is as follows. 


Value y; 0 1 2 3 4 
Probability p; 0.78 0.11 0.07 0.03 0.01 


a. What is the probability that at least 10 eggs in a randomly selected carton are 
unbroken? 


b. Calculate and interpret ily. 
c. Calculate and interpret oy. 


d. A quality control inspector at the store keeps looking at randomly selected cartons 
of eggs until he finds one with at least 2 broken eggs. Find the probability that this 
happens in one of the first three cartons he inspects. 


T6.12 Ladies Home Journal magazine reported that 66% of all dog owners greet their dog 
before greeting their spouse or children when they return home at the end of the 
workday. Assume that this claim is true. Suppose 12 dog owners are selected at 
random. Let X = the number of owners who greet their dogs first. 


a. Explain why it is reasonable to use the binomial distribution for probability 
calculations involving X. 


b. Find the probability that exactly 6 owners in the sample greet their dogs first when 
returning home from work. 


c. In fact, only 4 of the owners in the sample greeted their dogs first. Does this give 
convincing evidence against the Ladies Home Journal claim? Calculate P(X < 4) 
and use the result to support your answer. 


T6.13 Ed and Adelaide attend the same high school but are in different math classes. The 
time E that it takes Ed to do his math homework follows a Normal distribution with 
mean 25 minutes and standard deviation 5 minutes. Adelaide’s math homework time 
A follows a Normal distribution with mean 50 minutes and standard deviation 10 
minutes. Assume that E and A are independent random variables. 


a. Randomly select one math assignment of Ed’s and one math assignment of 
Adelaide’s. Let the random variable D be the difference in the amount of time each 
student spent on their assignments: D = A — E. Find the mean and the standard 
deviation of D. 


b. Find the probability that Ed spent longer on his assignment than Adelaide did on 
hers. 


T6.14 According to the Census Bureau, 13% of American adults (aged 18 and over) are 
Hispanic. An opinion poll plans to contact an SRS of 1200 adults. 


a. What is the mean number of Hispanics in such samples? What is the standard 
deviation? 


b. Should we be suspicious if the sample selected for the opinion poll contains 10% 
or less Hispanic people? Calculate an appropriate probability to support your 
answer. 


Chapter f Sampling Distributions 
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Chapter 7 Review Exercises 
Chapter 7 AP® Statistics Practice Test 


INTRODUCTION 


In this chapter, we will return to a key idea about statistical inference from Chapter 4—making 
conclusions about a population based on data from a sample. Here are a few examples of 
Statistical inference in practice: 


e Each month, the Current Population Survey (CPS) interviews a random sample of 
individuals in about 60,000 U.S. households. The CPS uses the proportion of unemployed 
people in the sample p/? to estimate the national unemployment rate p.?- 


e To estimate how much gasoline prices vary in a large city, a reporter records the price per 
gallon of regular unleaded gasoline at a random sample of 10 gas stations in the city. The 
range (Maximum — Minimum) of the prices in the sample is 25 cents. What can the reporter 
say about the range of gas prices at all the city’s stations? 


e A battery manufacturer wants to make sure that the AA batteries it produces each hour meet 
certain standards. Quality control inspectors collect data from a random sample of 50 AA 
batteries produced during one hour and use the sample mean lifetime x £ to estimate the 
unknown population mean lifetime pi for all batteries produced that hour. 


Let’s look at the battery example a little more closely. To make an inference about the 
batteries produced in the given hour, we need to know how close the sample mean x is likely 
to be to the population mean pL.“ After all, different random samples of 50 batteries from the 
same hour of production would yield different values of x .©- How can we describe this 
sampling distribution of possible x © values? We can think of x £ as a random variable 
because it takes numerical values that describe the outcomes of the random sampling process. 
As a result, we can examine its probability distribution using what we learned in Chapter 6. 

The following activity will help you get a feel for the distribution of two very common 
statistics, the sample mean x = and the sample proportion p/.P: 


ACTIVITY A penny for your thoughts? 


In this activity, your class will investigate how the mean year x £ and the proportion of 
pennies from the 2000s p/? vary from sample to sample, using a large population of pennies 
of various ages.4 


peterspiro/Getty Images 


1. Each member of the class should randomly select 1 penny from the population and record 
the year of the penny with an “X” on the dotplot provided by your teacher. Return the 
penny to the population. Repeat this process until at least 100 pennies have been selected 
and recorded. This graph gives you an idea of what the population distribution of penny 
years looks like. 


2. Each member of the class should then select an SRS of 5 pennies from the population and 
note the year on each penny. 


99 86a 


e Record the average year of these 5 pennies (rounded to the nearest year) with an “x ”“Z” on 
a new class dotplot. Make sure this dotplot is on the same scale as the dotplot in Step 1. 


e Record the proportion of pennies from the 2000s with a “pnp ” ona different dotplot 
provided by your teacher. 


Return the pennies to the population. Repeat this process until there are at least 100 x ’sz’s 
and 100 p/’s.P ®- 


3. Repeat Step 2 with SRSs of size n=20.n = 20. Make sure these dotplots are on the same 
scale as the corresponding dotplots from Step 2. 


4. Compare the distribution of X (year of penny) with the two distributions of x % (mean 
year). How are the distributions similar? How are they different? What effect does sample 
size seem to have on the shape, center, and variability of the distribution of x ?%? 


5. Compare the two distributions of p\.P: How are the distributions similar? How are they 
different? What effect does sample size seem to have on the shape, center, and variability 
of the distribution of prep? 


Sampling distributions are the foundation of inference when data are produced by random 
sampling. Because the results of random samples include an element of chance, we can’t 
guarantee that our inferences are correct. What we can guarantee is that our methods usually 
give correct answers. The reasoning of statistical inference rests on asking, “How often would 
this method give a correct answer if I used it many times?” If our data come from random 
sampling, the laws of probability help us answer this question. These laws also allow us to 


determine how far our estimates typically vary from the truth and what values of a statistic 
should be considered unusual. 


Section 7.1 presents the basic ideas of sampling distributions. The most common 
applications of statistical inference involve proportions and means. Section 7.2 focuses on 
sampling distributions of sample proportions. Section 7.3 investigates sampling distributions of 
sample means. 


SECTION 7.1 What Is a Sampling Distribution? 


LEARNING TARGETS By the end of the section, you should be able to: 


Distinguish between a parameter and a statistic. 


Create a sampling distribution using all possible samples from a small 
population. 


Use the sampling distribution of a statistic to evaluate a claim about a 
parameter. 


Distinguish among the distribution of a population, the distribution of a sample, 
and the sampling distribution of a statistic. 


Determine if a Statistic is an unbiased estimator of a population parameter. 


Describe the relationship between sample size and the variability of a statistic. 


What is the average income of U.S. residents with a college degree? Each March, the 
government’s Current Population Survey (CPS) asks detailed questions about income. The 
random sample of about 70,000 U.S. college grads contacted in March 2016 had a mean “total 
money income” of $73,750 in 2015.2 That $73,750 describes the sample, but we use it to 
estimate the mean income of all college grads in the United States. 


Because of some very large incomes, the mean total income ($73,750) was larger than the median 
total income ($55,071). 


Parameters and Statistics 


As we begin to use sample data to draw conclusions about a larger population, we must be 
clear about whether a number describes a sample or a population. For the sample of college 
graduates contacted by the CPS, the mean income was x =$73,750.2 = $73, 750. The number 
$73,750 is a statistic because it describes this one CPS sample. The population that the poll 
wants to draw conclusions about is the nearly 100 million U.S. residents with a college degree. 
In this case, the parameter of interest is the mean income pL of all these college graduates. 
We don’t know the value of this parameter, but we can estimate it using data from the sample. 


DEFINITION Statistic, Parameter 
A statistic is a number that describes some characteristic of a sample. 


A parameter is a number that describes some characteristic of the population. 


Remember s and p: statistics come from samples, and parameters come from populations. 
As long as we were doing data analysis, the distinction between population and sample rarely 
came up. Now that we are focusing on statistical inference, however, it is essential. The 
notation we use should reflect this distinction. The table shows three commonly used statistics 
and their corresponding parameters. 


It is common practice to use Greek letters for parameters and Roman letters for statistics. In that 
case, the population proportion would be 71 (pi, the Greek letter for “p”) and the sample 
proportion would be p. We'll stick with the notation that’s used on the AP® exam, however. 


Sample statistic Population parameter 
xX (the sample estimates uF (the population 
mean) mean) 
pP (the sample estimates p (the population 
proportion) proportion) 


Sy, (the sample SD) estimates o@ (the population SD) 


EXAMPLE |From ghosts to cold cabins (>) 
Parameters and statistics 


RyersonClark/Getty Images 

PROBLEM: Identify the population, the parameter, the sample, and the statistic in each of 

the following settings. 

a. The Gallup Poll asked 515 randomly selected U.S. adults if they believe in ghosts. Of the 
respondents, 160 said “Yes.”2 

b. During the winter months, the temperatures outside the Starneses’ cabin in Colorado can 
stay well below freezing for weeks at a time. To prevent the pipes from freezing, Mrs. 
Starnes sets the thermostat at 50°F. She wants to know how low the temperature actually 
gets in the cabin. A digital thermometer records the indoor temperature at 20 randomly 
chosen times during a given day. The minimum reading is 38°F. 


SOLUTION: 


a. Population: all U.S. adults. Parameter: pe the proportion of all U.S. adults who believe 
in ghosts. Sample: the 515 people who were interviewed in thts Gallup Poll. Statistic: pe 
P= the proportion in the sample who say they believe in ghosts = 160/51 =O 


b. Population: all times during the day m question. Parameter: the true minimum 
temperature in the cabin at all times that day. Sample: the 20 temperature readings at 
randomly selected times. Statistic: the sample minimum temperature, aye, 


Not all parameters and statistics have their own symbols. To distinguish parameters 


and statistics in these cases, use descriptors like “true” and “sample.” 


FOR PRACTICE, TRY EXERCISE 1 


AP® EXAM TIP 


Many students lose credit on the AP® Statistics exam when defining parameters 
because their description refers to the sample instead of the population or because 


the description isn’t clear about which group of individuals the parameter is 
describing. When defining a parameter, we suggest including the word all/ or the 
word true in your description to make it clear that you aren't referring to a sample 
Statistic. 


The Idea of a Sampling Distribution 


The students in Mrs. Gallas’s class did the “Penny for your thoughts” activity at the beginning 
of the chapter. Figure 7.1 shows their “dotplot” of the sample mean year for 50 samples of size 
n=5.n =. 
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FIGURE 7.1 Distribution of the sample mean year of penny for 50 samples of size n=5n = 5 
from Mrs. Gallas’s population of pennies. 


It shouldn’t be surprising that the statistic x Z is a variable. After all, different samples of 
n=5n = 5 pennies will produce different means. As you learned in Section 4.3, this basic fact is 


called sampling variability. 


DEFINITION Sampling variability 


Sampling variability refers to the fact that different random samples of the same 
size from the same population produce different values for a statistic. 


Knowing how statistics vary from sample to sample is essential when making an inference 
about a population. Understanding sampling variability reminds us that the value of a statistic is 
unlikely to be exactly equal to the value of the parameter it is trying to estimate. It also lets us 
say how much we expect an estimate to vary from its corresponding parameter. 

Mrs. Gallas’s class took only 50 random samples of 5 pennies. However, there are many, 
many possible random samples of size 5 from Mrs. Gallas’s large population of pennies. If the 
students took every one of those possible samples, calculated the value of x % for each, and 
graphed all those x £ values, then we’d have a sampling distribution. 


DEFINITION Sampling distribution 


The sampling distribution of a statistic is the distribution of values taken by the 
Statistic in all possible samples of the same size from the same population. 


Remember that a distribution describes the possible values of a variable and how often these 


values occur. Thus, a sampling distribution shows the possible values of a statistic and how 
often these values occur. 


For large populations, it is too difficult to take all possible samples of size n to obtain the 
exact sampling distribution of a statistic. Instead, we can approximate a sampling distribution 
by taking many samples, calculating the value of the statistic for each of these samples, and 
graphing the results. Because the students in Mrs. Gallas’s class didn’t take all possible 
samples of 5 pennies, their dotplot of x ’s#’s in Figure 7.1 is called an approximate sampling 
distribution. 


The following example demonstrates how to construct a complete sampling distribution 
using a small population. 


EXAMPLE | Sampling heights (>) 
Creating a sampling distribution 


Brian Miller 


PROBLEM: John and Carol have four grown sons. Their heights (in inches) are 71, 75, 72, 
and 68. 


a. List all 6 possible samples of size 2. 


b. Calculate the mean of each sample and display the sampling distribution of the sample 
mean using a dotplot. 


c. Calculate the range of each sample and display the sampling distribution of the sample 
range using a dotplot. 


SOLUTION: 
a. 
Sample 
fA, 5 
flete 
71, 68 
15; (2 
75, 68 
72, 68 
b. 
Sample Sample mean 
Yves) 73.0 
TM 71.5 
71, 68 69.5 


(52 73.5 


«2, 68 70.0 
3 
e a = v c) 
26) | i) a | es eT ema 
69 70 a 72 730 #4 


¥ = sample mean height (in.) 


Sample Sample range 
(Gs: 4 
Wh, WZ 1 
71, 68 3 
Hore 3 
75, 68 7 
72, 68 4 

: ; 8 : 

0 1 2 3 4 5 5 ? g 
Sample range of height Cin.) 
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Being able to construct (or approximate) the sampling distribution of a statistic allows us to 
determine the values of the statistic that are likely to occur by chance alone—and the values 
that should be considered unusual. The following example shows how we can use a sampling 
distribution to evaluate a claim. 


EXAMPLE | Reaching for chips (>) 
Using a sampling distribution to evaluate a claim 


woodygraphs/Getty 
Images 


PROBLEM: To determine how much homework time students will get in class, Mrs. Lin 
has a student select an SRS of 20 chips from a large bag. The number of red chips in the SRS 
determines the number of minutes in class students get to work on homework. Mrs. Lin 
claims that there are 200 chips in the bag and that 100 of them are red. When Jenna selected 
a random sample of 20 chips from the bag (without looking), she got 7 red chips. Does this 
provide convincing evidence that less than half of the chips in the bag are red? 


a. What is the evidence that less than half of the chips in the bag are red? 


b. Provide two explanations for the evidence described in part (a). 


We used technology to simulate choosing 500 SRSs of size n=20n = 20 from a population 
of 200 chips, 100 red and 100 blue. The dotplot shows p/? = the sample proportion of red 
chips for each of the 500 samples. 


c. There is one dot on the graph at 0.80. Explain what this value represents. 


d. Would it be surprising to get a sample proportion of p\=7/20=0.35? = 7/20 = 0.35 oy 


smaller in an SRS of size 20 when p=0.5?? = 9-5? Justify your answer. 


e. Based on your previous answers, is there convincing evidence that less than half of the 
chips in the large bag are red? Explain your reasoning. 


aA a Nid ke ee Eh hab 
0.1 02 03 04 OS 0.6 0.7 08 0.9 


p = sample proportion of red chips 


SOLUTION: 


a. Jenna’s sample proportion was pp = 7/20 = 0.35, which is less than 0.50. 


b. ‘It is possible that Mrs. Lin is telling the truth and Jenna gota pe less than 0.50 


because of sampling variability. It is also possible that Mrs. Lin is lying and less than 
half of the chips in the bag are red. 


c. Inone simulated SRS of 20 chips, there were 16 red chips. So pp = 16/20 = 0.80 for 
this sample. 


d. No; there were many simulated samples that had pe values less than OT equal to 0.35. 


e. Because it isn’t surprising to get a pp less than or equal (HO) OLA by chance alone when 


Pp = 0.50, there isn’t convincing evidence that less than half of the chips in the bag are 
red. 
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Suppose that Jenna’s sample included only 3 red chips, giving pA=3/20=0.15. 
p = 3/20 = 0.15. would this provide convincing evidence that less than half of the chips in the 
bag are red? Yes. According to the simulated sampling distribution in the example, it would be 
very unusual to get a pAP value this small when p=0.50.P = 9-59. Therefore, sampling 
variability would not be a plausible explanation for the outcome of Jenna’s sample. The only 
plausible explanation for a prP value of 0.15 is that less than half of the chips in the bag are 
red. 


Figure 7.2 illustrates the process of choosing many random samples of 20 chips from a 
population of 100 red chips and 100 blue chips and finding the sample proportion of red chips 
pp for each sample. Follow the flow of the figure from the population distribution on the left, 
to choosing an SRS, graphing the distribution of sample data, and finding the p/P for that 
particular sample, to collecting together the p’’sP $ from many samples. The first sample has 
p=0.40.P = 9-40. The second sample is a different group of chips, with p\=0.55P = 9-55, and 
So on. 


Distributions of sample data 
14 


Population distribution eB 10- Approximate 
Parameter: p = 0.50 g 8 sampling distribution 
srs § 6 p= 0.40 
120 n=20 & 4 20 
100 2 
ee 0 
= 80 Blue Red 
5 Color 
= 60 
= 12 
=, 40 10 
o> 8 Pat 
20 = 
5 6 WT 
0 SRS = p=—=055 — [ 
Blue —- Red n=20 & 4 P~ 20 . | t. 
Color 2 sso oe 
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Color p = sample proportion of red chips 


FIGURE 7.2 The idea of a sampling distribution is to take many samples from the same 
population, collect the value of the statistic from all the samples, and display the distribution of 
the statistic. The dotplot shows the approximate sampling distribution of p\P = the sample 
proportion of red chips. 


The dotplot at the right of the figure shows the distribution of the values of p/P from 500 
separate SRSs of size 20. This is the approximate sampling distribution of the statistic p.?: 


AP® EXAM TIP 


Terminology matters. Never just say “the distribution.” Always say “the distribution 
of [blank],” being careful to distinguish the distribution of the population, the 


distribution of sample data, and the sampling distribution of a statistic. Likewise, 
don’t use ambiguous terms like “sample distribution,” which could refer to the 
distribution of sample data or to the sampling distribution of a statistic. You will lose 
credit on free response questions for misusing statistical terms. 


As Figure 7.2 shows, there are three distinct distributions involved when we sample 
repeatedly and calculate the value of a statistic. 


e The population distribution gives the values of the variable for all individuals in the 
population. In this case, the individuals are the 200 chips and the variable we’re recording is 


color. Our parameter of interest is the proportion of red chips in the population, p=0.50. 
p = 0.50. 


e The distribution of sample data shows the values of the variable for the individuals in a 
sample. In this case, the distribution of sample data shows the values of the variable color for 
the 20 chips in the sample. For each sample, we record a value for the statistic p/?, the 
sample proportion of red chips. 


e The sampling distribution of the sample proportion displays the values of pAP from all 
possible samples of the same size. 
Remember that a sampling distribution describes how a statistic (e.g., pp ) varies in many 
samples from the population. However, the population distribution and the distribution of 
sample data describe how individuals (e.g., chips) vary. 


CHECK YOUR UNDERSTANDING 


. 


”, Mars,® Inc. says that the mix of colors in its M&M’S® Milk Chocolate Candies from its 
Hackettstown, NJ, factory is 25% blue, 25% orange, 12.5% green, 12.5% yellow, 12.5% red, 
and 12.5% brown. Assume that the company’s claim is true and that you will randomly 
select 50 candies to estimate the proportion that are orange. 


1. Identify the population, the parameter, the sample, and the statistic in this setting. 

2. Graph the population distribution. 

3. Imagine taking a random sample of 50 M&M’S Milk Chocolate Candies. Make a graph 
showing a possible distribution of the sample data. Give the value of the statistic for this 
sample. 


4. Which of these three graphs could be the approximate sampling distribution of the 
Statistic? Explain your choice. 
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p = sample proportion of orange candies p = sample proportion of orange candies 


Describing Sampling Distributions 


The fact that statistics from random samples have definite sampling distributions allows us to 
answer the question “How trustworthy is a statistic as an estimate of a parameter?” To get a 
complete answer, we will consider the shape, center, and variability of the sampling 
distribution. For reasons that will be clear later, we’ ll save shape for Sections 7.2 and 7.3. 

Here is an activity that gets you thinking about the center and variability of a sampling 
distribution. 


ACTIVITY The craft stick problem 


In this activity, you will create a statistic for estimating the total number of craft sticks in a 
bag (N). The sticks are numbered 1, 2, 3, ..., N. Near the end of the activity, your teacher 
will select a random sample of n=7n = 7 sticks and read the number on each stick to the 
class. The team that has the best estimate for the total number of sticks will win a prize. 


Stephen Orsillo/Alamy 


1. Form teams of three or four students. As a team, spend about 10 minutes brainstorming 
different ways to estimate the total number of sticks. Try to come up with at least three 
different statistics. 


2. Before your teacher provides the sample of sticks, use simulation to investigate the 
sampling distribution of each statistic. For the simulation, assume that there are N=100 
N = 100 sticks in the bag and that you will be selecting samples of size n=7.n = 7. 


e Using your TI-83/84 calculator, select an SRS of size 7 using the command 
RandIntNoRep(lower:1,upper:100,n:7). [With older OS, use the command 
RandInt(lower:1,upper:100,n:7) and verify that there are no repeated numbers. If there are 
repeats, press ENTER to get a new sample. ] 


e For each sample, calculate the value of each of your three statistics. 
e Graph these values on a set of dotplots like those shown here. 


e Perform as many trials of the simulation as possible. 


Statistic2 Statistic? 


Statistic 3 


Estimated total 


3. Based on the simulated sampling distributions, which of your statistics is likely to produce 
the best estimate? Discuss as a team. 


4. Your teacher will now select a random sample of n=7n = 7 sticks from the bag. On a 
piece of paper, write the names of your group members, your group’s estimate for the 
number of sticks in the bag (a number), and the statistic you used to calculate your 
estimate (a formula). 


CENTER: BIASED AND UNBIASED ESTIMATORS Figure 7.3 shows the simulated 
sampling distribution of p\?= proportion of red chips when selecting samples of size n=20 
n = 20 from a population where p = 0.5. 
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p = sample proportion of red chips 
FIGURE 7.3 The distribution of p\P = the sample proportion of red chips in 500 SRSs of size 
n=20n = 20 from a population where p = 0.5. 


Notice that the center of this distribution is very close to 0.5, the parameter value. In fact, if 
we took all possible samples of 20 chips from the population, calculated pip for each sample, 
and then found the mean of all those pAP -values, we’d get exactly 0.5. For this reason, we say 
that p’P is an unbiased estimator of p. 


DEFINITION Unbiased estimator 


A statistic used to estimate a parameter is an unbiased estimator if the mean of 
its sampling distribution is equal to the value of the parameter being estimated. 


In a particular sample, the value of an unbiased estimator might be greater than the value of 
the parameter or it might be less than the value of the parameter. However, because the 
sampling distribution of the statistic is centered at the true value, the statistic will not 
consistently overestimate or consistently underestimate the parameter. This fits with our 
definition of bias from Chapter 4. The design of a statistical study shows bias if it is very likely 
to underestimate or very likely to overestimate the value we want to know. 

We will confirm mathematically in Section 7.2 that the sample proportion p/P is an 
unbiased estimator of the population proportion p. This is a very helpful result if we’re dealing 
with a categorical variable (like color). With quantitative variables, we might be interested in 
estimating the population mean, median, minimum, maximum, Q, Q3, variance, standard 
deviation, JQR, or range. Which (if any) of these are unbiased estimators? 

Let’s revisit the “Sampling heights” example with John and Carol’s four sons to investigate 
one of these statistics. Recall that the heights of the four sons are 71, 75, 72, and 68 inches. 
Here again is the sampling distribution of the sample mean x © for samples of size 2: 


° ° : ° ° 
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69 70 71 72 73 74 


¥ = sample mean height (in.) 


To determine if the sample mean is an unbiased estimator of the population mean, we need 
to compare the mean of the sampling distribution to the value we are trying to estimate—the 
mean of the population yp. 

The mean of the sampling distribution of x @ is 

69.5+-70+71.5+71.5+73+73.5 


rr 


ux” =69.5+70+71.5+71.5+73+73.56=71.5° 6 


The mean of the population is 
71+75+72+68 TLS 
ic = . 
=71+75+72+684=71.5" 4 


Because these values are equal, this example suggests that the sample mean x = is an 
unbiased estimator of the population mean p.“: We will confirm this fact in Section 7.3. 


EXAMPLE | Estimating the range (>) 
Biased and unbiased estimators 


Brian Miller 


PROBLEM: In the “Sampling heights” example, we created the sampling distribution of the 
sample range for samples of size n=2n = 2 from the population of John and Carol’s four 
sons with heights of 71, 75, 72, and 68 inches tall. Is the sample range an unbiased estimator 
of the population range? Explain your answer. 


0 1 2 3 = 5 6 7 8 


Sample range of height (in.) 
SOLUTION: 
The mean of the sampling distribution is 
14+3+3+4+4+44+7 
—————————_ = 3.67 
1+3+3+4+4+76 = 3.67 6 


The range of the population 1s 

Population range = 75 — 68 =7 
Because the mean of the sampling distribution of the sample range (3.67) is not equal to 
the value tt is trying to estimate (7), the sample range is not an unbiased estimator of the 
population range. 
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Because the sample range is consistently smaller than the population range, the sample 
range is a biased estimator of the population range. 


Think About It 


WHY DO WE DIVIDE BY n - 1 WHEN CALCULATING THE SAMPLE 
STANDARD DEVIATION? Now that we know about sampling distributions and unbiased 
estimators, we can finally answer this question. In Chapter 1, you learned that the formula 
{> (ai-2)? 
= 3s, = 4 =. 
for the sample standard deviation is sx=); (xi-x )2n-1. \ You also learned 
that the value obtained before taking the square root in the standard deviation calculation is 


SS > (a; z \? 


known as the variance. That is, the sample variance is sx2=¥ (xi-x )2n-1.°" od 
In an inference setting involving a quantitative variable, we might be interested in 

estimating the variance 020° of the population distribution. The most logical choice for our 

estimator is the sample variance sx2.57- We used technology to select 500 SRSs of size n=4 


n = 4 from a population where the population variance is o2=9.0° = 9. For each sample, we 
recorded the value of two statistics: 


aN 2 
racla~tjaeeeet. (=-2) = g2 


var(n—1)=)) (xi-x )2n-1=sx2 n—1 
7 _ +. var(n) =—————— 
var(n)=>; (xi-x )2n n 


Figure 7.4 shows the approximate sampling distributions of these two statistics. The blue 
vertical lines mark the means of these two distributions. 
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FIGURE 7.4 Results from a simulation of 500 SRSs of size n=4n = 4 from a population with 


2 
variance 62=9.0° = 9. The sample variance sx28z (labeled “var(n-1)” in the figure) is an 
unbiased estimator. The “var(n)” statistic is a biased estimator. 


We can see that “var(n)” is a biased estimator of the population variance. The mean of 
its approximate sampling distribution (marked with a blue line segment) is clearly less than 
the value of the population parameter, o2=9.0° = 9. However, the statistic “var(n—-1)” 
(otherwise known as the sample variance sx282) is an unbiased estimator. Its values are 
centered at 62=9.0° = 9. That’s why we divide by n - 1 when calculating the sample 
variance—and when calculating the sample standard deviation. 


VARIABILITY: SMALLER IS BETTER! To get a trustworthy estimate of an unknown 
population parameter, start by using a statistic that’s an unbiased estimator. This ensures that 
you won’t consistently overestimate or consistently underestimate the parameter. 
Unfortunately, using an unbiased estimator doesn’t guarantee that the value of your statistic 
will be close to the actual parameter value. 


To investigate the variability of a statistic, let’s consider the proportion of people in a 
random sample who have ever watched the show Survivor. According to Nielsen ratings, 
Survivor was one of the most-watched television shows in the United States every week that it 
aired. Suppose that the true proportion of U.S. adults who have ever watched Survivor is 
p=0.37.P = 0.37. 

The top dotplot in Figure 7.5 shows the results of drawing 400 SRSs of size n=100 = 100 
from a large population with p=0.37? = 9-37 and recording the value of p\? = the sample 
proportion who have ever watched Survivor. We see that a sample of 100 people often gave a 
p/P quite far from the population parameter. 


Let’s repeat our simulation, this time taking 400 SRSs of size n=1000" = 1000 from a large 
population with proportion p = 0.37 who have watched Survivor. The bottom dotplot in Figure 
7.5 displays the distribution of the 400 values of p? from these larger samples. Both graphs 
are drawn on the same horizontal scale to make comparison easy. 


0.20 aT a an ee 0.30 0.35 0.40 0.45 0.55 
p = sample proportion who have watched Survivor — = 100) 
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FIGURE 7.5 The approximate sampling distribution of the sample proportion p\P from SRSs 
of size n=100n = 100 and n=1000n = 1000 chosen from a large population with proportion p 
= 0.37. Both dotplots show the results of 400 SRSs. 


We can see that the variability shown in the top dotplot in Figure 7.5 is much greater than 
the variability shown in the bottom dotplot. With samples of size 100, the standard deviation of 
these p/P values is about 0.047. Using SRSs of size 1000, the standard deviation of these pA? 


values is about 0.016. This confirms what we learned in Section 4.3: larger random samples 
tend to produce estimates that are closer to the true population value. 


One important and surprising fact is that the variability of a statistic does not depend very 
much on the size of the population, as long as the sample size is less than 10% of the 
population size. Suppose that in a small town of 25,000 people, 37% of the population have 
watched Survivor. That is, p = 0.37. Let’s simulate taking 400 SRSs of size 1000 from this 
small town and compute pip , the sample proportion who have watched Survivor. The results 
are shown in Figure 7.6. 
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FIGURE 7.6 The approximate sampling distribution of the sample proportion p\P from 400 
SRSs of size n=1000n = 1000 chosen from a population of 25,000 individuals with proportion 
p =0.37. 
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Notice that the distribution of p/? looks nearly the same when sampling from a small town 
of 25,000 residents as when sampling from the entire United States. In fact, the standard 
deviation for each sampling distribution is approximately 0.016. 


Sergey Skleznev/Shutterstock 


Why does the size of the population have little influence on the behavior of statistics from 
random samples? Imagine sampling harvested corn by thrusting a scoop into a large sack of 
com kernels. The scoop doesn’t know if it is surrounded by a bag of corn or by an entire 
truckload. As long as the corn is well mixed (so that the scoop selects a random sample), the 
variability of the result depends only on the size of the scoop. 


CHECK YOUR UNDERSTANDING 


( 


y The histogram on the left shows the interval (in minutes) between eruptions of the Old 
Faithful geyser for all 222 recorded eruptions during a particular month. For this population, 


the median is 75 minutes. We used technology to take 500 SRSs of size 10 from the 
population. The 500 values of the sample median are displayed in the histogram on the right. 
The mean of these 500 values is 73.5. 
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1. Is the sample median an unbiased estimator of the population median? Justify your 
answer. 

2. Suppose we had taken samples of size 20 instead of size 10. Would the variability of the 
sampling distribution of the sample median be larger, smaller, or about the same? Justify 
your answer. 


3. Describe the shape of the sampling distribution of the sample median. 


CHOOSING AN ESTIMATOR In many cases, it is obvious which statistic should be used as 
an estimator of a population parameter. If we want to estimate the population mean pl, use the 
sample mean x .£. If we want to estimate a population proportion p, use the sample proportion 
p\.P: However, in other cases, there isn’t an obvious best choice. When trying to estimate the 
population maximum in the “Craft stick” activity, there were many different estimators that 
could be used. 

To decide which estimator to use when there are several choices, consider both bias and 
variability. Imagine the true value of the population parameter as the bull’s-eye on a target and 
the sample statistic as an arrow fired at the target. Both bias and variability describe what 
happens when we take many shots at the target. 

Bias means that our aim is off and we consistently miss the bull’s-eye in the same direction. 
Our sample statistics do not center on the population parameter. In other words, our estimates 
are not accurate. High variability means that repeated shots are widely scattered on the target. 
Repeated samples do not give very similar results. In other words, our estimates are not very 
precise. Figure 7.7 shows this target illustration. 


Estimator with high bias Estimator with low bias 
and low variability and high variability 


(a) (b) 


Estimator with high bias Estimator with no bias 
and high variability and low variability 


(c) (d) 
FIGURE 7.7 Bias and variability of four different estimators. (a) High bias, low variability. (b) Low 
bias, high variability. (c) High bias, high variability. (d) The ideal estimator: no bias, low 
variability. 
Notice that an estimator with low variability can also have high bias, as in Figure 7.7(a). 


And an estimator with low or no bias can be quite variable, as in Figure 7.7(b). Ideally, we’d 
like to use an estimator that is unbiased with low variability, as in Figure 7.7(d). 


AP® EXAM TIP 


Make sure to understand the difference between accuracy and precision when 
writing responses on the AP® Statistics Exam. Many students use “accurate” when 
they really mean “precise.” For example, a response that says “increasing the 
sample size will make an estimate more accurate” is incorrect. It should say that 


increasing the sample size will make an estimate more precise. If you can’t 
remember which term to use, don’t use either of them. Instead, explain what you 
mean without using statistical vocabulary. 


Section 7.1 Summary 


A parameter is a number that describes a population. To estimate an unknown parameter, 
use a Statistic calculated from a sample. 


The population distribution of a variable describes the values of the variable for all 
individuals in a population. The sampling distribution of a statistic describes the values of 
the statistic in all possible samples of the same size from the same population. Don’t confuse 
the sampling distribution with a distribution of sample data, which gives the values of the 
variable for all individuals in a particular sample. 


A Statistic can be an unbiased estimator or a biased estimator of a parameter. A statistic is 
an unbiased estimator if the center (mean) of its sampling distribution is equal to the true 
value of the parameter. 


The variability of a statistic is described by the spread of its sampling distribution. Larger 
samples give less variability. 


When trying to estimate a parameter, choose a statistic with low or no bias and minimum 
variability. 


Section 7.1 Exercises 


For Exercises 1-6, identify the population, the parameter, the sample, and the statistic in each 
setting. 


1. 


pq 442 ») Healthy living From a large group of people who signed a card saying they 


intended to quit smoking, 1000 people were selected at random. It turned out that 210 
(21%) of these individuals had not smoked over the past 6 months. 


Unemployment Each month, the Current Population Survey interviews about 60,000 
randomly selected U.S. adults. One of their goals is to estimate the national 
unemployment rate. In October 2016, 4.9% of those interviewed were unemployed. 


Fillings How much do prices vary for filling a cavity? To find out, an insurance company 
randomly selects 10 dental practices in California and asks for the cash (non-insurance) 
price for this procedure at each practice. The interquartile range is $74. 


Warm turkey Tom is cooking a large turkey breast for a holiday meal. He wants to be 
sure that the turkey is safe to eat, which requires a minimum internal temperature of 


165°F. Tom uses a thermometer to measure the temperature of the turkey meat at four 
randomly chosen points. The minimum reading is 170°F. 


Iced tea On Tuesday, the bottles of Arizona Iced Tea filled in a plant were supposed to 
contain an average of 20 ounces of iced tea. Quality control inspectors selected 50 bottles 
at random from the day’s production. These bottles contained an average of 19.6 ounces 
of iced tea. 


Bearings A production run of ball bearings is supposed to have a mean diameter of 
2.5000 centimeters (cm). An inspector chooses 100 bearings at random from the run. 
These bearings have mean diameter 2.5009 cm. 


Exercises 7—10 refer to the small population of 5 students in the table. 


10. 


11. 


12, 


Name Gender Quiz score 
Abigail Female 10 
Bobby Male 5 
Carlos Male 10 
DeAnna Female 

Emily Female 


pg 444 |») Sample means List all 10 possible SRSs of size n=2,” ~ 2, calculate the 


mean quiz score for each sample, and display the sampling distribution of the sample 
mean on a dotplot. 


Sample minimums List all 10 possible SRSs of size n=3,” = 3; calculate the minimum 
quiz score for each sample, and display the sampling distribution of the sample minimum 
on a dotplot. 


Sample proportions List all 10 possible SRSs of size n=2,” = 2; calculate the proportion 
of females for each sample, and display the sampling distribution of the sample proportion 
on a dotplot. 


Sample medians List all 10 possible SRSs of size n=3,” = 3 calculate the median quiz 
score for each sample, and display the sampling distribution of the sample median on a 
dotplot. 


Doing homework A school newspaper article claims that 60% of the students at a large 
high school completed their assigned homework last week. Assume that this claim is true 
for the 2000 students at the school. 


a. Make a bar graph of the population distribution. 


b. Imagine one possible SRS of size 100 from this population. Sketch a bar graph of the 
distribution of sample data. 


Tall girls According to the National Center for Health Statistics, the distribution of height 
for 16-year-old females is modeled well by a Normal density curve with mean p= 64 


13. 


inches and standard deviation o=2.5 inches.o = 2.5 inches. Assume this claim is true for 
the three hundred 16-year-old females at a large high school. 


a. Make a graph of the population distribution. 


b. Imagine one possible SRS of size 20 from this population. Sketch a dotplot of the 
distribution of sample data. 


pg 445 |») More homework Some skeptical AP® Statistics students want to 


investigate the newspaper’s claim in Exercise 11, so they choose an SRS of 100 students 
from the school to interview. In their sample, 45 students completed their homework last 
week. Does this provide convincing evidence that less than 60% of all students at the 
school completed their assigned homework last week? 


a. What is the evidence that less than 60% of all students completed their assigned 
homework last week? 


b. Provide two explanations for the evidence described in part (a). 


We used technology to simulate choosing 250 SRSs of size n=100n = 100 froma 
population of 2000 students where 60% completed their assigned homework last week. 
The dotplot shows pAP = the sample proportion of students who completed their assigned 
homework last week for each of the 250 simulated samples. 


c. There is one dot on the graph at 0.73. Explain what this value represents. 


d. Would it be surprising to get a sample proportion of pA=0.45? = 0.45 or smaller in an 
SRS of size 100 when p=0.60? = 9-60? Justify your answer. 


e. Based on your previous answers, is there convincing evidence that less than 60% of all 
students at the school completed their assigned homework last week? Explain your 
reasoning. 


0.45 0.50 0.55 0.60 0.65 0.70 0.75 


p = simulated sample proportion who completed 
their assigned homework last week 


14. Tall girls? To see if the claim made in Exercise 12 is true at their high school, an AP® 


Statistics class chooses an SRS of twenty 16-year-old females at the school and measures 
their heights. In their sample, the mean height is 64.7 inches. Does this provide convincing 


evidence that 16-year-old females at this school are taller than 64 inches, on average? 


a. What is the evidence that the average height of all 16-year-old females at this school is 
greater than 64 inches, on average? 


b. Provide two explanations for the evidence described in part (a). 


We used technology to simulate choosing 250 SRSs of size n=20n = 20 from a population 
of three hundred 16-year-old females whose heights follow a Normal distribution with 
mean p=64/! = 64 inches and standard deviation p=2.5/4 = 2- inches. The dotplot shows 
x £= the sample mean height for each of the 250 simulated samples. 


c. There is one dot on the graph at 62.5. Explain what this value represents. 


d. Would it be surprising to get a sample mean of x =64.7£ = 64.7 or larger in an SRS of 
size 20 when p=64/ = ©4 inches and o=2.50 = 2.5 inches? Justify your answer. 


e. Based on your previous answers, is there convincing evidence that the average height 
of all 16-year-old females at this school is greater than 64 inches? Explain your 
reasoning. 
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62.0 625 63.0 635 640 645 65.0 65.5 66.0 


X¥ = simulated sample mean height (in.) 


15. Even more homework Refer to Exercises 11 and 13. Suppose that the sample proportion 
of students who did all their assigned homework last week is pA=57/100=0.57. 
Pp = 57/100 = 0.57. would this sample proportion provide convincing evidence that less 
than 60% of all students at the school completed all their assigned homework last week? 


Explain your reasoning. 


16. Even more tall girls Refer to Exercises 12 and 14. Suppose that the sample mean height 
of the twenty 16-year-old females is x “= 65.8 inches. Would this sample mean provide 
convincing evidence that the average height of all 16-year-old females at this school is 
greater than 64 inches? Explain your reasoning. 


Exercises 17 and 18 refer to the following setting. During the winter months, outside 
temperatures at the Starneses’ cabin in Colorado can stay well below freezing (32°F, or 0°C) 
for weeks at a time. To prevent the pipes from freezing, Mrs. Starnes sets the thermostat at 


50°F. The manufacturer claims that the thermostat allows variation in home temperature that 
follows a Normal distribution with o = 3°F. To test this claim, Mrs. Starnes programs her 
digital thermometer to take an SRS of n=10n = 10 readings during a 24-hour period. Suppose 
the thermostat is working properly and that the actual temperatures in the cabin vary according 
to a Normal distribution with mean p = 50°F and standard deviation o = 3°F. 


17. 


18. 


Cold cabin? The dotplot shows the results of taking 300 SRSs of 10 temperature readings 
from a Normal population with p“= 50 and o7= 3 and recording the sample standard 
deviation sx** each time. Suppose that the standard deviation from an actual sample is 
sx=5°F $2 = 5 F. what would you conclude about the thermostat manufacturer’s claim? 
Explain your reasoning. 


1.0 1.5 2.0 2.5 3.0 3 4.0 4.5 5.0 


S, = simulated sample standard deviation 
of temperature (°F) 


Really cold cabin The dotplot shows the results of taking 300 SRSs of 10 temperature 
readings from a Normal population with p“= 50 and o7= 3 and recording the sample 
minimum each time. Suppose that the minimum of an actual sample is 40°F. What would 
you conclude about the thermostat manufacturer’s claim? Explain your reasoning. 
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Simulated sample minimum temperature (°F) 


Exercises 19-22 refer to the small population of 4 cars listed in the table. 


Color Age (years) 
Red 1 
White 5 


Silver 8 


19. 


20. 


zi, 


22. 


pa ® 


24. 


25. 


Red 20 


pg 449 ») Sample proportions List all 6 possible SRSs of size n=2," ~ 2; calculate 


the proportion of red cars in the sample, and display the sampling distribution of the 
sample proportion on a dotplot. Is the sample proportion an unbiased estimator of the 
population proportion? Explain your answer. 


Sample minimums List all 6 possible SRSs of size n=2,” = 2) calculate the minimum age 
for each sample, and display the sampling distribution of the sample minimum on a 
dotplot. Is the sample minimum an unbiased estimator of the population minimum? 
Explain your answer. 


More sample proportions List all 4 possible SRSs of size n=3,” = 3 calculate the 
proportion of red cars in the sample, and display the sampling distribution of the sample 
proportion on a dotplot with the same scale as the dotplot in Exercise 19. How does the 
variability of this sampling distribution compare with the variability of the sampling 
distribution from Exercise 19? What does this indicate about increasing the sample size? 


More sample minimums List all 4 possible SRSs of size n=3,” = 3; calculate the 
minimum age for each sample, and display the sampling distribution of the sample 
minimum on a dotplot with the same scale as the dotplot in Exercise 20. How does the 
variability of this sampling distribution compare with the variability of the sampling 
distribution from Exercise 20? What does this indicate about increasing the sample size? 


A sample of teens A study of the health of teenagers plans to measure the blood 
cholesterol levels of an SRS of 13- to 16-year-olds. The researchers will report the mean 
x £ from their sample as an estimate of the mean cholesterol level 1 in this population. 
Explain to someone who knows little about statistics what it means to say that x @ is an 
unbiased estimator of p. 


Predict the election A polling organization plans to ask a random sample of likely voters 
who they plan to vote for in an upcoming election. The researchers will report the sample 
proportion p/P that favors the incumbent as an estimate of the population proportion p 
that favors the incumbent. Explain to someone who knows little about statistics what it 
means to say that p? is an unbiased estimator of p. 


Bias and variability The figure shows approximate sampling distributions of 4 different 
Statistics intended to estimate the same parameter. 


(i) 


‘ 
| Population parameter 


(ii) 


Population parameter 


(ii _ Jools da 


4 
| Population parameter 


(iv) ee 


Population parameter 


a. Which statistics are unbiased estimators? Justify your answer. 


b. Which statistic does the best job of estimating the parameter? Explain your answer. 
Multiple Choice Select the best answer for Exercises 26-30. 


26. Ata particular college, 78% of all students are receiving some kind of financial aid. The 
school newspaper selects a random sample of 100 students and 72% of the respondents 
say they are receiving some sort of financial aid. Which of the following is true? 


a. 78% is a population and 72% is a sample. 
b. 72% is a population and 78% is a sample. 
c. 78% is a parameter and 72% is a statistic. 
d. 72% is a parameter and 78% is a Statistic. 


e. 72% is a parameter and 100 is a statistic. 


27. A Statistic is an unbiased estimator of a parameter when 
a. the statistic is calculated from a random sample. 


b. ina single sample, the value of the statistic is equal to the value of the parameter. 


28. 


29. 


30. 


c. in many samples, the values of the statistic are very close to the value of the 
parameter. 


d. in many samples, the values of the statistic are centered at the value of the parameter. 


e. in many samples, the distribution of the statistic has a shape that is approximately 
Normal. 


In a residential neighborhood, the median value of a house is $200,000. For which of the 
following sample sizes is the sample median most likely to be above $250,000? 


a. n=10n = 10 

b. n=50n = 50 

c. n=100n = 100 
d. n=1000n = 1000 


e. Impossible to determine without more information. 


Increasing the sample size of an opinion poll will reduce the 

a. bias of the estimates made from the data collected in the poll. 

b. variability of the estimates made from the data collected in the poll. 
c. effect of nonresponse on the poll. 

d. variability of opinions in the sample. 


e. variability of opinions in the population. 


The math department at a small school has 5 teachers. The ages of these teachers are 23, 
34, 37, 42, and 58. Suppose you select a random sample of 4 teachers and calculate the 
sample minimum age. Which of the following shows the sampling distribution of the 
sample minimum age? 


(ay —-* —,-—_, 


(b) : 


0) Q 40 So. AD 
(d) © se 8 e 
20 30) 40 50s 


(e) None of these. 


Recycle and Review 


31. Dem bones (2.2) Osteoporosis is a condition in which the bones become brittle due to loss 
of minerals. To diagnose osteoporosis, an elaborate apparatus measures bone mineral 
density (BMD). BMD is usually reported in standardized form. The standardization is 
based on a population of healthy young adults. The World Health Organization (WHO) 
criterion for osteoporosis is a BMD score that is 2.5 standard deviations below the mean 
for young adults. BMD measurements in a population of people similar in age and gender 
roughly follow a Normal distribution. 


a. What percent of healthy young adults have osteoporosis by the WHO criterion? 


b. Women aged 70 to 79 are, of course, not young adults. The mean BMD in this age 
group is about —2 on the standard scale for young adults. Suppose that the standard 
deviation is the same as for young adults. What percent of this older population has 
osteoporosis? 


32. Squirrels and their food supply (3.2) Animal species produce more offspring when their 
supply of food goes up. Some animals appear able to anticipate unusual food abundance. 
Red squirrels eat seeds from pinecones, a food source that sometimes has very large crops. 
Researchers collected data on an index of the abundance of pinecones and the average 
number of offspring per female over 16 years.4 Computer output from a least-squares 
regression on these data and a residual plot are shown here. 


Predictor Coef SE Coef T P 
Constant 1.4146 0.2517 5.62 0.000 
Cone index 0.4399 0.1016 4,33 0.001 
S = 0.600309 R-Sq = 57.2% R-Sq(adj) = 54.2% 
154 
1.0 4 , 
| 0.57 o se [ 
3 °. 
cs ° 
i 
= &% . 
-0.5 + ° 
-1.0 - 
0 i 2 3 4 5 
Cone index 


a. Is a linear model appropriate for these data? Explain. 
b. Give the equation for the least-squares regression line. Define any variables you use. 


c. Interpret the values of r? and s in context. 


SECTION 7.2 Sample Proportions 


LEARNING TARGETS By the end of the section, you should be able to: 


e Calculate the mean and standard deviation of the sampling distribution of a 
sample proportion p’? and interpret the standard deviation. 


¢ Determine if the sampling distribution of p’? is approximately Normal. 


e If appropriate, use a Normal distribution to calculate probabilities involving p*.P: 


What proportion of U.S. teens know that 1492 was the year in which Columbus “discovered” 
America? A Gallup Poll found that 210 out of a random sample of 501 American teens aged 13 
to 17 knew this historically important date.2 The sample proportion 

210 


p =——=0.42 
pA=210501=0.42" 501 


is the statistic that we use to gain information about the unknown population proportion p. 
Because another random sample of 501 teens would likely result in a different estimate, we can 
only say that “about” 42% of all U.S. teenagers know that Columbus discovered America in 
1492. In this section, we’ll use sampling distributions to clarify what “about” means. 


AS FAR AS I KNoW THE WORLD 
THERE ISN'T A THERE ISN'T A So WHY IS. HAS MANY 
“CLEVELAND DAY’... “CINCINNATI DAY”... THERE A MYSTERIES. 
THERE ISN’T AN 
"AKRON DAY"... 


FOXTROT©2005 Bill Amend. Andrews McMeel Syndicate. 


The Sampling Distribution of p4? 


When Mrs. Gallas’s class did the “Penny for your thoughts” activity at the beginning of the 
chapter, her students produced the “dotplot” in Figure 7.8, which approximates the sampling 


distribution of the sample proportion of pennies from the 2000s for samples of size n=20. 
n= 20. 


p 
, ? 
ABs 
Pp PP 
PPP P 
P PP PP P 
PRPPR FP 
PeRPREE #P 
PPR ECE CP 
P PPPPPPRPP PP ?P 
0.3 0.4 05 0.6 0.F 0.28 0.9 7.0 


p = sample proportion of pennies from 20005 (n = 20) 


FIGURE 7.8 Approximate sampling distribution of the sample proportion of pennies from the 
2000s in 50 samples of size n=20n = 20 from a population of pennies. 


DEFINITION Sampling distribution of the sample proportion 


The sampling distribution of the sample proportion p’? describes the 
distribution of values taken by the sample proportion p’? in all possible samples 
of the same size from the same population. 


This distribution is roughly symmetric with a mean of about 0.65 and a standard deviation 
of about 0.10. By the end of this section, you should be able to anticipate the shape, center, and 
variability of distributions like this one without getting your hands dirty in a jar of pennies. 


ACTIVITY The candy machine 


Imagine a very large candy machine filled with orange, brown, and yellow candies. When 
you insert money, the machine dispenses a sample of candies. In this activity, you will use an 
applet to investigate the shape, center, and variability of the sampling distribution of the 
sample proportion p/.P: 


Reese’s Pieces® 


Probability of orange |0.5 
Number of candies {25 
Number ofsamples 1 
¥] Animate 


Draw Samples _ 
Total = 1 


O Number of orange zs 


@ Proportion of orange —=—=== 


Most recent p= 0.360 
As extreme as | 2 | || Count 


“| Summary Stats 


. Launch the Reese’s Pieces® applet at www.rossmanchance.com/applets. Make sure the 
Probability of orange = 0.5, Number of candies (sample size) = 25, and Number of 
samples = 1. Choose “Proportion of orange” as the statistic to be calculated and check the 
box for Summary stats to be calculated, as shown in the screen shot for Step 2. 


. Click on the “Draw Samples” button. An animated simple random sample of n=25n = 25 
candies should be dispensed. The following screen shot shows the results of one such 
sample where the sample proportion of orange candies was pA=0.360.? = 0.360. How far 
was your sample proportion of orange candies from the actual population proportion, p = 
0.50? 


. Click “Draw Samples” 9 more times so that you have a total of 10 sample proportions. 
Look at the dotplot of your p/? values. What is the mean of your 10 sample proportions? 
What is their standard deviation? 


. To take many more samples quickly, enter 990 in the “Number of samples” box. Click on 
the “Animate” box to turn the animation off. Then click “Draw Samples.” You have now 
taken a total of 1000 samples of 25 candies from the machine. Describe the shape, center, 
and variability of the approximate sampling distribution of pp shown in the dotplot. 


. How does the sampling distribution of pip change if the proportion of orange candies in 
the machine is different from p = 0.5? Use the applet to investigate this question. Then 
write a brief summary of what you learned. 


. How does the sampling distribution of pip change if the machine dispenses a different 
number of candies? Use the applet to investigate this question. Then write a brief 
summary of what you learned. 


. For what combinations of n and p is the sampling distribution of p/? approximately 
Normal? Use the applet to investigate this question. Then write a brief summary of what 
you learned. 


The graphs in Figure 7.9 show approximate sampling distributions of p? for different 


combinations of p and n. 


Mean = 0.100 Mean = 0.500 Mean = 0.900 
SD = 0.060 SD = 0.100 SD = 0.060 
| | t T t | | f oa coxt T t | 
0.08 0.16 0.24 0.32 0.12 0.24 036 048 0.60 0.72 0.84 0.56 0.64 0.72 0.80 0.88 0.96 
p = sample proportion of orange candies p = sample proportion of orange candies p = sample proportion of orange candies 
(p = 0.10, n = 25) (p = 0.50, n = 25) (p = 0.90, n = 25) 

Mean = 0.100 Mean = 0.500 
SD = 0.030 SD = 0.050 


0 0.08 0.16 0.24 0.32 


p = sample proportion of orange candies 
(p = 0.10, n = 100) 


0.12 0.24 0.36 0.48 


0.60 0.72 0.84 


p = sample proportion of orange candies 
(p = 0.50, n = 100) 


0.56 0.64 0.72 0.80 0.88 0.96 
p = sample proportion of orange candies 
(p = 0.90, n = 100) 


FIGURE 7.9 Approximate sampling distributions of p\P = sample proportion of orange candies 
for different combinations of p (population proportion) and n (Sample size). 


What do these graphs teach us about the sampling distribution of p\?P ? 


Shape: When n is small and p is close to 0, the sampling distribution of pp is skewed to the 
right. When n is small and p is close to 1, the sampling distribution of p/? is skewed to the left. 
Finally, the sampling distribution of p\? becomes more Normal when p is closer to 0.5 or n is 


larger (or both). 


Center: The mean of the sampling distribution of p? is equal to the population proportion p. 
This makes sense because the sample proportion p/P is an unbiased estimator of p. 


Variability: The value of op\°P depends on both n and p. For a specific sample size, the 
standard deviation opp is larger for values of p close to 0.5 and smaller for values of p close 
to 0 or 1. For a specific value of p, the standard deviation op\°? gets smaller as n gets larger. 
Specifically, multiplying the sample size by 4 cuts the standard deviation in half. 


Here’s a summary of the important facts about the sampling distribution of p\.P: 


SAMPLING DISTRIBUTION OF A SAMPLE PROPORTION p/P 


Choose an SRS of size n from a population of size N with proportion p of successes. Let p/ 
P be the sample proportion of successes. Then: 


e The mean of the sampling distribution of p\? is pp/=p."” 


— Pp. 


e The standard deviation of the sampling distribution of pAP is 


| p(.—p) 
A=p(1- °p ~\ n 
op/=p(1—p)n 


as long as the 10% condition is satisfied: n<0.10N.n < 0.10N. The value op*? measures 
the typical distance between a sample proportion p/? and the population proportion p. 


¢ The sampling distribution of p\? is approximately Normal as long as the Large Counts 
condition is satisfied: np>10"P = 10 and n(1—p)=10.” (1—p) > 10. 


The two conditions mentioned in the preceding box are very important. 


¢ Large Counts condition: If we assume that the sampling distribution of p/? is approximately 
Normal when it isn’t, any calculations we make using a Normal distribution will be flawed. 


e 10% condition: When we’re sampling without replacement from a (finite) population, the 
observations are not independent, because knowing the outcome of one trial helps us predict 
the outcome of future trials. But the standard deviation formula assumes that the 
observations are independent. If we sample too large a fraction of the population, our 
calculated value of op\*p will be too large. 


We call it the “Large Counts” condition because np is the expected count of successes in the 
sample and n (1 — p) is the expected count of failures in the sample. 


Because larger random samples give better information, it sometimes makes sense to 
sample more than 10% of a population. In such a case, there’s a more accurate formula for 
calculating the standard deviation op’.°?* It uses something called a finite population 
correction (FPC). We’|l avoid situations that require the FPC in this text. 


EXAMPLE | Backing the pack (>) 
The sampling distribution of p/? 


bo1982/Getty Images 


PROBLEM: Suppose that 84% of students at a large high school regularly use a backpack 
to carry their books from class to class. Imagine taking an SRS of 100 students and 
calculating p/? = the proportion of students in the sample who regularly use a backpack. 


a. Identify the mean of the sampling distribution of p\.P: 


b. Calculate and interpret the standard deviation of the sampling distribution of p/.P: Verify 
that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p\.P: Justify your answer. 


SOLUTION: 


a. ppA= 0.8.4» — 0.84 


up» = p 
b. Assuming that n = 100 students ts less than 10% of students in a large high school, 


0.84(1—0.84 
= 4/ ———_———— = 0.0367 


o- 
op’ = 0.84(1 — 0.84)100=0.0367 ” 100 


In SRSs of size 100, the sample proportion of students who regularly use a backpack 
will typically vary by about 0.0367 from the true proportion of (= 0.84. 


When <0.10N, “ 0.10N, 


op*=p(i-pyn’e Vo 


c. Because 100(0.84)=84>10!00 (0.84) = 84 > 10 gig 100(1-0.8.4 )=16=10, 


100(1 — 0.84) = 16 > 10, iff, sampling distribution of pip is approximately Normal. 


When np210"P = 10 and n (1 - p) == 10, the sampling distribution of p\P is 
approximately Normal. 


FOR PRACTICE, TRY EXERCISE 33 


Think About It 


HOW IS THE SAMPLING DISTRIBUTION OF prp RELATED TO THE BINOMIAL 
COUNT X? From Chapter 6, we know that the mean and standard deviation of a binomial 
random variable X are 


uX=np and oX=np(1-p)x = "P and ox = ,/np(1—p) 
The sample proportion of successes is closely related to X: 
_ count of successes in sample ia 


P F — 
p/=count of successes in samplesample size=Xn sample size S 


Because p\=X/n=(1/n)X,P = X/n= (1/n) X, we're just multiplying the random variable X 
by a constant (1/n) to get the random variable p/.?- Recall from Chapter 6 that multiplying a 
random variable by a constant multiplies both the mean and the standard deviation by that 
constant. We have 


1 1 
uph=1n Xin (np)=p hn (np) =p 


op/=1n oX=1nnp(1-p)=np(1-p)n2=p(1-p)n 

a t mies J ‘np(l—p) __ [p(1—p) 

o- =—ax =— \/ np ( (l—p | ——————_ = _4/ ———— 
n | nz ym 

What about shape? Multiplying a random variable by a positive constant doesn’t change 
the shape of the probability distribution. So the sampling distribution of pAP will have the 
same shape as the distribution of the binomial random variable X. If you studied the optional 
material in Chapter 6 about the Normal approximation to a binomial distribution, then you 
already know that a Normal distribution can be used to approximate the sampling 
distribution of pip whenever both np and n(1 — p) are at least 10. 


CHECK YOUR UNDERSTANDING 


. 


y Suppose that 75% of young adult Internet users (ages 18 to 29) watch online videos. A 

polling organization contacts an SRS of 1000 young adult Internet users and calculates the 

proportion p/P in this sample who watch online videos. 

1. Identify the mean of the sampling distribution of p\.P: 

2. Calculate and interpret the standard deviation of the sampling distribution of p/.?: Check 
that the 10% condition is met. 

3. Is the sampling distribution of p\? approximately Normal? Check that the Large Counts 
condition is met. 

4. If the sample size were 9000 rather than 1000, how would this change the sampling 
distribution of prep? 


Using the Normal Approximation for p/? 


Inference about a population proportion p is based on the sampling distribution of p\.P: When 
the sample size is large enough for np and n(1 — p) to both be at least 10 (the Large Counts 
condition), the sampling distribution of p/? is approximately Normal. In that case, we can use 
a Normal distribution to estimate the probability of obtaining an SRS in which p/? lies ina 
specified interval of values. Here is an example. 


EXAMPLE | Going to college (>) 
Normal calculations involving p’? 


lvan Vukovic/Shutterstock 


PROBLEM: A polling organization asks an SRS of 1500 first-year college students how far 
away their home is. Suppose that 35% of all first-year students attend college within 50 
miles of home. Find the probability that the random sample of 1500 students will give a 
result within 2 percentage points of the true value. 


SOLUTION: 


Up = 0.35ttn = 0.35 


Assuming that 1500 < 10% of all first-year college students, 


——-[(0.35)(0.65) 


Tame = 0.0123 
Bp (0.35)(0.65)1500 = 0.012 3? \ 1500 


Calculate the mean and standard deviation of the sampling distribution of p’.P: 
Up*=p 


Hs =P 


o> = 
op*=p (1-p)n 


Because np = 1 500(0.35) = 525 = 10 and n(1 - p = 1500(0.65) = 975 = 10, the distribution 
of [ee od 1s approximately Normal. 


Justify that the distribution of p*P iS approximately Normal using the Large Counts 


condition. 


ry 


0.3131 0.3254\ 0.337% 0.35 0.3623 0.3746 0.3869 
A 
p= 0.33 p= 0.37 
A 
P= sample proportion within 50 miles 


, — 0:33-0.35 _ _1 gg 
(i) z=0.33 — 0.350.0123=-1.63, 0.0128 «sand z = 0.37 — 0.350.0123 =1.63 


0.37—0.35 : 
~ - ,0123 = 1.63 


~ 
“~~ 


Using Table A: P(0.33<p<0.37)=P(-1.63<2<1.63)=0.9484-0.0516=0.8968 
P(0.33 < p < 0.37) = P(-1.63 <z< 1.63) = 0.9484 — 0.0516 = 0.8968 


‘Using technology: normalcdf (lower:-1.63, upper:1.63, mean:o, SD:1) = 0.8969 
(ii) normalcdf (lower:0.33, upper:0.37, mean:0.35, SD:07012 2) = 0.8961 


. Draw a Normal distribution. 
. Perform calculations. 


i. Standardize and use Table A or technology or 
il. Use technology without standardizing. 
Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 43 


In the preceding example, about 90% of all SRSs of size 1500 from this population will 
give aresult within 2 percentage points of the truth about the population. This result also 
suggests that in about 90% of all SRSs of size 1500 from this population, the true proportion 
will be within 2 percentage points of the sample proportion. This fact will become very 
important in Chapter 8 when we use sample data to create an interval of plausible values for a 
population parameter. 


Section 7.2 Summary 


e When we want information about the population proportion p of successes, we often take an 
SRS and use the sample proportion p/? to estimate the unknown parameter p. The sampling 
distribution of the sample proportion p’? describes how the statistic p/P varies in all 
possible samples of the same size from the population. 

e The mean of the sampling distribution of pip is up/=p.! ‘p ~P* So p/P is an unbiased 
estimator of p. 


e The standard deviation of the sampling distribution of pAP is op4=p(1-p)/n 


05 = /p(1—p)/n for an SRS of size n. This formula can be used if the sample size is less 
than 10% of the population size (the 10% condition). 


e The sampling distribution of p\? is approximately Normal when both np > 10 and n(1 — p) 
> 10 (the Large Counts condition). 


Section 7.2 Exercises 


33. PQ 462 |) Registered voters In a congressional district, 55% of registered voters are 


Democrats. A polling organization selects a random sample of 500 registered voters from 
this district. Let p\? = the proportion of Democrats in the sample. 


a. Identify the mean of the sampling distribution of p/.P: 


b. Calculate and interpret the standard deviation of the sampling distribution of p\.P: 
Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.P: Justify your answer. 


34. Married with children According to a recent U.S. Bureau of Labor Statistics report, the 
proportion of married couples with children in which both parents work outside the home 
is 59%.© You select an SRS of 50 married couples with children and let p\? = the sample 
proportion of couples in which both parents work outside the home. 


a. Identify the mean of the sampling distribution of p\.P: 


b. Calculate and interpret the standard deviation of the sampling distribution of p\.P: 
Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p’.P: Justify your answer. 


35. Orange Skittles® The makers of Skittles claim that 20% of Skittles candies are orange. 
Suppose this claim is true. You select a random sample of 30 Skittles from a large bag. Let 
p/P = the proportion of orange Skittles in the sample. 


a. Identify the mean of the sampling distribution of p\.P: 


b. Calculate and interpret the standard deviation of the sampling distribution of p\.P: 
Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p’.P: Justify your answer. 


36. Male workers A factory employs 3000 unionized workers, 90% of whom are male. A 
random sample of 15 workers is selected for a survey about worker satisfaction. Let pA? = 
the proportion of males in the sample. 


a. Identify the mean of the sampling distribution of p/.P: 


b. Calculate and interpret the standard deviation of the sampling distribution of p\.P: 
Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.P: Justify your answer. 


37. More Skittles® What sample size would be required to reduce the standard deviation of 
the sampling distribution to one-half the value you found in Exercise 35(b)? Justify your 
answer. 


38. More workers What sample size would be required to reduce the standard deviation of 


39. 


40. 


41. 


42. 


the sampling distribution to one-third the value you found in Exercise 36(b)? Justify your 
answer. 


Airport security The Transportation Security Administration (TSA) is responsible for 
airport safety. On some flights, TSA officers randomly select passengers for an extra 
security check before boarding. One such flight had 76 passengers—12 in first class and 
64 in coach class. TSA officers selected an SRS of 10 passengers for screening. Let p\P 
be the proportion of first-class passengers in the sample. 


a. Is the 10% condition met in this case? Justify your answer. 
b. Is the Large Counts condition met in this case? Justify your answer. 


Scrabble® In the game of Scrabble, each player begins by drawing 7 tiles from a bag 
containing 100 tiles. There are 42 vowels, 56 consonants, and 2 blank tiles in the bag. Cait 
chooses an SRS of 7 tiles. Let p4? be the proportion of vowels in her sample. 


a. Is the 10% condition met in this case? Justify your answer. 


b. Is the Large Counts condition met in this case? Justify your answer. 


Do you drink the cereal milk? A USA Today poll asked a random sample of 1012 U.S. 
adults what they do with the milk in the bowl after they have eaten the cereal. Let pp be 
the proportion of people in the sample who drink the cereal milk. A spokesman for the 
dairy industry claims that 70% of all U.S. adults drink the cereal milk. Suppose this claim 
is true. 


a. What is the mean of the sampling distribution of pi2P ? 


b. Find the standard deviation of the sampling distribution of p/.P- Verify that the 10% 
condition is met. 


c. Verify that the sampling distribution of p/? is approximately Normal. 


d. Of the poll respondents, 67% said that they drink the cereal milk. Find the probability 
of obtaining a sample of 1012 adults in which 67% or fewer say they drink the cereal 
milk, assuming the milk industry spokesman’s claim is true. 


e. Does this poll give convincing evidence against the spokesman’s claim? Explain your 
reasoning. 


Do you go to church? The Gallup Poll asked a random sample of 1785 adults if they 
attended church during the past week. Let p/P be the proportion of people in the sample 
who attended church. A newspaper report claims that 40% of all U.S. adults went to 
church last week. Suppose this claim is true. 


a. What is the mean of the sampling distribution of p\?P ? 


b. Find the standard deviation of the sampling distribution of p/.P- Verify that the 10% 
condition is met. 


43. 


44. 


45. 


46. 


c. Verify that the sampling distribution of p/? is approximately Normal. 


d. Of the poll respondents, 44% said they did attend church last week. Find the probability 
of obtaining a sample of 1785 adults in which 44% or more say they attended church 
last week, assuming the newspaper report’s claim is true. 


e. Does this poll give convincing evidence against the newspaper’s claim? Explain your 
reasoning. 


pg 464 |») Students on diets Suppose that 70% of college women have been on a diet 


within the past 12 months. A sample survey interviews an SRS of 267 college women. 
What is the probability that 75% or more of the women in the sample have been on a diet? 


Who owns a Harley? Harley-Davidson motorcycles make up 14% of all the motorcycles 
registered in the United States. You plan to interview an SRS of 500 motorcycle owners. 
How likely is your sample to contain 20% or more who own Harleys? 


On-time shipping A mail-order company advertises that it ships 90% of its orders within 
three working days. You select an SRS of 100 of the 5000 orders received in the past 
week for an audit. The audit reveals that 86 of these orders were shipped on time. 


a. If the company really ships 90% of its orders on time, what is the probability that the 
proportion in an SRS of 100 orders is 0.86 or less? 


b. Based on your answer to part (a), is there convincing evidence that less than 90% of all 
orders from this company are shipped within three working days? Explain your 
reasoning. 


Wait times A hospital claims that 75% of people who come to its emergency room are 
seen by a doctor within 30 minutes of checking in. To verify this claim, an auditor 
inspects the medical records of 55 randomly selected patients who checked into the 
emergency room during the last year. Only 32 (58.2%) of these patients were seen by a 
doctor within 30 minutes of checking in. 


a. If the wait time is less than 30 minutes for 75% of all patients in the emergency room, 
what is the probability that the proportion of patients who wait less than 30 minutes is 
0.582 or less in a random sample of 55 patients? 


b. Based on your answer to part (a), is there convincing evidence that less than 75% of all 
patients in the emergency room wait less than 30 minutes? Explain your reasoning. 


Multiple Choice Select the best answer for Exercises 47-50. 


Exercises 47—49 refer to the following setting. The magazine Sports Illustrated asked a 
random sample of 750 Division I college athletes, “Do you believe performance- 
enhancing drugs are a problem in college sports?” Suppose that 30% of all Division I 
athletes think that these drugs are a problem. Let p/? be the sample proportion who say 
that these drugs are a problem. 


47. 


48. 


49. 


50. 


Which of the following are the mean and standard deviation of the sampling distribution 
of the sample proportion p’?? ? 


a. Mean = 0.30, SD = 0.017 

b. Mean = 0.30, SD = 0.55 

c. Mean = 0.30, SD = 0.0003 

d. Mean = 225, SD = 12.5 

e. Mean = 225, SD = 157.5 

Decreasing the sample size from 750 to 375 would multiply the standard deviation by 
a. 2. (d)1/2.1/v2. 

b. 2v2 (e) none of these. 

e. 1/2, 

The sampling distribution of p\? is approximately Normal because 
a. there are at least 7500 Division I college athletes. 

b. np = 225 and n(1 — p) = 525 are both at least 10. 

c. arandom sample was chosen. 

d. the athletes’ responses are quantitative. 


e. the sampling distribution of p/P always has this shape. 


In a congressional district, 55% of the registered voters are Democrats. Which of the 
following is equivalent to the probability of getting less than 50% Democrats in a random 
sample of size 100? 


—— 
a 


P( 
a. P(z<0.50—-0.55100) 


Plzeze 0.50—-0.55 
/ 0.55(0.45) 


b. P(z<0.50-0.550.55(0.45)100) \~ 100 


P ~ _ .0.55—0.50 
de, ns 
/ 0.55(0.45) 


c. P(z<0.55—-0.500.55(0.45)100) \ 100 


d. P(z<0.50—0.55100(0.55)(0.45)) v 100(0.55)(0.45) 


e. P(z<0.55—0.50100(0.55)(0.45)) V/100(0-55)(0.45) 
Recycle and Review 


51. Sharing music online (5.2, 5.3) A sample survey reports that 29% of Internet users 
download music files online, 21% share music files from their computers, and 12% both 


download and share music.Z 


a. Make a two-way table that displays this information. 
b. What percent of Internet users neither download nor share music files? 


c. Given that an Internet user downloads music files online, what is the probability that 
this person also shares music files? 


52. Whole grains (4.2) A series of observational studies revealed that people who typically 
consume 3 servings of whole grain per day have about a 20% lower risk of dying from 


heart disease and about a 15% lower risk of dying from stroke or cancer than those who 


consume no whole grains.2 


a. Explain how confounding makes it difficult to establish a cause-and-effect relationship 
between whole grain consumption and risk of dying from heart disease, stroke, or 
cancer, based on these studies. 


b. Explain how researchers could establish a cause-and-effect relationship in this context. 


SECTION 7.3 Sample Means 


LEARNING TARGETS By the end of the section, you should be able to: 


e Calculate the mean and standard deviation of the sampling distribution of a 
sample mean X * and interpret the standard deviation. 


e Explain how the shape of the sampling distribution of X X is affected by the 
shape of the population distribution and the sample size. 


e If appropriate, use a Normal distribution to calculate probabilities involving X . 
Xx. 


Sample proportions arise most often when we are interested in categorical variables. We then 
ask questions like “What proportion of U.S. adults has watched Survivor?” or “What percent of 
the adult population attended church last week?” But when we record quantitative variables— 
household income, lifetime of car brake pads, blood pressure—we are interested in other 
Statistics, such as the median or mean or standard deviation of the variable. The sample mean 

x is the most common statistic computed from quantitative data. 


The Sampling Distribution of X Xx 


When Mrs. Gallas’s class did the “Penny for your thoughts” activity at the beginning of the 
chapter, her students produced the “dotplots” in Figure 7.10, which approximate the sampling 
distribution of the sample mean year of penny for samples of size n = 5 and for samples of 
size n = 20. 


DEFINITION Sampling distribution of the sample mean 


The sampling distribution of the sample mean X X describes the distribution 


of values taken by the sample mean X in all possible samples of the same 
size from the same population. 


_x 
X xX X X 
- << X%_ XXXXXKX X 
X KX XX XKXXKXXKXXX 
XXX XXX KXXXKXXKXXXXXX XK 
1990 1995 2000 2005 200 205 


X= Sample mean yedn= 5) 


1990 1995 2000 2005 200 205 
¥= Sample mean yean= 20 
FIGURE 7.10 Approximate sampling distribution of the sample mean year of pennies in 50 
samples of size n = 5 and 50 samples of size n = 20 from a population of pennies. 


How do these approximate sampling distributions compare? 


e Shape: The distribution of x £ is slightly skewed to the left when using samples of size n = 
5 but roughly symmetric when using samples of size n = 20. 
e Center: The distribution of x £ is centered at around 2002 for both sample sizes 
(ux * 2002)Hz © 2002. 
e Variability: The distribution of x ® is about half as variable when using samples of size n = 
20 (ox 2.6%: ~ 2-6) than with samples of size n = 5 (ox ¥5.2%% ~ 5-2), 
Like the sampling distribution of p/,?> there are some simple rules that describe the mean 


and standard deviation of the sampling distribution of x .%. Describing the shape of the 
sampling distribution of x £ is more complicated, so we’ll save that for later. 


SAMPLING DISTRIBUTION OF x =: MEAN AND STANDARD 
DEVIATION 


Suppose that x £ is the mean of an SRS of size n drawn from a large population with mean 
pt and standard deviation o. Then: 


e The mean of the sampling distribution of x @ is px =p.4e ~ H: 


e The standard deviation of the sampling distribution of x # is 


as long as the 10% condition is satisfied: n<0.10N.n < 0.10N. The value ox 7% measures 
the typical distance between a sample mean x @ and the population mean i. 


The behavior of x Z in repeated samples is much like that of the sample proportion pA:P : 
e The sample mean x @ is an unbiased estimator of the population mean p. 


e The variability of x # depends on both the variability in the population o@ and the sample 
size n. Values of x £ will be more variable for populations that have more variability. 
Values of x £ will be less variable for larger samples. Specifically, multiplying the sample 
size by 4 cuts the standard deviation in half. 


¢ You should use the formula o/n?/V" for the standard deviation of x @ only when the 
sample size is less than 10% of the population size (the 10% condition). There is a more 
complicated formula for the standard deviation when the sample size is larger than 10% of 
the population size, but it is beyond the scope of this book. 


Notice that these facts about the mean and standard deviation of x = are true no matter what 
shape the population distribution has. 


AP® EXAM TIP 


Notation matters. The symbols p%,x_,N,P,H,0,UP*,Op% px 2272 Ps Ho Fo Mp Op as 
and ox 7= all have specific and different meanings. Either use notation correctly— 
or don’t use it at all. You can expect to lose credit if you use incorrect notation. 


EXAMPLE | Been to the movies recently? (>) 
Mean and standard deviation of X X 


how PLAYING 


PatriciaPix/Getty Images 


PROBLEM: The number of movies viewed in the last year by students at a large high 
school has a mean of 19.3 movies with a standard deviation of 15.8 movies. Suppose we take 
an SRS of 100 students from this school and calculate the mean number of movies viewed 
by the members of the sample. 


a. Identify the mean of the sampling distribution of x .®. 


b. Calculate and interpret the standard deviation of the sampling distribution of x .%- Verify 
that the 10% condition is met. 


SOLUTION: 


©, Wi = 16 19.3 movies 


UX =p = 


b. Assuming that n = 100 is less than 10% of students at the large high school, ox” = 


15.8 ~ 
oz = — = 1.58 ; 
15.8100 = 1.58 v 100 movies. 


When n <0.10N,Whenn < 0.10N, 


C's — 
Ox =oOn “ie 


In SRSs of size 100, the sample mean number of movies viewed will typically vary by 
about 1.58 movies from the true mean of 19.3 movies. 
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Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND STANDARD DEVIATION 
OF x © COME FROM? Choose an SRS of size n from a population, and measure a 
quantitative variable X on each individual in the sample. Call the individual measurements 
X1, Xp, ..., Xp. If the population is large relative to the sample, we can think of these X;’s as 


independent random variables, each with mean p/ and standard deviation o. Because 


1 
-_ z=>— (X; + Xo +... +X,) 
x =1n(X1+X2+...+Xn) on 


we can use the rules for random variables from Chapter 6 to find the mean and standard 


—_ - cE 
deviation of x .@. If we let T= X,+X)+...+X,, then x =1nT.” ~ E 


Using the addition rules for means and variances, we get 
wT=pX1+pX2+---+pXn=ptpt+:--+p=npoT2=0XK12+0X22+---+oXn2=02+02+---+o2=no2 
br = Px, +X, Teoo + MX, HRT +tpH Ny 


9 
ra 


2: am et 2 } ’ n 2 a ee 
Or = OX, t oX, oX,, — O oT : = no 


oaks — / yi 2 _ in 
=0T=n02=on Pe ynes See 


Because x = is just a constant multiple of the random variable T, 


1 1 
= Lr =— tan Tl — 
[Lx =1npT=1n(np)=p" ni = BW) = 8 
1 1 _— } 7 {1 1 o 
og = or = (yn) = 0,|—5 = 04 — = 0 = 
ox =1noT=1n(on)=onn2=01n=0ln=on " n V n2 n Jn vy 


Sampling from a Normal Population 


We have described the mean and standard deviation of the sampling distribution of a sample 
mean x = but not its shape. That’s because the shape of the sampling distribution of x # 
depends on the shape of the population distribution. In one important case, there is a simple 
relationship between the two distributions. The following activity shows what we mean. 


ACTIVITY Exploring the sampling distribution of x = for a Normal 
population 


Professor David Lane of Rice University has developed a wonderful applet for investigating 
the sampling distribution of x .Z. It’s dynamic, and it’s fun to play with. In this activity, 
you’ll use Professor Lane’s applet to explore the shape of the sampling distribution when the 
population is Normally distributed. 


1. Go to http://onlinestatbook.com/stat_sim/sampling_dist/ or search for “online statbook 
sampling distributions applet” and go to the website. When the BEGIN button appears on 
the left side of the screen, click on it. You will then see a yellow page entitled “Sampling 
Distributions” like the one in the screen shot. 


mean= 16,00 Parent population 


median= 16.00 Clear lower 3 
sd= 5.00 |Normal | 
Normal * 
skew= 0.00 
kurtosis= 0.00 
0 Sample: 

Reps= 5 Sample Data | Animated | 
range= 13.00 : ; 

4 

3 10,000 

2 100,000 

1 

0 0° ’ T 32 
Reps= 6 Distribution of Means, N=5 
mcean= 16.05 6 
median= 16.00 ; 
sd= 1.89 3 LJ Fit normal 
skew= 0.02 > 
kurtosis= —0,43 1 


= 
= 


Q) 32 


2. There are choices for the population distribution: Normal, uniform, skewed, and custom. 
Keep the default option: Normal. Click the “Animated” button. What happens? Click the 
button several more times. What do the black boxes represent? What is the blue square 
that drops down onto the plot below? 


3. Click on “Clear lower 3” to start clean. Then click on the “100,000” button under 
“Sample:” to simulate taking 100,000 SRSs of size n = 5 from the population. Answer 
these questions: 


e Does the simulated sampling distribution of x % (blue bars) have a recognizable shape? 
Click the box next to “Fit normal.” 


e To the left of each distribution is a set of summary statistics. Compare the mean of the 
simulated sampling distribution with the mean of the population. 


e How is the standard deviation of the simulated sampling distribution related to the 
standard deviation of the population? 


4. Click “Clear lower 3.” Use the drop-down menus to set up the bottom graph to display the 
mean for samples of size n = 20. Then sample 100,000 times. How do the two 
distributions of x © compare: shape, center, and variability? 


5. What have you learned about the shape of the sampling distribution of x % when the 
population has a Normal shape? 


As the preceding activity demonstrates, if the population distribution is Normal, then so is 
the sampling distribution of x .%. This is true no matter what the sample size is. 


| SAMPLING DISTRIBUTION OF THE SAMPLE MEAN X X WHEN | 


SAMPLING FROM A NORMAL POPULATION 


Suppose that a population is Normally distributed with mean p and standard deviation o. 
Then the sampling distribution of x Z has the Normal distribution with mean px =pHz ~ # 
and standard deviation ox =o/n?# = 9/V™ (provided the 10% condition is met). 


We already knew the mean and standard deviation of the sampling distribution. All we have 
added is the Normal shape. Now we have enough information to calculate probabilities 
involving x = when the population distribution is Normal. 


EXAMPLE | Young women’s heights (>) 
Sampling from a Normal population 


Alamy Images 


PROBLEM: The heights of young women follow a Normal distribution with mean pi = 64.5 
inches and standard deviation o = 2.5 inches. 


a. Find the probability that a randomly selected young woman is taller than 66.5 inches. 


b. Find the probability that the mean height of an SRS of 10 young women exceeds 66.5 
inches. 


SOLUTION: 


S¥ SIS 62 64.5 67 63.5 72 
66.5 


Height Cin.) 


1. Draw a Normal distribution. 


— = 0.80 
z = 66.5 — 64.52.5 = 0.80 2.5 
‘Using 
Table P(X > 66.5) = P(z> 0.80) =1—0.7881 


A: P(X>66.5)=P(z>0.80)=1-0.7881=0.2119 = 0.2119 


Using technology: normalcdf (lower:0.80, upper:1000, mean:o, SD:1) = 0.2119 


it. normalcdf (lower:66.5, upper:1000, mean:6 4.5, SD:2.5) = O2116 


2. Perform calculations. 
i. Standardize and use Table A or technology or 


il. Use technology without standardizing. 
Be sure to answer the question that was asked. 


6 uxX-=64.shx = 64.5 
Because 10 < 10% of all young women, 


Justify that the distribution of X X is Normal. 


62.13 62.92 63.71 64.5 6529 66.08 | 66.8F 
X= 66.5 


x = Sample mean height (in.) 


ee = Dill = Oye Vv 


Calculate the mean and standard deviation of the sampling distribution of XX - 
ux Hz = yeh 


oO 


oC 


Jn 


1 


z = 66.5 - 64.50.79 = 2.53 


Using Table A: P(x->66.5)=P(z>2.53)=1-0.9943=0.0057 
P(z > 66.5) = P(z > 2.53) = 1 — 0.9943 — 0.0057 


Using technology: normalcdf ((ower:2. 53, upper:1000, mean:o, SD:1) = 01005 7 
nt normalcdf (lower:66.5, upper:1000, mean:64.5, SD:0.79) = 0,005 77 
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Figure 7.11 compares the population distribution and the sampling distribution of x Z for 
the example about young women’s heights. It also shows the areas corresponding to the 
probabilities that we computed. You can see that it is much less likely for the average height of 
10 randomly selected young women to exceed 66.5 inches than it is for the height of one 
randomly selected young woman to exceed 66.5 inches. 


Sampling 
distribution 
of X 


Population 
distribution 


H=64.5 in. 66,5 in. 


FIGURE 7.11 The sampling distribution of the mean height x © for SRSs of 10 young women 
compared with the population distribution of young women’s heights. 


AP® EXAM TIP 


Many students lose credit on probability calculations involving x £ because they 


forget to divide the population standard deviation by n.v. Remember that 
averages are less variable than individual observations! 


The fact that averages of several observations are less variable than individual observations 
is important in many settings. For example, it is common practice in science and medicine to 
repeat a Measurement several times and report the average of the results. 


CHECK YOUR UNDERSTANDING 


fy 


y The length of human pregnancies from conception to birth varies according to a 
distribution that is approximately Normal with mean 266 days and standard deviation 16 
days. 
1. Find the probability that a randomly chosen pregnant woman has a pregnancy that lasts for 
more than 270 days. 
Suppose we choose an SRS of 6 pregnant women. Let x “= the mean pregnancy length 
for the sample. 


2. What is the mean of the sampling distribution of x ?£? 


3. Calculate and interpret the standard deviation of the sampling distribution of x .%. Verify 
that the 10% condition is met. 


4. Find the probability that the mean pregnancy length for the women in the sample exceeds 
270 days. 


The Central Limit Theorem 


Most population distributions are not Normal. What is the shape of the sampling distribution of 
x £ when sampling from a non-Normal population? The following activity sheds some light 
on this question. 


ACTIVITY Exploring the sampling distribution of X X for non-Normal 
populations 


Let’s use the sampling distributions applet from the preceding activity to investigate what 
happens when we start with a non-Normal population distribution. 


1. Go to onlinestatbook.com/stat_sim/sampling_dist/ and launch the applet. Select “Skewed” 
population. Set the bottom two graphs to display the mean—one for samples of size 2 and 
the other for samples of size 5. Click the “Animated” button a few times to be sure you see 
what’s happening. Then “Clear lower 3” and take 100,000 SRSs. Describe what you see. 


mean= 16.00 Parent population 
median= 16.00 Clear lower 3 
sd= 5.00 
Normal * 
oe ap [Normal_* | 
kurtosis= 0.00 
0 Sample: 
Reps= Sample Data a 
eps 5, Samp 
range= 13,00 5 5 
: 10,000 
2 100,000 
1 
0 0' : ’ 32 


Reps= 6 Distribution of Means, N=5 


mean- 16.05 6 

median= 16.00 ; 
ses apse + LJ Fit normal 
skew= 0.02 

kurtosis= —0.43 : 


= 
= 


0) 32 


2. Change the sample sizes to n = 10 and n =16 and take 100,000 samples. What do you 
notice? 

3. Now change the sample sizes to n = 20 and n = 25 and take 100,000 more samples. Did 
this confirm what you saw in Step 2? 


4. Clear the page, and select “Custom” distribution. Click on a point on the population graph 
to insert a bar of that height. Or click on a point on the horizontal axis, and drag up to 
define a bar. Make a distribution that looks as strange as you can. (Note: You can shorten 


a bar or get rid of it completely by clicking on the top of the bar and dragging down to the 
axis.) Then repeat Steps 1 to 3 for your custom distribution. Cool, huh? 


5. Summarize what you learned about the shape of the sampling distribution of x .@. 


The screen shots in Figure 7.12 show the approximate sampling distributions of x % for 
samples of size n = 2 and samples of size n = 25 from three different populations. 


Parent Population Parent Population Parent Population 
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FIGURE 7.12 Approximate sampling distributions of x £ for different population shapes and 
sample sizes. 


It is a remarkable fact that as the sample size increases, the sampling distribution of x % 
changes shape: it looks less like that of the population and more like a Normal distribution. 
When the sample size is large enough, the sampling distribution of x £ is very close to 
Normal. This is true no matter what shape the population distribution has, as long as the 
population has a finite standard deviation o. This important fact of probability theory is called 
the central limit theorem (sometimes abbreviated as CLT). 


DEFINITION Central limit theorem (CLT) 


Draw an SRS of size n from any population with mean p and finite standard 
deviation o. The central limit theorem (CLT) says that when n is large, the 
sampling distribution of the sample mean x = is approximately Normal. 


How large a sample size n is needed for the sampling distribution of x £ to be close to 


Normal depends on the population distribution. More observations are required if the shape of 
the population distribution is far from Normal. In that case, the sampling distribution of x % 
will also be very non-Normal if the sample size is small. 

As Figure 7.12 illustrates, even when the population distribution is very non- Normal, the 
sampling distribution of x £ often looks approximately Normal with sample sizes as small as n 
= 25. To be safe, we’ll require that n be at least 30 to invoke the CLT. 


SHAPE OF THE SAMPLING DISTRIBUTION OF THE SAMPLE MEAN X — 
X 


e Ifthe population distribution is Normal, the sampling distribution of x % will also be 


Normal, no matter what the sample size n is. 


e Ifthe population distribution is not Normal, the sampling distribution of x will be 
approximately Normal when the sample size is large (n > 30 in most cases). If the sample 
size is small and the population distribution is not Normal, the sampling distribution of x — 
£ will retain some characteristics of the population distribution (e.g., skewness). 


EXAMPLE |Free oil changes (>) 
Calculations using the CLT 


Joe Belanger/Alamy 


PROBLEM: Keith is the manager of an auto-care center. Based on service records from the 
past year, the time (in hours) that a technician requires to complete a standard oil change and 
inspection follows a right-skewed distribution with » = 30 minutes and o = 20 minutes. For a 
promotion, Keith randomly selects 40 current customers and offers them a free oil change 
and inspection if they redeem the offer during the next month. Keith budgets an average of 
35 minutes per customer for a technician to complete the work. Will this be enough? 


a. Calculate the probability that the average time it takes to complete the work exceeds 35 
minutes. 


b. How much average time per customer should Keith budget if he wants to be 99% certain 
that he doesn’t go “over budget”? 


SOLUTION: 


a. wx” = 30Hz = 30 


Assuming 40 < 10% of all current customers, 


: 7 = 4 =3.16 
ox” = 2040 = 3.16 v40 


Calculate the mean and standard deviation of the sampling distribution of x .2. 


__ a o 
Lz P Jn 


UX =U Ox =on 


Because the sample size 1s large (40 = 30), the distribution of Oat hts approximately 
Normal. 


Justify that the distribution of x £ is approximately Normal 


P| 


20.52 23.68 26.84 30 33.16 | 36.32 39.48 
¥=35 


X= sample mean time (min) 


 _ 35-30 __ 
z= 35 20st0=155 316 — 198 


Using Table A:P(x->35)=P(z>1.58)=1-0.9429=0.0571 
P(z> 36) = Plz > 1.58) = 1 — 0.9429 = 0.0571 


Using technology:normalcdf (lower. 58, upper:1000, mean:o, SD:1)=0.0571 
normalcdf (lower:1.58, upper:1000, mean:0, SD:1) = 0.0571 


it. normalcdf (lower:35, upper:1000, mean:30, SD:3.16)=0.0568 
normalcdf (lower:35, upper:1000, mean:30, SD:3.16) = 0.0568 


There is only aA 5.68% probability that Keith hasn’t budgeted enough time to complete the 
work, 


Area = 0.99 


- 


20.52 43.68 26.84 30 33.16 36.32 xy 39.48 


X= sample mean time (min) 


1. Draw a Normal distribution. 


i. Using Table A:0.99 area to left>z=2,330.99 area to left + z = 2.33 


Using technology:invnorm(area:o. 99, mean:o, SD:1)=2.33 
invnorm(area:0.99, mean:0, $D:1) = 2.33 


2.33 = z—30 
233)— % = 30316— 6 —3 724 mais 3.16 


— ® = 37.4 minutes 
i. invnorm(area:o. 99, mean:30, S‘D:3.1 6) = 37.4 minutes 
To be 99% sure he has budgeted enough time, Keith should plan for an average of al 


minutes per customer. 


. Perform calculations. 
i. Use Table A or technology to find the value of z with the indicated area under the 
standard Normal curve, then “unstandardize” to transform back to the original 
distribution; or 


ll. Use technology to find the desired value without standardizing. 
Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 65 


What if Keith decided to give away only 10 free oil changes? Because the population 
distribution is skewed to the right and the sample size is small (10 < 30), we can’t use a Normal 
distribution to do probability calculations. The sampling distribution of x @ is likely to be 
skewed to the right—although not as strongly as the population distribution itself. 


Section 7.3 Summary 


When we want information about the population mean p for some quantitative variable, we 
often take an SRS and use the sample mean x # to estimate the unknown parameter p. The 
sampling distribution of the sample mean x = describes how the statistic x @ varies in all 
possible samples of the same size from the population. 


The mean of the sampling distribution of x % is ux =p,4z ~ > so x # is an unbiased 
estimator of 1. 
The standard deviation of the sampling distribution of x % is ox =0/n7? = %/V™ for an 


SRS of size n if the population has standard deviation o. This formula can be used if the 
sample size is less than 10% of the population size (10% condition). 


Choose an SRS of size n from a population with mean p and standard deviation o. If the 
population distribution is Normal, then so is the sampling distribution of the sample mean 
x .£. If the population distribution is not Normal, the central limit theorem (CLT) states 
that when n is large, the sampling distribution of x # is approximately Normal. 


In some cases, we can use a Normal distribution to calculate probabilities for events 
involving x .@. 
e Ifthe population distribution is Normal, so is the sampling distribution of x .@- 


e If n> 30, the sampling distribution of x Z will be approximately Normal in most cases. 


Section 7.3 Exercises 


53. PQ 470 |») Songs on an iPod David’s iPod has about 10,000 songs. The distribution of 


the play times for these songs is heavily skewed to the right with a mean of 225 seconds 
and a standard deviation of 60 seconds. Suppose we choose an SRS of 10 songs from this 
population and calculate the mean play time x & of these songs. 


a. Identify the mean of the sampling distribution of x .2. 


b. Calculate and interpret the standard deviation of the sampling distribution of x .@. 
Verify that the 10% condition is met. 


54. Making auto parts A grinding machine in an auto parts plant prepares axles with a target 


diameter p = 40.125 millimeters (mm). The machine has some variability, so the standard 
deviation of the diameters is o = 0.002 mm. The machine operator inspects a random 
sample of 4 axles each hour for quality control purposes and records the sample mean 
diameter x .- Assume the machine is working properly. 


a. Identify the mean of the sampling distribution of x .@. 


b. Calculate and interpret the standard deviation of the sampling distribution of x .@. 


55. 


56. 


o7. 


58. 


5. 


60. 


Verify that the 10% condition is met. 


Songs on an iPod Refer to Exercise 53. How many songs would you need to sample if 
you wanted the standard deviation of the sampling distribution of x Z to be 10 seconds? 
Justify your answer. 


Making auto parts Refer to Exercise 54. How many axles would you need to sample if 
you wanted the standard deviation of the sampling distribution of x Z to be 0.0005 mm? 
Justify your answer. 


pg 472 |») Bottling cola A bottling company uses a filling machine to fill plastic 


bottles with cola. The bottles are supposed to contain 300 milliliters (ml). In fact, the 
contents vary according to a Normal distribution with mean p = 298 ml and standard 
deviation o = 3 ml. 


a. What is the probability that a randomly selected bottle contains less than 295 ml? 


b. What is the probability that the mean contents of six randomly selected bottles is less 
than 295 ml? 


Cereal A company’s cereal boxes advertise that each box contains 9.65 ounces of cereal. 
In fact, the amount of cereal in a randomly selected box follows a Normal distribution 
with mean p = 9.70 ounces and standard deviation o = 0.03 ounce. 


a. What is the probability that a randomly selected box of the cereal contains less than 
9.65 ounces of cereal? 


b. Now take an SRS of 5 boxes. What is the probability that the mean amount of cereal in 
these boxes is less than 9.65 ounces? 


Cholesterol Suppose that the blood cholesterol level of all men aged 20 to 34 follows the 
Normal distribution with mean p = 188 milligrams per deciliter (mg/dl) and standard 
deviation o = 41 mg/dl. 


a. Choose an SRS of 100 men from this population. Describe the sampling distribution of 
X a. 

b. Find the probability that x % estimates within +3 mg/dl. (This is the probability that 
x £ takes a value between 185 and 191 mg/dl.) 


c. Choose an SRS of 1000 men from this population. Now what is the probability that x — 
© falls within +3 mg/dl of py? In what sense is the larger sample “better”? 


Finch beaks One dimension of bird beaks is “depth”—the height of the beak where it 
arises from the bird’s head. During a research study on one island in the Galapagos 
archipelago, the beak depth of all Medium Ground Finches on the island was found to be 
Normally distributed with mean p“= 9.5 millimeters (mm) and standard deviation o7= 


1.0 mm.2 


61. 


62. 


63. 


64. 


a. Choose an SRS of 5 Medium Ground Finches from this population. Describe the 
sampling distribution of x .@. 


b. Find the probability that x # estimates p within +0.5 mm. (This is the probability that 
x & takes a value between 9 and 10 mm.) 


c. Choose an SRS of 50 Medium Ground Finches from this population. Now what is the 
probability that x Z falls within +0.5 mm of py? In what sense is the larger sample 
“better”? 


Dead battery? A car company claims that the lifetime of its batteries varies from car to 
car according to a Normal distribution with mean p = 48 months and standard deviation 0 
= 8.2 months. A consumer organization installs this type of battery in an SRS of 8 cars and 
calculates x = 42.2 months. 


a. Find the probability that the sample mean lifetime is 42.2 months or less if the 
company’s claim is true. 


b. Based on your answer to part (a), is there convincing evidence that the company is 
overstating the average lifetime of its batteries? 


Foiled again? The manufacturer of a certain brand of aluminum foil claims that the 
amount of foil on each roll follows a Normal distribution with a mean of 250 square feet 
(ft?) and a standard deviation of 2 ft?. To test this claim, a restaurant randomly selects 10 
rolls of this aluminum foil and carefully measures the mean area to be x _©= 249.6 ft?. 


a. Find the probability that the sample mean area is 249.6 ft? or less if the manufacturer’s 
claim is true. 


b. Based on your answer to part (a), is there convincing evidence that the company is 
overstating the average area of its aluminum foil rolls? 


Songs on an iPod David’s iPod has about 10,000 songs. The distribution of the play times 
for these songs is heavily skewed to the right with a mean of 225 seconds and a standard 
deviation of 60 seconds. 


a. Describe the shape of the sampling distribution of x # for SRSs of size n = 5 from the 
population of songs on David’s iPod. Justify your answer. 


b. Describe the shape of the sampling distribution of x Z for SRSs of size n = 100 from 
the population of songs on David’s iPod. Justify your answer. 


High school GPAs The distribution of grade point average for students at a large high 
school is skewed to the left with a mean of 3.53 and a standard deviation of 1.02. 


a. Describe the shape of the sampling distribution of x # for SRSs of size n = 4 from the 
population of students at this high school. Justify your answer. 


b. Describe the shape of the sampling distribution of x % for SRSs of size n = 50 from the 


population of students at this high school. Justify your answer. 


65. pg 477 |») More on insurance An insurance company claims that in the entire 


66. 


67. 


68. 


population of homeowners, the mean annual loss from fire is p = $250 and the standard 
deviation of the loss is o = $5000. The distribution of losses is strongly right-skewed: 
many policies have $0 loss, but a few have large losses. The company hopes to sell 1000 
of these policies for $300 each. 


a. Assuming that the company’s claim is true, what is the probability that the mean loss 
from fire is greater than $300 for an SRS of 1000 homeowners? 


b. If the company wants to be 90% certain that the mean loss from fire in an SRS of 1000 
homeowners is less than the amount it charges for the policy, how much should the 
company charge? 


Cash grab At a traveling carnival, a popular game is called the “Cash Grab.” In this 
game, participants step into a sealed booth, a powerful fan turns on, and dollar bills are 
dropped from the ceiling. A customer has 30 seconds to grab as much cash as possible 
while the dollar bills swirl around. Over time, the operators of the game have determined 
that the mean amount grabbed is $13 with a standard deviation of $9. They charge $15 to 
play the game and expect to have 40 customers at their next carnival. 


a. What is the probability that an SRS of 40 customers grab an average of $15 or more? 


b. How much should the operators charge if they want to be 95% certain that the mean 
amount grabbed by an SRS of 40 customers is less than what they charge to play the 
game? 


Bad carpet The number of flaws per square yard in a type of carpet material varies with 
mean 1.6 flaws per square yard and standard deviation 1.2 flaws per square yard. 


a. Without doing any calculations, explain which event is more likely: 
e randomly selecting a 1 square yard of material and finding 2 or more flaws 


e randomly selecting 50 square yards of material and finding an average of 2 or more 
flaws 


b. Explain why you cannot use a Normal distribution to calculate the probability of the 
first event in part (a). 


c. Calculate the probability of the second event in part (a). 


How many people in a car? A study of rush-hour traffic in San Francisco counts the 
number of people in each car entering a freeway at a suburban interchange. Suppose that 
this count has mean 1.6 and standard deviation 0.75 in the population of all cars that enter 
at this interchange during rush hour. 


a. Without doing any calculations, explain which event is more likely: 


69. 


70. 


“1. 


72. 


e randomly selecting 1 car entering this interchange during rush hour and finding 2 or 
more people in the car 


e randomly selecting 35 cars entering this interchange during rush hour and finding an 
average of 2 or more people in the cars 


b. Explain why you cannot use a Normal distribution to calculate the probability of the 
first event in part (a). 


c. Calculate the probability of the second event in part (a). 


What does the CLT say? Asked what the central limit theorem says, a student replies, 
“As you take larger and larger samples from a population, the histogram of the sample 
values looks more and more Normal.” Is the student right? Explain your answer. 


What does the CLT say? Asked what the central limit theorem says, a student replies, 
“As you take larger and larger samples from a population, the variability of the sampling 
distribution of the sample mean decreases.” Is the student right? Explain your answer. 


Airline passengers get heavier In response to the increasing weight of airline passengers, 
the Federal Aviation Administration (FAA) told airlines to assume that passengers average 
190 pounds in the summer, including clothes and carry-on baggage. But passengers vary, 
and the FAA did not specify a standard deviation. A reasonable standard deviation is 35 
pounds. A commuter plane carries 30 passengers. Find the probability that the total weight 
of 30 randomly selected passengers exceeds 6000 pounds. (Hint: To calculate this 
probability, restate the problem in terms of the mean weight.) 


Lightning strikes The number of lightning strikes on a square kilometer of open ground 
in a year has mean 6 and standard deviation 2.4. The National Lightning Detection 
Network (NLDN) uses automatic sensors to watch for lightning in 1-square-kilometer 
plots of land. Find the probability that the total number of lightning strikes in a random 
sample of 50 square-kilometer plots of land is less than 250. (Hint: To calculate this 
probability, restate the problem in terms of the mean number of strikes.) 


Multiple Choice Select the best answer for Exercises 73-76. 


73. 


The distribution of scores on the mathematics part of the SAT exam in a recent year was 
approximately Normal with mean 515 and standard deviation 114. Imagine choosing 
many SRSs of 100 students who took the exam and averaging their SAT Math scores. 
Which of the following are the mean and standard deviation of the sampling distribution 
of x ?x? 


a. Mean = 515, SD = 114 


— 


b. Mean = 515, SD=114/1008P = 114/100 


c. Mean = 515/100, SD = 114/100 


d. 


e. 


Mean = 515/100, SD=114/1008D = 114/100 


Cannot be determined without knowing the 100 scores. 


74, Why is it important to check the 10% condition before calculating probabilities involving 
x a? 


13% 


76. 


a. 
b. 
c. 
d. 


e. 


To reduce the variability of the sampling distribution of x 7 

To ensure that the distribution of x # is approximately Normal 
To ensure that we can generalize the results to a larger population 
To ensure that x will be an unbiased estimator of p 


To ensure that the observations in the sample are close to independent 


A machine is designed to fill 16-ounce bottles of shampoo. When the machine is working 
properly, the amount poured into the bottles follows a Normal distribution with mean 
16.05 ounces and standard deviation 0.1 ounce. Assume that the machine is working 
properly. If 4 bottles are randomly selected and the number of ounces in each bottle is 
measured, then there is about a 95% probability that the sample mean will fall in which of 
the following intervals? 


a. 
b. 
C; 
d. 


e. 


16.05 to 16.15 ounces 
16.00 to 16.10 ounces 
15.95 to 16.15 ounces 
15.90 to 16.20 ounces 


15.85 to 16.25 ounces 


The number of hours a lightbulb burns before failing varies from bulb to bulb. The 
population distribution of burnout times is strongly skewed to the right. The central limit 
theorem says that 


a. 


as we look at more and more bulbs, their average burnout time gets ever closer to the 
mean p/ for all bulbs of this type. 


the average burnout time of a large number of bulbs has a sampling distribution with 
the same shape (strongly skewed) as the population distribution. 


the average burnout time of a large number of bulbs has a sampling distribution with a 
similar shape but not as extreme (skewed, but not as strongly) as the population 
distribution. 


the average burnout time of a large number of bulbs has a sampling distribution that is 
close to Normal. 


e. the average burnout time of a large number of bulbs has a sampling distribution that is 
exactly Normal. 


Recycle and Review 


Exercises 77 and 78 refer to the following setting. In the language of government statistics, you 
are “in the labor force” if you are available for work and either working or actively seeking 
work. The unemployment rate is the proportion of the labor force (not of the entire population) 
that is unemployed. Here are estimates from the Current Population Survey for the civilian 
population aged 25 years and over in a recent year. The table entries are counts in thousands of 
people. 


Highest education Total population In labor force Employed 
Didn't finish high school 27,669 12,470 11,408 
High school but no college 59,860 37,834 35,857 
Less than bachelor’s degree 47,556 34,439 32,977 
College graduate 51,582 40,390 39,293 


77. Unemployment (1.1) Find the unemployment rate for people with each level of 
education. Is there an association between unemployment rate and education? Explain 
your answer. 


78. Unemployment (5.2, 5.3) Suppose that you randomly select one person 25 years of age or 
older. 


a. What is the probability that a randomly chosen person 25 years of age or older is in the 
labor force? 


b. If you know that a randomly chosen person 25 years of age or older is a college 
graduate, what is the probability that he or she is in the labor force? 


c. Are the events “in the labor force” and “college graduate” independent? Justify your 
answer. 


ae "tes 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


The principal of a large high school is concerned about the number of absences for 
students at his school. To investigate, he prints a list showing the number of absences 
during the last month for each of the 2500 students at the school. For this population of 
students, the distribution of absences last month is skewed to the right with a mean of 
w=1.14 = 1.1 and a standard deviation of o=1.4.0 = 1.4. 


Suppose that a random sample of 50 students is selected from the list printed by the 
principal and the sample mean number of absences is calculated. 


a. What is the shape of the sampling distribution of the sample mean? Explain. 


b. What are the mean and standard deviation of the sampling distribution of the 
sample mean? 


c. What is the probability that the mean number of absences in a random sample of 50 
students is less than 1? 


d. Because the population distribution is skewed, the principal is considering using the 
median number of absences last month instead of the mean number of absences to 
summarize the distribution. Describe how the principal could use a simulation to 
estimate the standard deviation of the sampling distribution of the sample median 
for samples of size 50. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.comitps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 


| and note what, if anything, you would do differently to improve your own score. | 


Chapter 7 Review 


Section 7.1: What Is a Sampling Distribution? 


In this section, you learned the “big ideas” of sampling distributions. The first big idea is the 
difference between a statistic and a parameter. A parameter is a number that describes some 
characteristic of a population. A statistic estimates the value of a parameter using a sample 
from the population. Making the distinction between a statistic and a parameter will be crucial 
throughout the rest of the course. 


The second big idea is that statistics vary. For example, the mean weight in a sample of 
high school students is a variable that will change from sample to sample. This means that 
Statistics have distributions. The distribution of a statistic in all possible samples of the same 
size is called the sampling distribution of the statistic. Knowing the sampling distribution of a 
Statistic tells us how far we can expect a Statistic to vary from the parameter value and what 
values of the statistic should be considered unusual. 


The third big idea is the distinction between the distribution of the population, the 
distribution of the sample, and the sampling distribution of a sample statistic. Reviewing the 
illustration on page 446 will help you understand the difference between these three 
distributions. When you are writing your answers, be sure to indicate which distribution you 
are referring to. Don’t make ambiguous statements like “the distribution will become less 
variable.” 


The final big idea is how to describe a sampling distribution. To adequately describe a 
sampling distribution, you need to address shape, center, and variability. If the center (mean) of 
the sampling distribution is the same as the value of the parameter being estimated, then the 
Statistic is called an unbiased estimator. An estimator is unbiased if it doesn’t consistently 
underestimate or consistently overestimate the parameter in many samples. Ideally, the 
variability of a sampling distribution will be very small, meaning that the statistic provides 
precise estimates of the parameter. Larger sample sizes result in sampling distributions with 
less variability. 


Section 7.2: Sample Proportions 

In this section, you learned about the shape, center, and variability of the sampling distribution 
of a sample proportion. When the Large Counts condition (np = 10 and n(1 — p) = 10) is met, 
the sampling distribution of p/P will be approximately Normal. The mean of the sampling 
distribution of p/? is pp\=p"» ~ ”, the population proportion. As a result, the sample 
proportion p/? is an unbiased estimator of the population proportion p. When the sample size 
is less than 10% of the population size (the 10% condition), the standard deviation of the 


o; = /p—p)/n 
sampling distribution of the sample proportion is op{=p(1-p)/n ? vP Bx” The 
standard deviation measures how far the sample proportion p/? typically varies from the 


population proportion p. 


Section 7.3: Sample Means 


In this section, you learned about the shape, center, and variability of the sampling distribution 
of a sample mean. When the population is Normally distributed, the sampling distribution of x — 
© will also be Normal for any sample size. When the population distribution is not Normal and 
the sample size is small, the sampling distribution of x £ will resemble the population shape. 
However, the central limit theorem says that the sampling distribution of x % will become 
approximately Normal for larger sample sizes (typically when n > 30), no matter what the 
population shape. You can use a Normal distribution to calculate probabilities involving the 
sampling distribution of x if the population is Normal or the sample size is at least 30. 


The mean of the sampling distribution of x Z is ux =p4z ~ F, the population mean. As a 
result, the sample mean x = is an unbiased estimator of the population mean pp. When the 
sample size is less than 10% of the population size (the 10% condition), the standard deviation 
of the sampling distribution of the sample mean is ox =o/n?* ~ /\/"_ The standard deviation 


measures how far the sample mean x £ typically varies from the population mean pF. 


Comparing the sampling distribution of a sample 
proportion and a sample mean 


Sampling distribution of 
pp Sampling distribution of x = 
Center Up*= BH, = ux =pyMz — fh 


el 


Variability _ _ . [pp 
op~=p(1-p)n?» ~ Vo 
when the 10% condition is 
met 


Shape Approximately Normal when Normal when the population distribution is 
the Large Counts condition Normal; Approximately Normal if the population 
is met: np = 10 and n(1-p) distribution is not Normal but the sample size is 
> 10 large (n = 30) 


_ = = — 
ox =on vn 
when the 10% condition is met 


What Did You Learn? 


Related 
Example on Relevant Chapter Review 
Learning Target Section Page(s) Exercise(s) 
Distinguish between a re 442 R7.1 


parameter and a statistic. 


Create a sampling 
distribution using all 
possible samples from a 
small population. 


Use the sampling 
distribution of a statistic to 
evaluate a claim about a 
parameter. 


Distinguish among the 
distribution of a population, 
the distribution of a 
sample, and the sampling 
distribution of a statistic. 


Determine if a statistic is 
an unbiased estimator of a 
population parameter. 


Describe the relationship 
between sample size and 
the variability of a statistic. 


Calculate the mean and 
standard deviation of the 
sampling distribution of a 
sample proportion p*P and 
interpret the standard 
deviation. 


Determine if the sampling 
distribution pP of is 
approximately Normal. 


If appropriate, use a 
Normal distribution to 
calculate probabilities 
involving p’.P- 


Calculate the mean and 
standard deviation of the 
sampling distribution of a 
sample mean x £ and 
interpret the standard 
deviation. 


Explain how the shape of 
the sampling distribution 

x £ of is affected by the 
shape of the population 
distribution and the sample 
size. 


If appropriate, use a 
Normal distribution to 
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Chapter 7 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R7.1 \») Bad eggs Selling eggs that are contaminated with salmonella can cause food 


poisoning in consumers. A large egg producer randomly selects 200 eggs from all the 
eggs shipped in one day. The laboratory reports that 9 of these eggs had salmonella 
contamination. Identify the population, the parameter, the sample, and the statistic. 


R7.2 |») Five books An author has written 5 children’s books. The numbers of pages in these 
books are 64, 66, 71, 73, and 76. 


a. List all 10 possible SRSs of size n=3,” = 3; calculate the median number of pages for 
each sample, and display the sampling distribution of the sample median on a dotplot. 


b. Describe how the variability of the sampling distribution of the sample median would 
change if the sample size was increased to n=4.n = 4. 
c. Construct the sampling distribution of the sample median for samples of size n=4. 


n = 4. Does this sampling distribution support your answer to part (b)? Explain your 
reasoning. 


R7.3 |») Birth weights Researchers in Norway analyzed data on the birth weights of 400,000 
newborns over a 6-year period. The distribution of birth weights is approximately 
Normal with a mean of 3668 grams and a standard deviation of 511 grams.12 
a. Sketch a graph that displays the distribution of birth weights for this population. 


b. Sketch a possible graph of the distribution of birth weights for an SRS of size 5. 
Calculate the range for this sample. 

In this population, the range (Maximum — Minimum) of birth weights is 3417 grams. We 

technology to take 500 SRSs of size n=5n = 5 and calculate the range (Maximum — 

Minimum) for each sample. The dotplot shows the results. 


, df tl li. 7 


1000 1500 2000 2500 3000 
Simulated sample range (g) 


c. In the graph provided, there is a dot at approximately 2800. Explain what this value 
represents. 


d. Is the sample range an unbiased estimator of the population range? Give evidence 
from the graph to support your answer. 


R7.4 ») Do you jog? The Gallup Poll asked a random sample of 1540 adults, “Do you 


happen to jog?” Suppose that the true proportion of all adults who jog is p=0.15. 
p= 0.15. 


a. What is the mean of the sampling distribution of p/P? 


b. Calculate and interpret the standard deviation of the sampling distribution of p/?. 
Check that the 10% condition is met. 


c. Is the sampling distribution of p\? approximately Normal? Justify your answer. 


d. Find the probability that between 13% and 17% of people jog in a random sample of 
1540 adults. 


R7.5 |») Bag check Thousands of travelers pass through the airport in Guadalajara, Mexico, 


each day. Before leaving the airport, each passenger must pass through the customs 
inspection area. Customs agents want to be sure that passengers do not bring illegal items 
into the country. But they do not have time to search every traveler’s luggage. Instead, 
they require each person to press a button. Either a red or a green bulb lights up. If the 
red light flashes, the passenger will be searched by customs agents. A green light means 
“Go ahead.” Customs agents claim that 30% of all travelers will be stopped (red light), 
because the light has probability 0.30 of showing red on any push of the button. To test 
this claim, a concerned citizen watches a random sample of 100 travelers push the 
button. Only 20 get a red light. 


a. Assume that the customs agents’ claim is true. Find the probability that the proportion 
of travelers who get a red light in a random sample of 100 travelers is less than or 
equal to the result in this sample. 


b. Based on your results in part (a), is there convincing evidence that less than 30% of 
all travelers will be stopped? Explain your reasoning. 


R7.6 \p) IQ tests The Wechsler Adult Intelligence Scale (WAIS) is a common IQ test for 


adults. The distribution of WAIS scores for persons over 16 years of age is 
approximately Normal with mean 100 and standard deviation 15. 


a. What is the probability that a randomly chosen individual has a WAIS score of 105 or 
greater? 


b. Find the mean and standard deviation of the sampling distribution of the average 
WAIS score x £ for an SRS of 60 people. Interpret the standard deviation. 


c. What is the probability that the average WAIS score of an SRS of 60 people is 105 or 
greater? 


d. Would your answers to any of parts (a), (b), or (c) be affected if the distribution of 
WAIS scores in the adult population was distinctly non-Normal? Explain your 
reasoning. 


R7.7 ») Detecting gypsy moths The gypsy moth is a serious threat to oak and aspen trees. A 


state agriculture department places traps throughout the state to detect the moths. Each 
month, an SRS of 50 traps is inspected, the number of moths in each trap is recorded, and 
the mean number of moths is calculated. Based on years of data, the distribution of moth 
counts is discrete and strongly skewed with a mean of 0.5 and a standard deviation of 
0.7. 


a. Explain why it is reasonable to use a Normal distribution to approximate the sampling 
distribution of x % for SRSs of size 50. 


b. Estimate the probability that the mean number of moths in a sample of size 50 is 
greater than or equal to 0.6. 


c. Ina recent month, the mean number of moths in an SRS of size 50 was x =0.6. 
& = 0.6. Based on this result, is there convincing evidence that the moth population is 
getting larger in this state? Explain your reasoning. 


Chapter 7 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T7.1 A study of voting chose 663 registered voters at random shortly after an election. Of 
these, 72% said they had voted in the election. Election records show that only 56% of 
registered voters voted in the election. Which of the following statements is true? 


a. 72% is a sample; 56% is a population. 

b. 72% and 56% are both statistics. 

c. 72% is a Statistic and 56% is a parameter. 
d. 72% is a parameter and 56% is a Statistic. 
e. 72% and 56% are both parameters. 


T7.2 The Gallup Poll has decided to increase the size of its random sample of voters from 
about 1500 people to about 4000 people right before an election. The poll is designed 
to estimate the proportion of voters who favor a new law banning smoking in public 
buildings. The effect of this increase is to 


a. reduce the bias of the estimate. 

b. increase the bias of the estimate. 

c. reduce the variability of the estimate. 

d. increase the variability of the estimate. 

e. reduce the bias and variability of the estimate. 


T7.3 Suppose we select an SRS of size n=100n = 100 from a large population having 
proportion p of successes. Let p/? be the proportion of successes in the sample. For 
which value of p would it be safe to use the Normal approximation to the sampling 
distribution of p\P ? 


a. 0.01 
b. 0.09 
c. 0.85 
d. 0.975 
e. 0.999 


T7.4 The central limit theorem is important in statistics because it allows us to use a Normal 
distribution to find probabilities involving the sample mean if the 


a. sample size is reasonably large (for any population). 


b. population is Normally distributed (for any sample size). 


c. population is Normally distributed and the sample size is reasonably large. 


d. population is Normally distributed and the population standard deviation is known 
(for any sample size). 


e. population size is reasonably large (whether the population distribution is known or 
not). 


T7.5 The number of undergraduates at Johns Hopkins University is approximately 2000, 
while the number at Ohio State University is approximately 60,000. At both schools, a 
simple random sample of about 3% of the undergraduates is taken. Each sample is used 
to estimate the proportion p of all students at that university who own an iPod. Suppose 
that, in fact, p = 0.80 at both schools. Which of the following is the best conclusion? 


a. We expect that the estimate from Johns Hopkins will be closer to the truth than the 
estimate from Ohio State because it comes from a smaller population. 


b. We expect that the estimate from Johns Hopkins will be closer to the truth than the 
estimate from Ohio State because it is based on a smaller sample size. 


c. We expect that the estimate from Ohio State will be closer to the truth than the 
estimate from Johns Hopkins because it comes from a larger population. 


d. We expect that the estimate from Ohio State will be closer to the truth than the 
estimate from Johns Hopkins because it is based on a larger sample size. 


e. We expect that the estimate from Johns Hopkins will be about the same distance 
from the truth as the estimate from Ohio State because both samples are 3% of their 
populations. 


T7.6 A researcher initially plans to take an SRS of size 160 from a certain population and 
calculate the sample mean x =. Later, the researcher decides to increase the sample 
size so that the standard deviation of the sampling distribution of x # will be half as 
big as when using a sample size of 160. What sample size should the researcher use? 
a. 40 
b. 80 
c. 320 
d. 640 


e. There is not enough information to determine the sample size. 


T7./ The student newspaper at a large university asks an SRS of 250 undergraduates, “Do 
you favor eliminating the carnival from the term-end celebration?” All in all, 150 of the 
250 are in favor. Suppose that (unknown to you) 55% of all undergraduates favor 
eliminating the carnival. If you took a very large number of SRSs of size n=250 
n = 250 from this population, the sampling distribution of the sample proportion p/? 
would be 


a. exactly Normal with mean 0.55 and standard deviation 0.03. 


es 
d. 


e. 


approximately Normal with mean 0.55 and standard deviation 0.03. 
exactly Normal with mean 0.60 and standard deviation 0.03. 
approximately Normal with mean 0.60 and standard deviation 0.03. 


heavily skewed with mean 0.55 and standard deviation 0.03. 


T7.8 Which of the following statements about the sampling distribution of the sample mean 
is incorrect? 


a. 


The standard deviation of the sampling distribution will decrease as the sample size 
increases. 


. The standard deviation of the sampling distribution measures how far the sample 


mean typically varies from the population mean. 


The sample mean is an unbiased estimator of the population mean. 


. The sampling distribution shows how the sample mean is distributed around the 


population mean. 


The sampling distribution shows how the sample is distributed around the sample 
mean. 


T7.9 A newborn baby has extremely low birth weight (ELBW) if it weighs less than 1000 
grams. A study of the health of such children in later years examined a random sample 
of 219 children. Their mean weight at birth was x = 810 grams. This sample mean is 
an unbiased estimator of the mean weight p in the population of all ELBW babies, 
which means that 


a. 


e. 


in all possible samples of size 219 from this population, the mean of the values of 
x £ will equal 810. 


. in all possible samples of size 219 from this population, the mean of the values of 


x £ will equal p. 
as we take larger and larger samples from this population, x will get closer and 
closer to pL. 


. inall possible samples of size 219 from this population, the values of x will have 


a distribution that is close to Normal. 


the person measuring the children’s weights does so without any error. 


T7.10 Suppose that you are a student aide in the library and agree to be paid according to the 
“random pay” system. Each week, the librarian flips a coin. If the coin comes up heads, 
your pay for the week is $80. If it comes up tails, your pay for the week is $40. You 
work for the library for 100 weeks. Suppose we choose an SRS of 2 weeks and 
calculate your average earnings x Z. The shape of the sampling distribution of x @ will 


be 
a. 


b. 


Normal. 


approximately Normal. 


c. right-skewed. 
d. left-skewed. 


e. symmetric but not Normal. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T7.11 Here are histograms of the values taken by three sample statistics in several hundred 
samples from the same population. The true value of the population parameter is 
marked with an arrow on each histogram. 


Which statistic would provide the best estimate of the parameter? Justify your answer. 


T7.12 The amount that households pay service providers for access to the Internet varies quite 
a bit, but the mean monthly fee is $50 and the standard deviation is $20. The 
distribution is not Normal: many households pay a low rate as part of a bundle with 
phone or television service, but some pay much more for Internet only or for faster 
connections./! A sample survey asks an SRS of 50 households with Internet access how 
much they pay. Let x @ be the mean amount paid. 


a. Explain why you can’t determine the probability that the amount a randomly 
selected household pays for access to the Internet exceeds $55. 


b. What are the mean and standard deviation of the sampling distribution of x #? 
c. What is the shape of the sampling distribution of x £? Justify your answer. 


d. Find the probability that the average fee paid by the sample of households exceeds 
$55. 


T7.13 According to government data, 22% of American children under the age of 6 live in 
households with incomes less than the official poverty level. A study of learning in 


early childhood chooses an SRS of 300 children from one state and finds that p\? = 
0.29. 


a. Find the probability that at least 29% of the sample are from poverty-level 
households, assuming that 22% of all children under the age of 6 in this state live in 
poverty-level households. 


b. Based on your answer to part (a), is there convincing evidence that the percentage of 
children under the age of 6 living in households with incomes less than the official 
poverty level in this state is greater than the national value of 22%? Explain your 
reasoning. 


Cumulative AP® Practice Test 2 


Section |: Multiple Choice Choose the best answer for each question. 


AP2.1 The five-number summary for a data set is given by min = 5, Q,; = 18, median = 20, Q3 


= 40, max = 75. If you wanted to construct a boxplot for the data set that would show 
outliers, if any existed, what would be the maximum possible length of the right-side 
“whisker”? 


a. 33 
b. 35 
c. 45 
d. 53 
e. 55 
AP2.2 The probability distribution for the number of heads in four tosses of a coin is given by 


Number of heads 0 1 2 3 4 
Probability 0.0625 0.2500 0.3750 0.2500 0.0625 


The probability of getting at least one tail in four tosses of a coin is 
a. 0.2500. 
b. 0.3125. 
c. 0.6875. 
d. 0.9375. 
e. 0.0625. 


AP2.3 In a certain large population of adults, the distribution of IQ scores is strongly left- 
skewed with a mean of 122 and a standard deviation of 5. Suppose 200 adults are 
randomly selected from this population for a market research study. For SRSs of size 
200, the distribution of sample mean IQ score is 


a. left-skewed with mean 122 and standard deviation 0.35. 
b. exactly Normal with mean 122 and standard deviation 5. 

c. exactly Normal with mean 122 and standard deviation 0.35. 

d. approximately Normal with mean 122 and standard deviation 5. 

e. approximately Normal with mean 122 and standard deviation 0.35. 


AP2.4 A 10-question multiple-choice exam offers 5 choices for each question. Jason just 
guesses the answers, so he has probability 1/5 of getting any one answer correct. You 


want to perform a simulation to determine the number of correct answers that Jason 
gets. What would be a proper way to use a table of random digits to do this? 


a. 


e. 


One digit from the random digit table simulates one answer, with 5 = correct and all 
other digits = incorrect. Ten digits from the table simulate 10 answers. 


. One digit from the random digit table simulates one answer, with 0 or 1 = correct 


and all other digits = incorrect. Ten digits from the table simulate 10 answers. 


One digit from the random digit table simulates one answer, with odd = correct and 
even = incorrect. Ten digits from the table simulate 10 answers. 


. One digit from the random digit table simulates one answer, with 0 or 1 = correct 


and all other digits = incorrect, ignoring repeats. Ten digits from the table simulate 
10 answers. 


Two digits from the random digit table simulate one answer, with 00 to 20 = correct 
and 21 to 99 = incorrect. Ten pairs of digits from the table simulate 10 answers. 


AP2.5 Suppose we roll a fair die four times. What is the probability that a 6 occurs on exactly 
one of the rolls? 


a. 


b. 


C. 


e. 
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AP2.6 On one episode of his show, a radio show host encouraged his listeners to visit his 
website and vote in a poll about proposed tax increases. Of the 4821 people who vote, 
4277 are against the proposed increases. To which of the following populations should 
the results of this poll be generalized? 


a. 
b. 


C. 


d. 


e. 


All people who have ever listened to this show 

All people who listened to this episode of the show 
All people who visited the show host’s website 

All people who voted in the poll 


All people who voted against the proposed increases 


AP2.7 The number of unbroken charcoal briquets in a 20-pound bag filled at the factory 
follows a Normal distribution with a mean of 450 briquets and a standard deviation of 
20 briquets. The company expects that a certain number of the bags will be underfilled, 
so the company will replace for free the 5% of bags that have too few briquets. What is 


the minimum number of unbroken briquets the bag would have to contain for the 
company to avoid having to replace the bag for free? 


a. 404 
b. 411 
c. 418 
d. 425 
e. 448 


AP2.8 You work for an advertising agency that is preparing a new television commercial to 
appeal to women. You have been asked to design an experiment to compare the 
effectiveness of three versions of the commercial. Each subject will be shown one of the 
three versions and then asked to reveal her attitude toward the product. You think there 
may be large differences in the responses of women who are employed and those who 
are not. Because of these differences, you should use 
a. a block design, but not a matched pairs design. 

b. a completely randomized design. 
c. a matched pairs design. 
d. asimple random sample. 


e. a Stratified random sample. 


AP2.9 Suppose that you have torn a tendon and are facing surgery to repair it. The 
orthopedic surgeon explains the risks to you. Infection occurs in 3% of such operations, 
the repair fails in 14%, and both infection and failure occur together 1% of the time. 
What is the probability that the operation is successful for someone who has an 
operation that is free from infection? 


a. 0.8342 
b. 0.8400 
c. 0.8600 
d. 0.8660 
e. 0.9900 


AP2.10 Social scientists are interested in the association between high school graduation 
rate (HSGR, measured as a percent) and the percent of U.S. families living in poverty 
(POV). Data were collected from all 50 states and the District of Columbia, and a 
regression analysis was conducted. 

The resulting least-squares regression line is given by POVA=59.2—-0.620(HSGR) 


POV = 59.2 — 0.620 (HSGR) with r2=0.802r” = 0.802, Based on the information, 
which of the following is the best interpretation for the slope of the least-squares 
regression line? 


a. For each 1% increase in the graduation rate, the percent of families living in poverty 
is predicted to decrease by approximately 0.896. 


b. For each 1% increase in the graduation rate, the percent of families living in poverty 
is predicted to decrease by approximately 0.802. 


c. For each 1% increase in the graduation rate, the percent of families living in poverty 
is predicted to decrease by approximately 0.620. 


d. For each 1% increase in the percent of families living in poverty, the graduation rate 
is predicted to decrease by approximately 0.802. 


e. For each 1% increase in the percent of families living in poverty, the graduation rate 
is predicted to decrease by approximately 0.620. 


Questions AP2.11—AP2.13 refer to the following graph. Here is a dotplot of the adult literacy 
rates in 177 countries in a recent year, according to the United Nations. For example, the lowest 
literacy rate was 23.6%, in the African country of Burkina Faso. Mali had the next lowest 
literacy rate at 24.0%. 


Literacy rate (%) 


AP2.11 The overall shape of this distribution is 
a. Clearly skewed to the right. 
b. clearly skewed to the left. 
c. roughly symmetric. 
d. uniform. 
e. There is no clear shape. 
AP2.12 The mean of this distribution (don’t try to find it) will be 
a. very close to the median. 
b. greater than the median. 
c. less than the median. 


d. You can’t say, because the distribution isn’t symmetric. 


e. You can’t say, because the distribution isn’t Normal. 


AP2.13 The country with a literacy rate of 49% is closest to which of the following 
percentiles? 


a. 6th 
b. 10th 
c. 28th 
d. 49th 
e. There is not enough information to calculate the percentile. 


AP2.14 The correlation between the age and height of children under the age of 12 is found to 
be r = 0.60. Suppose we use the age x of a child to predict the height y of the child. 
What can we conclude? 


a. The height is generally 60% of a child’s age. 
b. About 60% of the time, age will accurately predict height. 


c. Thirty-six percent of the variation in height is accounted for by the linear model 
relating height to age. 


d. For every 1 year older a child is, the regression line predicts an increase of 0.6 foot 
in height. 


e. Thirty-six percent of the time, the least-squares regression line accurately predicts 
height from age. 


AP2.15 An agronomist wants to test three different types of fertilizer (A, B, and C) on the 
yield of a new variety of wheat. The yield will be measured in bushels per acre. Six 1- 
acre plots of land were randomly assigned to each of the three fertilizers. The treatment, 
experimental unit, and response variable are, respectively, 


a. a specific fertilizer, bushels per acre, a plot of land. 

b. variety of wheat, bushels per acre, a specific fertilizer. 
c. variety of wheat, a plot of land, wheat yield. 

d. a specific fertilizer, a plot of land, wheat yield. 

e. a specific fertilizer, the agronomist, wheat yield. 

AP2.16 According to the U.S. Census, the proportion of adults in a certain county who owned 
their own home was 0.71. An SRS of 100 adults in a certain section of the county found 
that 65 owned their home. Which one of the following represents the approximate 
probability of obtaining a sample of 100 adults in which 65 or fewer own their home, 


assuming that this section of the county has the same overall proportion of adults who 
own their home as does the entire county? 


100 


(0.71)° (0.29)” 
- cases -) ) (0.29) 


100 


0.29) (0.71)*° 
b. eee pg )" (0.71) 


ae / (0.71)(0.29) 
\ 100 


c. P(z<0.65-0.71(0.71)(0.29)100) 


- } (0.65)(0.35) 


d. P(z<0.65-0.71(0.65)(0.35)100) \ 100 


=  (0.71)(0.29) 


P (- = 0.65—0.71 
e. P(z<0.65—-0.71(0.71)(0.29)100) v100 


AP2.17 Which one of the following would be a correct interpretation if you have a z-score of 
+2.0 on an exam? 


a. It means that you missed two questions on the exam. 

b. It means that you got twice as many questions correct as the average student. 

c. It means that your grade was 2 points higher than the mean grade on this exam. 
d. It means that your grade was in the upper 2% of all grades on this exam. 

e. It means that your grade is 2 standard deviations above the mean for this exam. 


AP2.18 Records from a dairy farm yielded the following information on the number of male 
and female calves born at various times of the day. 


Time of day 
Day Evening Night Total 
Males 261 
Gender 
Females 252 
Total 247 33 233 513 


What is the probability that a randomly selected calf was born in the night or was a 
female? 

a. 369513 513 

b. 485513513 


116 
c. 116513513 
116 
d. 116252 252 
116 


e. 116233 233 


AP2.19 When people order books from a popular online source, they are shipped in boxes. 


Suppose that the mean weight of the boxes is 1.5 pounds with a standard deviation of 
0.3 pound, the mean weight of the packing material is 0.5 pound with a standard 
deviation of 0.1 pound, and the mean weight of the books shipped is 12 pounds with a 
standard deviation of 3 pounds. Assuming that the weights are independent, what is the 
standard deviation of the total weight of the boxes that are shipped from this source? 


a. 1.84 
b. 2.60 
c. 3.02 
d. 3.40 
e. 9.10 


AP2.20 A grocery chain runs a prize game by giving each customer a ticket that may win a 
prize when the box is scratched off. Printed on the ticket is a dollar value ($500, $100, 
$25) or the statement “This ticket is not a winner.” Monetary prizes can be redeemed 
for groceries at the store. Here is the probability distribution of the amount won on a 
randomly selected ticket: 


Amount won $500 $100 $25 $0 
Probability 0.01 0.05 0.20 0.74 


Which of the following are the mean and standard deviation, respectively, of the 
winnings? 


a. $15.00, $2900.00 
b. $15.00, $53.85 
c. $15.00, $26.93 
d. $156.25, $53.85 
e. $156.25, $26.93 


AP2.21 A large company is interested in improving the efficiency of its customer service and 
decides to examine the length of the business phone calls made to clients by its sales 
staff. Here is a cumulative relative frequency graph from data collected over the past 
year. According to the graph, the shortest 80% of calls will take how long to complete? 


a. Less than 10 minutes 
b. At least 10 minutes 
c. Exactly 10 minutes 
d. At least 5.5 minutes 


e. Less than 5.5 minutes 


Percent 


0 2 4 6 8 10 12 14 16 
Length of phone call (min) 


Section II: Free Response Show all your work. Indicate clearly the methods you 
use, because you will be graded on the correctness of your methods as well as on the 
accuracy and completeness of your results and explanations. 


AP2.22 A health worker is interested in determining if omega-3 fish oil can help reduce 
cholesterol in adults. She obtains permission to examine the health records of 200 
people in a large medical clinic and classifies them according to whether or not they 
take omega-3 fish oil. She also obtains their latest cholesterol readings and finds that the 
mean cholesterol reading for those who are taking omega-3 fish oil is 18 points less 
than the mean for the group not taking omega-3 fish oil. 


a. Is this an observational study or an experiment? Justify your answer. 


b. Explain the concept of confounding in the context of this study and give one 
example of a variable that could be confounded with whether or not people take 
omega-3 fish oil. 


c. Researchers find that the 18-point difference in the mean cholesterol readings of the 


two groups is statistically significant. Can they conclude that omega-3 fish oil is the 
cause? Why or why not? 


AP2.23 The scatterplot shows the relationship between the number of yards allowed by teams 
in the National Football League and the number of wins for that team in a recent season, 
along with the least-squares regression line. Computer output is also provided. 


Term Coef SE Coef T-Value P-Value 
Constant 25.66 Soe 4.78 0.000 
Yards_allowed —0.003131 0.000948 —3.30 0.002 


S = 2.65358 R-Sq = 26.65% R-Sq(adj) = 24.21% 


4500 5000 5500 6000 6500 7000 
Yards allowed 


a. State the equation of the least-squares regression line. Define any variables you use. 


b. Calculate and interpret the residual for the Seattle Seahawks, who allowed 4668 
yards and won 10 games. 


c. The Carolina Panthers allowed 5167 yards and won 15 games. What effect does the 
point representing the Panthers have on the equation of the least-squares regression 
line? Explain. 


AP2.24 Every 17 years, swarms of cicadas emerge from the ground in the eastern United 


States, live for about six weeks, and then die. (There are several different “broods,” so 
we experience cicada eruptions more often than every 17 years.) There are so many 
cicadas that their dead bodies can serve as fertilizer and increase plant growth. In a 
study, a researcher added 10 dead cicadas under 39 randomly selected plants in a 
natural plot of American bellflowers on the forest floor, leaving other plants 
undisturbed. One of the response variables measured was the size of seeds produced by 
the plants. Here are the boxplots and summary statistics of seed mass (in milligrams) for 
39 cicada plants and 33 undisturbed (control) plants: 


n Minimum Q, Median Q3 Maximum 
Cicada plants 39 «0.17 0.22 0.25 0.28 0.35 
Control plants 33 0.14 0.19 0.25 0.26 0.29 


Cicada 
plants a as 


Control 
plants 
i | 
0.15 0.2 0.25 0.30 0.35 


Seed mass (mg) 


a. Write a few sentences comparing the distributions of seed mass for the two groups 
of plants. 


b. Based on the graphical displays, which distribution likely has the larger mean? 
Justify your answer. 


c. Explain the purpose of the random assignment in this study. 


d. Name one benefit and one drawback of only using American bellflowers in the 
study. 


AP2.25 In a city library, the mean number of pages in a novel is 525 with a standard deviation 
of 200. Approximately 30% of the novels have fewer than 400 pages. Suppose that you 
randomly select 50 novels from the library. 


a. What is the probability that the average number of pages in the sample is less than 
500? 


b. What is the probability that at least 20 of the novels have fewer than 400 pages? 
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Chapter 8 Review Exercises 


Chapter 8 AP® Statistics Practice Test 


INTRODUCTION 


How long does a battery last on the newest iPhone, on average? What proportion of college 
undergraduates attended all of their classes last week? How much does the weight of a quarter- 
pound hamburger at a fast-food restaurant vary after cooking? These are the types of questions 
we would like to answer. 


anopdesignstock/Getty Images 


It wouldn’t be practical to determine the lifetime of every iPhone battery, to ask all 
undergraduates about their attendance, or to weigh every burger after cooking. Instead, we 
choose a random sample of individuals (batteries, undergraduates, burgers) to represent the 
population and collect data from those individuals. From what we learned in Chapter 4, if we 
randomly select the sample, we should be able to generalize our results to the population of 
interest. However, we cannot be certain that our conclusions are correct—a different sample 
would probably yield a different estimate. Probability helps us account for the chance variation 
due to random selection or random assignment. 

Chapter 8 begins the formal study of statistical inference—using information from a sample 
to draw conclusions about a population parameter such as p or p. This is an important transition 
from Chapter 7, where you were given information about a population and asked questions 
about the distribution of a sample statistic, such as the sample proportion p’? or the sample 
mean x .@. 


The following activity gives you an idea of what lies ahead. 


ACTIVITY The mystery mean 


In this activity, each team of 3 to 4 students will try to estimate the mystery value of the 
population mean p that your teacher selected before class.t 


1. Before class, your teacher stored a value of p (represented by M) in the display calculator 
and then cleared the home screen so you can’t see the value of M. 


2. With the class watching, the teacher will execute the following command: 


mean(andNorm(M, 20,16))mean(andN orm(M, 20, 16)) 


This tells the calculator to choose an SRS of 16 observations from a Normal population 
with mean M and standard deviation 20 and then compute the mean x = of those 16 
sample values. Do you believe that the sample mean shown is equal to the mystery mean 
M? Explain. 

3. Determine an interval of believable values for the population mean p. Use the result from 
Step 2 and what you learned about sampling distributions in the previous chapter. 


4. Share your team’s results with the class. 


In this chapter and the next, we will introduce the two most common types of formal 
Statistical inference. Chapter 8 concerns confidence intervals for estimating the value of a 
parameter. Chapter 9 presents significance tests, which assess the evidence for a claim about a 
parameter. Both types of inference are based on the sampling distributions you studied in 
Chapter 7. 

In this chapter, we start by presenting the idea of a confidence interval in a general way that 
applies to estimating any unknown parameter. In Section 8.2, we show how to estimate a 
population proportion using a confidence interval. Section 8.3 focuses on confidence intervals 
for a population mean. 


SECTION 8.1 Confidence Intervals: The Basics 


LEARNING TARGETS By the end of the section, you should be able to: 


Identify an appropriate point estimator and calculate the value of a point 
estimate. 


Interpret a confidence interval in context. 


Determine the point estimate and margin of error from a confidence interval. 


Use a confidence interval to make a decision about the value of a parameter. 
Interpret a confidence level in context. 
Describe how the sample size and confidence level affect the margin of error. 


Explain how practical issues like nonresponse, undercoverage, and response 
bias can affect the interpretation of a confidence interval. 


Mir. Schiel’s class did the “Mystery mean” activity from the Introduction. The TI screen shot 
displays the information that the students received about the unknown population mean p. Here 
is asummary of what the class said about the calculator output: 


e The population distribution is Normal and its standard deviation is o=20.0 = 20. 
e A simple random sample of n=16” = 16 observations was taken from this population. 


e The sample mean is x =240.80.¢ = 240.80. 
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mean(randNorm(M, 20,16)) 


If we had to give a single number to estimate the value of M that Mr. Schiel chose, what 
would it be? Because the sample mean x @ is an unbiased estimator of the population mean p, 
we use the statistic x % as a point estimator of the parameter p. The best guess for the value of 
pis x =240.80% = 240.80. This value is known as a point estimate. 


DEFINITION Point estimator, Point estimate 


A point estimator is a statistic that provides an estimate of a population 
parameter. 


The value of that statistic from a sample is called a point estimate. 


As we saw in Chapter 7, the ideal point estimator will have no bias and little variability. 
Here’s an example involving some of the more common point estimators. 


EXAMPLE |From batteries to smoking (>) 
Point estimators 


< 
, 
2 
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PROBLEM: Identify the point estimator you would use to estimate the parameter in each of 
the following settings and calculate the value of the point estimate. 


a. Quality control inspectors want to estimate the mean lifetime p of the AA batteries 
produced each hour at a factory. They select a random sample of 50 batteries during each 
hour of production and then drain them under conditions that mimic normal use. Here are 
the lifetimes (in hours) of the batteries from one such sample: 


16.735) 1560 Vlesl) 91757) SIG 17.268 7 i667 17.265 17.27 ~ 7.30 
15.59 17.54 1646 1563 1682 17.16 1662 16.71 1669 17.98 
15.99 1564 17.20 17.24 1668 1655 17.48 £15.58 17.61 15.98 
15.46 1650 1619 1636 1780 1661 1699 1693 1601 1646 
W754 174) 16.91 teo0° 16.78 15.75 17.31 16.507” 16.72 7.55 


b. What proportion p of U.S. adults would classify themselves as vegan or vegetarian? A 
Pew Research Center report surveyed 1473 randomly selected U.S. adults. Of these, 124 


said they were vegan or vegetarian.2 


c. The quality control inspectors in part (a) want to investigate the variability in battery 


lifetimes by estimating the population standard deviation o. 
SOLUTION: 


a. Use the sample mean x= asa point estimator ‘for the population mean w. The point 
—  16.73+15.60+-+-+17.55 


estimate ts X"=16.734+15.60+--°+17.5550=16.718" ini 50 =i8. OS hours. 
b. Use the sample proportion pe asa point estimator for the population proportion 52 
ee ee 
The point estimate ts pr=1241473=0.084.P ~~ 1473 0,084. 


c. Use the sample standard deviation s,. as a point estimator ‘for the population standard 


deviation o. The point estimate is sy = 0.664 hour. 


FOR PRACTICE, TRY EXERCISE 1 


The Idea of a Confidence Interval 


When Mr. Schiel’s class did the “Mystery mean” activity, they obtained a sample mean of 

x =240.80.% = 240.80. Then they added and subtracted 10 to get the interval of plausible 
values from 230.80 to 250.80. Where did the 10 come from? Their reasoning was based on the 
sampling distribution of the sample mean from Chapter 7: 


e Because the population distribution is Normal, the sampling distribution of x @ is also 
Normal. 


e In about 95% of samples, the value of x # will be within 2 standard deviations 


(20. = 20 — 2(5) = 10) 


16 


(20x=22016=2(5)=10) of the mystery mean p. 
e Therefore, in about 95% of samples, the value of the mystery mean p will be within 2 


_ : (20; we NO GY ae 10) _ 
standard deviations (20x =22016=2(5)=10) v16 of x @, 
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When the estimate of a parameter is reported as an interval of values, it is called an interval 


estimate, or confidence interval. 


DEFINITION Confidence interval 


A confidence interval gives an interval of plausible values for a parameter based 
on sample data. 


Plausible does not mean the same thing as possible. You could argue that just about any 
value of a parameter is possible. Plausible means that we shouldn’t be surprised if any one of 
the values in the interval is equal to the value of the parameter. Based on their calculations, the 
class shouldn’t be surprised if Mr. Schiel revealed that the mystery mean was any value from 
230.80 to 250.80. However, it would be surprising if the mystery mean was less than 230.80 or 
greater than 250.80. 


We use an interval of plausible values rather than a single point estimate to increase our 
confidence that we have a correct value for the parameter. Of course, as the cartoon illustrates, 
there is a trade-off between the amount of confidence we have that our estimate is correct and 
how much information the interval provides. 
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Confidence intervals are constructed so that we know how much confidence we should have 
in the interval. The most common confidence level is 95%. You will learn how to interpret 
confidence levels shortly. 


DEFINITION Confidence level 


The confidence level C gives the overall success rate of the method used to 
calculate the confidence interval. That is, in C% of all possible samples, the 
interval computed from the sample data will capture the true parameter value. 


The Associated Press and the NORC Center for Public Affairs Research recently asked a 
random sample of U.S. adults how much financial difficulty they would experience if they had 
to pay an unexpected bill of $1000 right away. Overall, 65% of respondents admitted they 
would have “a little” or “a lot” of difficulty. A summary of the study reported that the 95% 


confidence interval for the proportion of U.S. adults who would admit to experiencing some 
financial difficulty is 0.613 to 0.687. That is, they are 95% confident that the interval from 
0.613 to 0.687 captures the true proportion of all U.S. adults who would admit to experiencing 
some financial difficulty paying an unexpected bill of $1000 right away. 


AP® EXAM TIP 


When interpreting a confidence interval, make sure that you are describing the 
parameter and not the statistic. It’s wrong to say that we are 95% confident the 
interval from 0.613 to 0.687 captures the proportion of U.S. adults who admitted 
they would experience financial difficulty. The “proportion who admitted they would 
experience financial difficulty” is the sample proportion, which is known to be 0.65. 
The interval gives plausible values for the proportion who would admit to 
experiencing some financial difficulty if asked. 


INTERPRETING A CONFIDENCE INTERVAL 


To interpret a C% confidence interval for an unknown parameter, say, “We are C% 
confident that the interval from to captures the [parameter in context].” 


To create an interval of plausible values for a parameter, we need two components: a point 
estimate to use as the midpoint of the interval and a margin of error to account for sampling 
variability. The structure of a confidence interval is 


We can visualize a C% confidence interval like this: 


Point estimate 


Margin of error < < > Margin of error 


C% confidence interval for parameter 


Earlier, we learned that the 95% confidence interval for the proportion of all U.S. adults 
who would admit to experiencing some financial difficulty paying an unexpected bill of $1000 
right away is 0.613 to 0.687. This interval could also be expressed as 


0.65+0.0370.65 + 0.037 


Point estimate 


Margin of error +» Margin of error 


0.60 0.65 0.70 


0.65 — 0.037 = 0.613 0.65 + 0.037 = 0.687 


95% confidence interval for p = proportion of all U.S. adults 
who would admit to experiencing some financial difficul ty 


Confidence intervals reported in the media are often presented as a point estimate and a margin 
of error. 


DEFINITION Margin of error 


The margin of error of an estimate describes how far, at most, we expect the 
estimate to vary from the true population value. That is, in a C% confidence 
interval, the distance between the point estimate and the true parameter value will 
be less than the margin of error in C% of all samples. 


In addition to estimating a parameter, we can also use confidence intervals to assess claims 
about a parameter, as in the following example. 


EXAMPLE | Who will win the election? (>) 
Interpreting a confidence interval 


PROBLEM: Two weeks before a presidential election, a polling organization asked a 
random sample of registered voters the following question: “If the presidential election were 
held today, would you vote for Candidate A or Candidate B?” Based on this poll, the 95% 
confidence interval for the population proportion who favor Candidate A is (0.48, 0.54). 


Burlingham/Shutterstock.com 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? What is the margin of 
error? 


c. Based on this poll, a political reporter claims that the majority of registered voters favor 
Candidate A. Use the confidence interval to evaluate this 


SOLUTION: 


a. We are 95% confident that the interval from 0.48 to 0.54 captures the true proportion 
of all registered voters who favor Candidate A in the election. 
: nee _ 0.484054 _ 
b. point estimate=0.48+0.542=0.51 DOME ROMERO Xe 2 0.51 


margin of error=0.54-0.51=0.03 margin oferror = 0.54—0.51 = 0.03 


The point estimate is the midpoint of the interval. The margin of error is the 


distance from the point estimate to the endpoints of the interval. 


c. Because there are plausible values of p less than or equal to 0.50 in the confidence 
interval, the interval does not give convincing evidence that a majority (more than 
50%) of registered voters favor Candidate A. 


Any value from 0.48 to 0.54 is a plausible value for the true proportion who favor 


Candidate A. 


FOR PRACTICE, TRY EXERCISE 5 


Interpreting Confidence Level 


What does it mean to be 95% confident? The following activity gives you a chance to explore 
the meaning of the confidence level. 


pePLe> 


! ¢ 
ACTIVITY The Confidence Intervals applet Y 


In this activity, you will use the Confidence Intervals applet to learn what it means to say 
that we are “95% confident” that our confidence interval captures the parameter value. 


1. Go to highschool.bfwpub.com/tps6e and launch the applet. Use the default settings: 
confidence level 95% and sample size n = 20. The graphs at the top of the display show 
the distribution of the population and the sampling distribution of the sample mean x £ 
for the sample size chosen. Both are centered at the population mean p. 


2. Click “Sample” to choose an SRS and display the resulting confidence interval. The 20 
values in the sample are marked with yellow dots. The confidence interval is displayed as 
a horizontal line segment with a dot representing the sample mean x = in the middle of 
the interval. 


3. Did the interval capture the population mean p (what the applet calls a “hit”)? Click 
“Sample” a total of 10 times. How many of the intervals captured the population mean pl? 
Note: So far, you have used the applet to take 10 SRSs, each of size n = 20. Be sure you 
understand the difference between sample size and the number of samples taken. 


4. Reset the applet. Click “Sample 25” twice to choose 50 SRSs and display the confidence 
intervals based on those samples. How many intervals captured the parameter pl’? Keep 
clicking “Sample 25” and observe the value of “Percent hit.” What do you notice? 


—— Population Distribution ‘iN 
Confidence Level (C): 95 — - Sampling Distribution ; \ 


Sample Size (n): 20 


SAMPLE 25 
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Hit! 48 Total: 50 
Percent hit: 96% Lx 
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RESET 


5. Repeat Step 4 using a 90% confidence level. 
6. Repeat Step 4 using an 80% confidence level. 


7. Summarize what you have learned about the relationship between confidence level and 
“Percent hit” after taking many samples. 
We will investigate the effect of changing the sample size later. 


As the activity confirms, the confidence level is the overall capture rate if the method is 
used many times. 


INTERPRETING A CONFIDENCE LEVEL 


To interpret a confidence level C, say, “If we were to select many random samples from a 
population and construct a C% confidence interval using each sample, about C% of the 
intervals would capture the [parameter in context].” 


Let’s revisit the presidential election poll to practice interpreting a confidence level. 


EXAMPLE | Another look at the election poll (>) 
Interpreting a confidence level 


PROBLEM: Two weeks before a presidential election, a polling organization asked a 
random sample of registered voters the following question: “If the presidential election were 
held today, would you vote for Candidate A or Candidate B?” Based on this poll, the 95% 
confidence interval for the population proportion who favor Candidate A is (0.48, 0.54). 
Interpret the confidence level. 


SOLUTION: 


if we were to select many random samples of registered voters and construct a 95% 
confidence interval using each sample, about 95% of the intervals would capture the true 
proportion of all registered voters who favor Candidate A in the election. 


Remember that interpretations of confidence level are about the method used to 


construct the interval—not one particular interval. In fact, we can interpret confidence 
levels before data are collected! 


FOR PRACTICE, TRY EXERCISE 11 


In the preceding example, there are only two possibilities: 


. The interval from 0.48 to 0.54 captures the population proportion p. Our random sample was 
one of the many samples for which the difference between p and p/P is less than the margin 
of error. When using a 95% confidence level, about 95% of samples result in a confidence 
interval that captures p. 

’. The interval from 0.48 to 0.54 does not capture the population proportion p. Our random 


sample was one of the few samples for which the difference between p and p/P is greater 
than the margin of error. When using a 95% confidence level, only about 5% of all samples 


result in a confidence interval that fails to capture p. 

Without conducting a census, we cannot know whether our sample is one of the 95% for 
which the interval captures p or whether it is one of the unlucky 5% that does not. The 
statement that we are “95% confident” is shorthand for saying, “We got these numbers using a 
method that gives correct results for 95% of samples.” 


AP® EXAM TIP 


On a given problem, you may be asked to interpret the confidence interval, the 


confidence level, or both. Be sure you understand the difference: the confidence 
interval gives a set of plausible values for the parameter and the confidence level 
describes the overall capture rate of the method. 


@ The confidence level does not tell us the probability that a particular confidence 
interval captures the population parameter. Once a particular confidence interval is 
calculated, its endpoints are fixed. And because the value of a parameter is also a constant, a 
particular confidence interval either includes the parameter (probability = 1) or doesn’t include 
the parameter (probability = 0). As Figure 8.1 (on the next page) illustrates, no individual 95% 
confidence interval has a 95% probability of capturing the true parameter value. 


— Population Distribution 


Confidence Level (C): 95 - - Sampling Distribution 


Sample Size (n): 20 


SAMPLE 
Each of the black 
intervals captures 
ut with probability = 1 


SAMPLE 25 


Hit: 48 Total: 50 
Percent hit: 96% 


RESET 
This red interval 


captures ff with 
probability = 0 


FIGURE 8.1 Image from the Confidence Intervals applet showing that the probability a particular 
95% confidence interval captures the true parameter value is either O or 1 (and not 0.95). 


CHECK YOUR UNDERSTANDING 


A 


v The Pew Research Center and Smithsonian magazine recently quizzed a random sample 
of 1006 U.S. adults on their knowledge of science.2 One of the questions asked, “Which gas 
makes up most of the Earth’s atmosphere: hydrogen, nitrogen, carbon dioxide, or oxygen?” 
A 95% confidence interval for the proportion who would correctly answer nitrogen is 0.175 


to 0.225. 

1. Interpret the confidence interval. 

2. Interpret the confidence level. 

3. Calculate the point estimate and the margin of error. 


4. If people guess one of the four choices at random, about 25% should get the answer 


correct. Does this interval provide convincing evidence that less than 25% of all U.S. 
adults would answer this question correctly? Explain your reasoning. 


What Affects the Margin of Error? 


Why settle for 95% confidence when estimating an unknown parameter? Do larger random 
samples yield “better” intervals? The Confidence Intervals applet will shed some light on these 
questions. 


ACTIVITY Exploring margin of error with the Confidence Intervals 


pePLe> 


applet y 


In this activity, you will use the applet to explore the relationship between the confidence 
level, the sample size, and the margin of error. 


Part 1: Adjusting the Confidence Level 


1. Go to highschool.bfwpub.com/tps6e and launch the Confidence Intervals applet. Use the 
default settings: confidence level 95% and sample size n = 20. Click “Sample 25” to select 
25 SRSs and make 25 confidence intervals. 


—- Population Distribution ra 
Confidence Level (C): 95 - - Sampling Distribution yA 


Sample Size (n): 20 
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SAMPLE 25 


Hit! 23 Total: 25 
Percent hit: 92% 


RESET 


. Change the confidence level to 99%. What happens to the length of the confidence 


intervals? 


. Now change the confidence level to 90%. What happens to the length of the confidence 


intervals? 


. Finally, change the confidence level to 80%. What happens to the length of the confidence 


intervals? 


. Summarize what you learned about the relationship between the confidence level and the 


margin of error for a fixed sample size. 


Part 2: Adjusting the Sample Size 


6. 


Reset the applet to the default settings: confidence level 95% and sample size n = 20. 
Click “Sample 25” to select 25 SRSs and make 25 confidence intervals. 


. Using the slider, increase the sample size to n = 100. What do you notice about the length 


of the confidence intervals? 


. Using the slider, increase the sample size to n = 200. What do you notice about the length 


of the confidence intervals? 


. Summarize what you learned about the relationship between the sample size and the 


margin of error for a fixed confidence level. 


.0. Does increasing the sample size increase the capture rate (percent hit)? Use the applet to 
investigate. 


As the activity illustrates, the price we pay for greater confidence is a wider interval. If 
we're Satisfied with 80% confidence, then our interval of plausible values for the parameter 
will be much narrower than if we insist on 90%, 95%, or 99% confidence. For example, here is 
an 80% confidence interval and a 95% confidence interval for Mr. Schiel’s mystery mean. 
Unfortunately, intervals constructed at an 80% confidence level will capture the true value of 
the parameter much less often than intervals that use a 95% confidence level. 


° 80% confidence 


e « 95% confidence 
ys ca ar 
230 232 234 236 238 240 242 244 246 248 250 252 254 


The activity also shows that we can get a more precise estimate of a parameter by 
increasing the sample size. Larger samples generally yield narrower confidence intervals at any 
confidence level. However, larger samples don’t affect the capture rate and cost more time and 
money to obtain. 


DECREASING THE MARGIN OF ERROR 


In general, we prefer an estimate with a small margin of error. The margin of error gets 
smaller when: 


e The confidence level decreases. To obtain a smaller margin of error from the same data, 
you must be willing to accept less confidence. 


e The sample size n increases. In general, increasing the sample size n reduces the margin 
of error for any fixed confidence level. 


To see why these facts are true, let’s look a bit more closely at the method Mr. Schiel’s 
class used to calculate a confidence interval for his mystery mean. They started with a point 
estimate of x =240.80.% = 240.80. Then they added and subtracted 2 standard deviations to get 
the interval of plausible values from 230.80 to 250.80. 


We could rewrite this interval as 
point estimate + margin of error 
S = Zee 


240.80 + 22 
V16 


point estimate+margin of errorx +20x 240.80+22016 


This leads to the more general formula for a confidence interval: 


statistic + (critical value) - (standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


The critical value depends on both the confidence level C and the sampling distribution of 
the statistic. Mr. Schiel’s class used a critical value of 2 to be 95% confident. If they wanted to 
be 99.7% confident, they could have gone 3 standard deviations in each direction. Greater 
confidence requires a larger critical value. 


DEFINITION Critical value 


The critical value is a multiplier that makes the interval wide enough to have the 
stated capture rate. 


The margin of error also depends on the standard deviation of the statistic. As you learned 
in Chapter 7, the sampling distribution of a statistic will have a smaller standard deviation 
when the sample size is larger. This is why the margin of error decreases as you increase the 
sample size. 


WHAT THE MARGIN OF ERROR DOESN’T ACCOUNT FOR When we calculate a 
confidence interval, we include the margin of error because we expect the value of the point 
estimate to vary somewhat from the parameter. However, the margin of error accounts for only 
the variability we expect from random sampling. It does not account for practical difficulties, 
such as undercoverage and nonresponse in a sample survey. These problems can produce 
estimates that are much farther from the parameter than the margin of error would suggest. 
Remember this unpleasant fact when reading the results of an opinion poll or other sample 
survey. @ The margin of error does not account for any sources of bias in the data 


collection process. 


EXAMPLE |What’s your GPA? (>) 
Factors that affect the margin of error 


OJO Images Ltd/Alamy 


PROBLEM: As part of a project about response bias, Ellery surveyed a random sample of 
25 students from her school. One of the questions in the survey required students to state 
their GPA aloud. Based on the responses, Ellery said she was 90% confident that the interval 
from 3.14 to 3.52 captures the mean GPA for all students at her school.4 


a. Explain what would happen to the length of the interval if the confidence level were 
increased to 99%. 


b. How would a 90% confidence interval based on a sample of size 200 compare to the 
original 90% interval? 


c. Describe one potential source of bias in Ellery’s study that is not accounted for by the 
margin of error. 


SOLUTION: 


a. The confidence interval would be wider because increasing the confidence level increases 
the margin of error. 


To increase the confidence level (capture rate), we need to use a larger critical 


value, which increases the margin of error. 


b. The confidence interval would be narrower because increasing the sample size decreases 
the margin of error. 


c. The margin of error doesn’t account for the fact that many students might lie about 
their GPAs when having to respond without anonymity. The mean GPA for all 
students might be even less than 3.14! 


Increasing the sample size decreases the standard deviation of the sampling 


distribution of the sample mean (assuming the standard deviation of the sample 
doesn’t change). 


FOR PRACTICE, TRY EXERCISE 19 


Section 8.1 Summary 


e To estimate an unknown population parameter, start with a statistic that will provide a 
reasonable guess. The chosen statistic is a point estimator for the parameter. The specific 
value of the point estimator that we use gives a point estimate for the parameter. 


e A confidence interval gives an interval of plausible values for an unknown population 
parameter. The interval is computed from the data and has the form 


point estimate + margin of errorPoint estimate st margin of error 
When calculating a confidence interval, it is common to use the form 


Statistic + (critical value)-(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


To interpret a C% confidence interval, say, “We are C% confident that the interval from 
to captures the [parameter in context].” Be sure that your interpretation describes 
a parameter and not a Statistic. 


The confidence level C is the success rate (capture rate) of the method that produces the 
interval. If you use 95% confidence intervals often, about 95% of your intervals will capture 
the true parameter value. You don’t know whether a particular 95% confidence interval 
calculated from a set of data actually captures the true parameter value. 


Other things being equal, the margin of error of a confidence interval gets smaller as: 
. the confidence level C decreases; 
« the sample size n increases. 


Remember that the margin of error for a confidence interval only accounts for chance 
variation, not other sources of error like nonresponse and undercoverage. 


Section 8.1 Exercises 


In Exercises 1—4, identify the point estimator you would use to estimate the parameter and 
calculate the value of the point estimate. 


1. 


pg 495 |») Got shoes? How many pairs of shoes, on average, do female teens have? To 


find out, an AP® Statistics class selected an SRS of 20 female students from their school. 
Then they recorded the number of pairs of shoes that each student reported having. Here 
are the data: 


50 26 26 31 57 19 24 pad 23 38 
13 50 13 34 23 30 49 13 15 51 


Got shoes? The class in Exercise 1 wants to estimate the variability in the number of pairs 
of shoes that female students have by estimating the population standard deviation o. 


Going to the prom Tonya wants to estimate the proportion of seniors in her school who 
plan to attend the prom. She interviews an SRS of 50 of the 750 seniors in her school and 
finds that 36 plan to go to the prom. 


Reporting cheating What proportion of students are willing to report cheating by other 
students? A student project put this question to an SRS of 172 undergraduates at a large 
university: “You witness two students cheating on a quiz. Do you go to the professor?” 


Only 19 answered “Yes.”2 


5. PQ 499 |») Prayer in school A New York Times/CBS News Poll asked a random 
sample of U.S. adults the question “Do you favor an amendment to the Constitution that 
would permit organized prayer in public schools?” Based on this poll, the 95% confidence 
interval for the population proportion who favor such an amendment is (0.63, 0.69). 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? What is the margin of 
error? 


c. Based on this poll, a reporter claims that more than two-thirds of U.S. adults favor such 
an amendment. Use the confidence interval to evaluate this claim. 


6. Losing weight A Gallup poll asked a random sample of U.S. adults, “Would you like to 
lose weight?” Based on this poll, the 95% confidence interval for the population 
proportion who want to lose weight is (0.56, 0.62).® 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? What is the margin of 
error? 


c. Based on this poll, Gallup claims that more than half of U.S. adults want to lose weight. 
Use the confidence interval to evaluate this claim. 


7. Bottling cola A particular type of diet cola advertises that each can contains 12 ounces of 
the beverage. Each hour, a supervisor selects 10 cans at random, measures their contents, 
and computes a 95% confidence interval for the true mean volume. For one particular 
hour, the 95% confidence interval is 11.97 ounces to 12.05 ounces. 


a. Does the confidence interval provide convincing evidence that the true mean volume is 
different than 12 ounces? Explain your answer. 


b. Does the confidence interval provide convincing evidence that the true mean volume is 
12 ounces? Explain your answer. 


8. Fun size candy A candy bar manufacturer sells a “fun size” version that is advertised to 
weigh 17 grams. A hungry teacher selected a random sample of 44 fun size bars and found 
a 95% confidence interval for the true mean weight to be 16.945 grams to 17.395 grams. 


a. Does the confidence interval provide convincing evidence that the true mean weight is 
different than 17 grams? Explain your answer. 


b. Does the confidence interval provide convincing evidence that the true mean weight is 
17 grams? Explain your answer. 


9. Shoes The AP® Statistics class in Exercise 1 also asked an SRS of 20 boys at their school 


10. 


11. 


12, 
13. 


14. 


15. 


how many pairs of shoes they have. A 95% confidence interval for 1»G—B=the true 
difference ~ #s = the true difference jy the mean number of pairs of shoes for girls and 
boys is 10.9 to 26.5. 


a. Interpret the confidence interval. 


b. Does the confidence interval give convincing evidence of a difference in the true mean 
number of pairs of shoes for boys and girls at the school? Explain your answer. 


Lying online Many teens have posted profiles on sites such as Facebook. A sample survey 
asked random samples of teens with online profiles if they included false information in 
their profiles. Of 170 younger teens (ages 12 to 14) polled, 117 said “Yes.” Of 317 older 
teens (ages 15 to 17) polled, 152 said “Yes.”2 A 95% confidence interval for pY-pO=the 
true differencePY ~ Po = the true difference jn the proportions of younger teens and older 
teens who include false information in their profile is 0.120 to 0.297. 


a. Interpret the confidence interval. 


b. Does the confidence interval give convincing evidence of a difference in the true 
proportions of younger and older teens who include false information in their profiles? 
Explain your answer. 
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More weight loss Refer to Exercise 6. Interpret the confidence level. 


Household income The 2015 American Community Survey estimated the median 
household income for each state. According to ACS, the 90% confidence interval for the 
2015 median household income in Arizona is $51,492 + $431. 


a. Interpret the confidence interval. 


b. Interpret the confidence level. 


More income The 2015 American Community Survey estimated the median household 
income for each state. According to ACS, the 90% confidence interval for the 2015 
median household income in New Jersey is $72,222 + $610. 


a. Interpret the confidence interval. 


b. Interpret the confidence level. 


How confident? The figure shows the result of taking 25 SRSs from a Normal population 
and constructing a confidence interval for the population mean using each sample. Which 
confidence level—80%, 90%, 95%, or 99%—do you think was used? Explain your 
reasoning. 


16. How confident? The figure shows the result of taking 25 SRSs from a Normal population 
and constructing a confidence interval for the population mean using each sample. Which 
confidence level—80%, 90%, 95%, or 99%—do you think was used? Explain your 
reasoning. 


17. Explaining confidence A 95% confidence interval for the mean body mass index (BMI) 
of young American women is 26.8 + 0.6. Discuss whether each of the following 
explanations is correct, based on that information. 


a. We are confident that 95% of all young women have BMI between 26.2 and 27.4. 


b. We are 95% confident that future samples of young women will have mean BMI 
between 26.2 and 27.4. 


c. Any value from 26.2 to 27.4 is believable as the true mean BMI of young American 
women. 


d. If we take many samples, the population mean BMI will be between 26.2 and 27.4 in 
about 95% of those samples. 


e. 


The mean BMI of young American women cannot be 28. 


18. Explaining confidence The admissions director for a university found that (107.8, 116.2) 
is a 95% confidence interval for the mean IQ score of all freshmen. Discuss whether each 
of the following explanations is correct, based on that information. 


19. 


20. 


21. 


a. 


b. 


There is a 95% probability that the interval from 107.8 to 116.2 contains p. 
There is a 95% chance that the interval (107.8, 116.2) contains x Z. 


This interval was constructed using a method that produces intervals that capture the 
true mean in 95% of all possible samples. 


If we take many samples, about 95% of them will contain the interval (107.8, 116.2). 


The probability that the interval (107.8, 116.2) captures p is either 0 or 1, but we don’t 
know which. 
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a. 


Explain what would happen to the length of the interval if the confidence level were 
increased to 99%. 


How would a 95% confidence interval based on double the sample size compare to the 
original 95% interval? 


The news article goes on to say: “The theoretical errors do not take into account 
additional errors resulting from the various practical difficulties in taking any survey of 
public opinion.” List some of the “practical difficulties” that may cause errors which 
are not included in the +3 percentage point margin of error. 


Losing weight Refer to Exercise 6. 


a. 


Explain what would happen to the length of the interval if the confidence level was 
decreased to 90%. 


. How would a 95% confidence interval based on triple the sample size compare to the 


original 95% interval? 


. As Gallup indicates, the 3 percentage point margin of error for this poll includes only 


sampling variability (what they call “sampling error”). What other potential sources of 
error (Gallup calls these “nonsampling errors”) could affect the accuracy of the 95% 
confidence interval? 


California’s traffic People love living in California for many reasons, but traffic isn’t one 
of them. Based on a random sample of 572 employed California adults, a 90% confidence 
interval for the average travel time to work for all employed California adults is 23 


minutes to 26 minutes.2 


a. 


8 


Interpret the confidence level. 


b. Name two things you could do to reduce the margin of error. What drawbacks do these 
actions have? 


c. Describe how nonresponse might lead to bias in this survey. Does the stated margin of 
error account for this possible bias? 


22. Employment in California Each month the government releases unemployment 
Statistics. The stated unemployment rate doesn’t include people who choose not to be 
employed, such as retirees. Based on a random sample of 1000 California adults, a 99% 
confidence interval for the proportion of all California adults employed in the workforce is 


0.532 to 0.612.2 
a. Interpret the confidence level. 


b. Name two things you could do to reduce the margin of error. What drawbacks do these 
actions have? 


c. Describe how untruthful answers might lead to bias in this survey. Does the stated 
margin of error account for this possible bias? 


Multiple Choice Select the best answer for Exercises 23-26. 
Exercises 23 and 24 refer to the following setting. A researcher plans to use a random sample 
of houses to estimate the mean size (in square feet) of the houses in a large population. 


23. The researcher is deciding between a 95% confidence level and a 99% confidence level. 
Compared with a 95% confidence interval, a 99% confidence interval will be 


a. narrower and would involve a larger risk of being incorrect. 
b. wider and would involve a smaller risk of being incorrect. 

c. narrower and would involve a smaller risk of being incorrect. 
d. wider and would involve a larger risk of being incorrect. 


e. wider and would have the same risk of being incorrect. 


24. After deciding on a 95% confidence level, the researcher is deciding between a sample of 
size n = 500 and a sample of size n = 1000. Compared with using a sample size of n = 
500, a confidence interval based on a sample size of n = 1000 will be 


a. narrower and would involve a larger risk of being incorrect. 
b. wider and would involve a smaller risk of being incorrect. 

c. narrower and would involve a smaller risk of being incorrect. 
d. wider and would involve a larger risk of being incorrect. 


e. narrower and would have the same risk of being incorrect. 


25. In a poll conducted by phone, 
I. Some people refused to answer questions. 
II. People without telephones could not be in the sample. 


III. Some people never answered the phone in several calls. 


Which of these possible sources of bias is included in the +2% margin of error announced 
for the poll? 


a. I only 

b. Il only 

c. I only 

d. I, Il, and III 
e. None of these 


26. You have measured the systolic blood pressure of an SRS of 25 company employees. A 
95% confidence interval for the mean systolic blood pressure for the employees of this 
company is (122, 138). Which of the following statements is true? 


a. 95% of the sample of employees have a systolic blood pressure between 122 and 138. 


b. 95% of the population of employees have a systolic blood pressure between 122 and 
138. 


c. If the procedure were repeated many times, 95% of the resulting confidence intervals 
would contain the population mean systolic blood pressure. 


d. If the procedure were repeated many times, 95% of the time the population mean 
systolic blood pressure would be between 122 and 138. 


e. If the procedure were repeated many times, 95% of the time the sample mean systolic 
blood pressure would be between 122 and 138. 


Recycle and Review 


27. Power lines and cancer (4.2, 4.3) Does living near power lines cause leukemia in 
children? The National Cancer Institute spent 5 years and $5 million gathering data on this 
question. The researchers compared 638 children who had leukemia with 620 who did not. 
They went into the homes and measured the magnetic fields in children’s bedrooms, in 
other rooms, and at the front door. They recorded facts about power lines near the family 
home and also near the mother’s residence when she was pregnant. Result: No association 
between leukemia and exposure to magnetic fields of the kind produced by power lines 
was found.22 


a. Was this an observational study or an experiment? Justify your answer. 


b. Does this study prove that living near power lines doesn’t cause cancer? Explain your 
answer. 


28. Sisters and brothers (3.1, 3.2) How strongly do physical characteristics of sisters and 
brothers correlate? Here are data on the heights (in inches) of 11 adult pairs:44 


Brother 71 68 66 67 70 71 70 73 72 65 66 
Sister 69 64 65 63 65 62 65 64 66 59 62 


a. Construct a scatterplot using brother’s height as the explanatory variable. Describe 
what you see. 


b. Use technology to compute the least-squares regression line for predicting sister’s 
height from brother’s height. 


c. Interpret the slope in context. 


d. Calculate and interpret the residual for the first pair listed in the table. 


SECTION 8.2 Estimating a Population Proportion 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Large Counts conditions for 
constructing a confidence interval for a population proportion. 


Determine the critical value for calculating a C% confidence interval for a 
population proportion using a table or technology. 


Construct and interpret a confidence interval for a population proportion. 


Determine the sample size required to obtain a C% confidence interval for a 
population proportion with a specified margin of error. 


In Section 8.1, we saw that a confidence interval can be used to estimate an unknown 
population parameter. We are often interested in estimating the proportion p of some outcome 
in a population. Here are some examples: 


e What proportion of U.S. adults are unemployed right now? 

e What proportion of high school students have cheated on a test? 

e What proportion of pine trees in a national park are infested with beetles? 
e What proportion of college students pray daily? 


e What proportion of a company’s laptop batteries last as long as the company claims? 


This section shows you how to construct and interpret a confidence interval for a 
population proportion. The following activity gives you a feel for what lies ahead. 


ACTIVITY The beads 


Before class, your teacher prepared a large population of different-colored beads and put 
them into a container. In this activity, you and your team will create an interval estimate for 
the actual proportion of beads in the population that have a particular color (e.g., red). 


1. Asaclass, discuss how to use the cup provided to get a simple random sample of beads 
from the container. 


2. Have a student take an SRS of beads. Separate the beads into two groups: those that are 
red and those that aren’t. Count the number of beads in each group. 


3. Determine the point estimate p/? for the unknown population proportion p of red beads in 
the container. 


4. In teams of 3 or 4 students, calculate a 95% confidence interval for the true proportion of 
red beads p. Hint: You may want to refer to Section 7.2 about the sampling distribution of 
a sample proportion. 


5. Compare the results with those of the other teams in the class. Discuss any problems you 
encountered and how you dealt with them. 


billnoll/Getty Images 


Constructing a Confidence Interval for p 


When Mr. Buckley’s class did the preceding activity, the random sample of 251 beads they 
selected included 107 red beads and 144 other beads. Starting with the general formula for a 
confidence interval from Section 8.1: 


point estimate=statistict+margin of error(critical value) - (standard deviation of statistic) 
point estimate + margin of error 


=statistic + (critical value) - (standard deviation of statistic) 


they determined the values to substitute into the formula using what they learned about the 
sampling distribution of the sample proportion p/? in Section 7.2. 


e Statistic: The class decided to use p\=107/251=0.426P = 107/251 = 0.426 pecause p? is 
an unbiased estimator of p. 


e Critical value: The class decided to use critical value = 2 based on the 68—95-—99.7 rule for 
Normal distributions. 


e Standard deviation of statistic: Remembering that the standard deviation of the sampling 
| p(l—p) 


distribution of pip is op\=p(1-p)n,”? = VW > the class decided to use p=0.426 
(Ade 1—0426) 0.031 


p = 0.426 in the formula to get 0.426(1-0.426)251=0.031.V 251 


Pp 
(unknown) 


+— Values of p —— 


The class’s 95% confidence interval is 
0.426 + 2(0.031) 
0.426=0.426++2(0.031)0.062=(0.364, 0.488) = 9-426 + 0.062 = (0.364, 0.488) 


The class is 95% confident that the interval from 0.364 to 0.488 captures the true proportion of 
red beads in Mr. Buckley’s container. 


The interval constructed by Mr. Buckley’s class is nearly correct. Here is the exact formula 
for a one-sample z interval for a population proportion. 


ONE-SAMPLE z INTERVAL FOR A POPULATION PROPORTION 


When the conditions are met, a C% confidence interval for the unknown proportion p is 


pez? 2 
n 


pAtz*p\(1-p)n 


where z* is the critical value for the standard Normal curve with C% of its area between —z* 
and z*. 


CONDITIONS FOR ESTIMATING p There are three conditions that must be met for this 
formula to be valid—one for each of the three components in the formula. 


Let’s discuss them one at a time. 


. The Random Condition and pA? When our data come from a random sample, we can 
make an inference about the population from which the sample was selected. If the data 
come from a convenience sample or voluntary response sample, we should have no 
confidence that the resulting value of pip is a good estimate of p. To be sure that pAP isa 
valid point estimate, we check the Random condition: The data come from a random sample 
from the population of interest. 


Another important reason for random selection is to introduce chance into the data- 
production process. We can model chance behavior with a probability distribution, like the 
sampling distributions of Chapter 7. Our method of calculation assumes that the data come 
from an SRS of size n from the population of interest. Other types of random samples (e.g., 
stratified or cluster) might be preferable to an SRS in a given setting, but they require more 
complex calculations than the ones we’!l use. When an example, exercise, or AP® Statistics 
exam item refers to a “random sample” without saying “stratified” or “cluster,” you can 
assume the sample is an SRS. 


’. The Large Counts Condition and z* When Mr. Buckley’s class used the 68—95—99.7 rule 
to determine the critical value for their confidence interval, they were assuming that the 
sampling distribution of p\? was approximately Normal. If the distribution of p/? is 
approximately Normal, then p/? will be within 2 standard deviations of p in about 95% of 
samples. This means the value of p will be within 2 standard deviations of p/P in about 95% 
of samples. Thus, using a critical value of 2 will result in approximately 95% confidence. 


From what we learned in Chapter 7, we can use the Normal approximation to the 
sampling distribution of p/P as long as np > 10 and n(1 — p) > 10. In practice, of course, we 
don’t know the value of p. If we did, we wouldn’t need to construct a confidence interval for 
it! In large random samples, p’? will be close to p. So we replace p by p’? checking the 
Large Counts condition:np\>10"P 2 10 and n(1-p)=10."(4 —p) = 10. 

When the Large Counts condition is met, we can use a Normal distribution to calculate 
the critical value z* for any confidence level. You’ll learn how to do this soon. 


| pap) 
3. The 10% Condition and pA(i-pA)nV "As you learned in Chapter 7, the formula for 


the standard deviation of the sampling distribution of pAP assumes that individual 
observations are independent. However, when we’re sampling without replacement from a 
(finite) population, the observations are not independent because knowing the outcome of 
one trial helps us predict the outcome of future trials. Whenever we are sampling without 
replacement—which is nearly always—we need to check the 10% condition: n < 0.10N, 
where n is the sample size and N is the population size. 

When the 10% condition is met, the standard deviation of the sampling distribution of p/ 
P is 
| p(—p) 

if) 


oo; = 
p 


op=p(1-p)n 
Because we don’t know the value of p, we replace it with the sample proportion pip as we 
did with the Large Counts condition. The resulting quantity is called the standard error 
(SE) of the sample proportion pp: 


DEFINITION Standard error 


When the standard deviation of a statistic is estimated from data, the result is 
called the standard error of the statistic. 


/p(1—p 


SE- = v 
SEp/=p(1-p‘)n F nL 


oO: ‘ 
Some books refer to op’ ? as the standard error of p’P and to what we call the standard error 
as the estimated standard error. 


Like the standard deviation, the standard error describes how much the sample proportion p/ 
P typically varies from the population proportion p in repeated SRSs of size n. 


Here is a summary of the three conditions for constructing a confidence interval for p. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A PROPORTION 


e Random: The data come from a random sample from the population of interest. 


© 10%: When sampling without replacement, n < 0.10N. 


e Large Counts: Both npAnp and n(1-pryr( ~ P) are at least 10. 


Let’s verify that the conditions were met for the interval calculated by Mr. Buckley’s class. 


EXAMPLE |The beads (>) 
Checking conditions 


Studioshots/Alamy 


PROBLEM: Mr. Buckley’s class wants to construct a confidence interval for p = the true 
proportion of red beads in the container, which includes over 3000 beads. Recall that the 
class’s sample of 251 beads had 107 red beads and 144 other beads. Check if the conditions 
for constructing a confidence interval for p are met. 


SOLUTION: 


© Random: The class took a random sample of 251 beads from the container. 4 
© 10%: 251 beads is less than 10% of 3000. / 


e Large Counts: 
np\=251(107251)=107210 andn(1-pA)=251(1-107251)=251(144251)=144210 
np = 251 (#7) = 107 > 10 and 


n(1 —p) = 251 (1 = wt) = 251 (#4) = 144 > 10 
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Notice that np\”"’? and n(1-payr( ~ P) are the number of successes and failures in the 
sample. In the preceding example, we could address the Large Counts condition simply by 
saying, “The numbers of successes (107) and failures (144) in the sample are both at least 10.” 


WHAT HAPPENS IF ONE OF THE CONDITIONS IS VIOLATED? If the data come from 
a voluntary response or convenience sample, there’s no point in constructing a confidence 
interval for p. Violation of the Random condition severely limits our ability to make any 
inference beyond the data at hand. 


Simulation studies have shown that a variation of our method for calculating a 95% confidence 
interval for p can result in closer to a 95% capture rate in the long run, especially for small sample 
sizes. This simple adjustment, first suggested by Edwin Bidwell Wilson in 1927, is sometimes 
called the p/us four estimate. Just pretend we have four additional observations, two of which are 
successes and two of which are failures. Then calculate the “plus four interval” using the plus 


four estimate in place of p4? and sample size n + 4 in our usual formula. 


The figure shows a screen shot from the Confidence Intervals for Proportions applet at the 
book’s website, highschool.bfwpub.com/tps6e. We set n = 20 and p = 0.25. The Large Counts 
condition is not met because np = 20(0.25) = 5. We used the applet to generate 1000 random 
samples and construct 1000 95% confidence intervals for p. Only 902 of those 1000 intervals 
contained p = 0.25, a capture rate of 90.2%. When the Large Counts condition is violated, the 
capture rate will almost always be less than the one advertised by the confidence level when the 
method is used many times. 
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Violating the 10% condition means that we are sampling a large fraction of the population, 
which should be a good thing! Unfortunately, the formula we use for the standard deviation of 
p/P gives a value that is too large when the 10% condition is violated. Confidence intervals 


based on this formula are longer than they need to be. If many 95% confidence intervals for a 
population proportion are constructed in this way, more than 95% of them will capture p. The 
actual capture rate is almost always greater than the reported confidence level when the 10% 
condition is violated. 


CALCULATING CRITICAL VALUES How do we get the critical value z* for our 
confidence interval? If the Large Counts condition is met, we can use a Normal curve. For their 
95% confidence interval in the beads activity, Mr. Buckley’s class used a critical value of 2. 
Based on the 68-95-99.7 rule for Normal distributions, they figured that p’? will be within 2 
standard deviations of p in about 95% of all samples. Thus, p should be within 2 standard 
deviations of pp in about 95% of all samples. 


We can get a more precise critical value from Table A or a calculator. As Figure 8.2 shows, 
the central 95% of the standard Normal distribution is marked off by 2 points, z* and —z*. We 
use the * to remind you that this is a critical value, not a standardized score that has been 
calculated from data. 


Standard 
Normal curve 


Area = ().95 


Area = ().025 Area = ().025 


—Z" Q) z 
FIGURE 8.2 Finding the critical value z* for a 95% confidence interval starts by labeling the 
middle 95% under a standard Normal curve and calculating the area in each tail. 


Because of the symmetry of the Normal curve, the area in each tail is 0.05 / 2 = 0.025. Once 
you know the tail areas, there are two ways to calculate the value of z*: 


e Using Table A: Search the body of Table A to find the point —z* with area 0.025 to its left. 
The entry z = —1.96 is what we are looking for, so z* = 1.96. 


Z 05 06 07 
-2.0 0202 0197 0192 
11.9 0256 _— a 0244 
-1.8 0322 0314 .0307 


e Using technology: The command invNorm(area:0.025, mean:0, SD:1) gives z = —1.960, so 
z* = 1.960. 


Now we can officially calculate a 95% confidence interval using the data from Mr. 


Buckley’s class: 
pA+z*pA(1—-p/)n=0.426+1.960.426(1-0.426)251=0.426+0.061=(0.365, 0.487) 

, /p(1—p) /0.426(1—0.426) 

2" | — 0. > + 1.964 | —cc— 

pry 0.426 + 1.964, = 


= 0.426 + 0.061 = (0.365, 0.487) 


TL 


Mr. Buckley’s class is 95% confident that the interval from 0.365 to 0.487 captures the true 
proportion of red beads in his container. Notice that the margin of error is slightly smaller for 
this interval than when the class used 2 for the critical value. 

To find a level C confidence interval, we need to catch the central C% under the standard 
Normal curve. Here’s an example that shows how to get the critical value z* for a different 
confidence level and use it to calculate a confidence interval. 


EXAMPLE | Read any good books lately? (>) 
Calculating a critical value and confidence interval 


PROBLEM: According to a 2016 Pew Research Center report, 73% of American adults 
have read a book in the previous 12 months. This estimate was based on a random sample of 
1520 American adults.42 Assume the conditions for inference are met. 


a. Determine the critical value z* for a 90% confidence interval for a proportion. 


b. Construct a 90% confidence interval for the proportion of all American adults who have 
read a book in the previous 12 months. 


c. Interpret the interval from part (b). 


Lisa Solonynko/Alamy 
SOLUTION: 


a. Using Table A: z* = 1.64 or z* = 1.65 
Using technology: The command invNorm (area:0.05, mean:o, SD:1) gives Z = 


=1,045, 50 27 = 1.645, 
0.73 + 1.645,/ 
= 0.73 + 0.019 


b. 0.73£1.6450.73(1-0.73)1520=0.7340.019=(0.711,0.749) = (0.711, 0.749) 


Area = 0.90 


Area = 0.05 


Area = 0.05 


—Z*= -1.645 0 Z*= 1.645 


c. We are 90% confident that the interval from 0.711 to 0.749 captures p = the true 
proportion of American adults who have read a book in the previous 12 months. 
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CHECK YOUR UNDERSTANDING 


i. 


”, Sleep Awareness Week begins in the spring with the release of the National Sleep 
Foundation’s annual poll of U.S. sleep habits and ends with the beginning of daylight 


savings time, when most people lose an hour of sleep.t2 In the foundation’s random sample 
of 1029 U.S. adults, 48% reported that they “often or always” got enough sleep during the 


last 7 nights. 
1. Identify the parameter of interest. 
2. Check if the conditions for constructing a confidence interval for p are met. 


3. Find the critical value for a 99% confidence interval. Then calculate the interval. 


4. Interpret the interval in context. 


Putting It All Together: The Four-Step Process 


Taken together, the examples about Mr. Buckley’s class and the beads activity show you how 
to get a confidence interval for an unknown population proportion p. Because there are many 


details to remember when constructing and interpreting a confidence interval, it is helpful to 
group them into four steps. 


CONFIDENCE INTERVALS: A FOUR-STEP PROCESS 


State: State the parameter you want to estimate and the confidence level. 
4STEP 


Plan: Identify the appropriate inference method and check conditions. 
Do: If the conditions are met, perform calculations. 


Conclude: Interpret your interval in the context of the problem. 


The next example illustrates the four-step process in action. 


EXAMPLE |Distracted walking (>) fi. 
4 STEP 
Constructing and interpreting a confidence interval for p 


mimagephotography 
/Shutterstock.com 


PROBLEM: A recent poll of 738 randomly selected cell-phone users found that 170 of the 
respondents admitted to walking into something or someone while talking on their cell 
phone./4 Construct and interpret a 95% confidence interval for the proportion of all cell- 
phone users who would admit to walking into something or someone while talking on their 
cell phone. 


SOLUTION: 


STATE: 95% C4 for We the true proportion of all cell-phone users who would admit to 
walking into something or someone while talking on their cell phone. 


STATE: State the parameter you want to estimate and the confidence level. 


PLAN: One-sample z interval for 52 


PLAN: Identify the appropriate inference method and check conditions. 


© Random: Random sample of 738 cell-phone users. / 
© 10%: It is reasonable to assume that 738 is less than 10% of all cell-phone users. 4 


e Large Counts: The number of successes (1 70) and the number of ‘failures (738 = 170 = 
568) are both at least 10.¥ 


Remember that np*”-P is the number of successes and n(i-pryr( ~P) is the 
number of failures: 


170 
np = 738 (Fs 
7 


DO:p=170/738=0.2300.230+1.9600.230(1-0.230)738=0.230+0.030=(0.200, 0.260) 
DO: p = 170/738 = 0.230 


/ 0,230(1--0.230) 
738 


0.230 + 1.960) 
= 0.230 + 0.030 
= (0.200, 0.260) 


DO: If the conditions are met, perform calculations. 


CONCLUDE: We are 95% confident that the interval from 0.200 to 0.260 captures p = 
the true proportion of all cell-phone users who would admit to walking into something or 
someone while talking on their cell phone. 


CONCLUDE: Interpret your interval in the context of the problem. 


Make sure your conclusion is about the population proportion (users who would 


admit) and not the sample proportion (those who admitted). 


FOR PRACTICE, TRY EXERCISE 41 


AP® EXAM TIP 


If a free response question asks you to construct and interpret a confidence interval, 


you are expected to do the entire four-step process. That includes clearly defining 
the parameter, identifying the procedure, and checking conditions. 


og Remember that the margin of error in a confidence interval only accounts for 
sampling variability! There are other sources of error that are not taken into account. As is the 
case with many surveys, we are forced to assume that respondents answer truthfully. If they 
don’t, then we shouldn’t be 95% confident that our interval captures the truth. Other problems 
like nonresponse and question wording can also affect the results of a survey. Lesson: 
Sampling beads is much easier than sampling people! 

Your calculator will handle the “Do” part of the four-step process, as the following 
Technology Corner illustrates. 


17. Technology Corner | CONSTRUCTING A CONFIDENCE 
INTERVAL FOR A POPULATION PROPORTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


The TI-83/84 can be used to construct a confidence interval for an unknown population 
proportion. We’|l demonstrate using the previous example. Of n = 738 cell-phone users 
surveyed, X = 170 admitted to walking into something or someone while talking on their 
cell phone. To construct a confidence interval: 


e Press |STAT|, then choose TESTS and 1-PropZInt. 


e When the 1-PropZInt screen appears, enter X = 170, n = 738, and confidence level = 0.95. 
Note: The value you enter for X is the number of successes (not the proportion of 
successes) and must be an integer. 

e Highlight “Calculate” and press |ENTER|. The 95% confidence interval for p is reported, 
along with the sample proportion p/? and the sample size, as shown here. 


x:17@ 

n:738 
C-Level:8.95 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN HP fl 


(0.19997,8.26073) 
P=0, 2303523035 
n=738 


AP® EXAM TIP 


You may use your calculator to compute a confidence interval on the AP® Statistics 
exam. But there’s a risk involved. If you just give the calculator answer with no work, 
you'll get either full credit for the “Do” step (if the interval is correct) or no credit (if 
it's wrong). If you opt for the calculator-only method, be sure to complete the other 
three steps, including identifying the procedure (e.g., one-sample z interval for p) 
and give the interval in the Do step (e.g., 0.19997 to 0.26073). 


Choosing the Sample Size 


In planning a study, we may want to choose a sample size that allows us to estimate a 
population proportion within a given margin of error. The formula for the margin of error (ME) 
in the confidence interval for p is 


ME =2*\/ 
ME=z*p((1-p)n n 


To calculate the sample size, substitute values for ME, z*, and pip , and solve for n. 

Unfortunately, we won’t know the value of p/? until after the study has been conducted. This 

means we have to guess the value of p\? when choosing n. Here are two ways to do this: 

.. Use a guess for pAP based on a pilot (preliminary) study or past experience with similar 
studies. 


’. Use pA=0.5P = 0.5 as the guess. The margin of error ME is largest when pA=0.5P = 0.5 | so 


this guess is conservative. If we get any other p\? when we do our study, the margin of error 
will be smaller than planned. 


Once you have a guess for p/?, the formula for the margin of error can be solved to give the 
required sample size n. 


SAMPLE SIZE FOR DESIRED MARGIN OF ERROR WHEN ESTIMATING 
p 


To determine the sample size n that will yield a C% confidence interval for a population 
proportion p with a maximum margin of error ME, solve the following inequality for n: 


2 | p(1—-P) < ME 
z*p\(1-p’)n<ME n 
where pip is a guessed value for the sample proportion. The margin of error will always be 
less than or equal to ME if you use p\=0.5P = 9.5, 


Here’s an example that shows you how to determine the sample size. 


EXAMPLE |Customer satisfaction (>) 
Determining sample size 


wbritten/Getty Images 


PROBLEM: A company has received complaints about its customer service. The managers 
intend to hire a consultant to carry out a survey of customers. Before contacting the 
consultant, the company president wants some idea of the sample size that she will be 
required to pay for. One value of interest is the proportion p of customers who are satisfied 
with the company’s customer service. She decides that she wants the estimate to be within 3 
percentage points (0.03) at a 95% confidence level. How large a sample is needed? 


SOLUTION: 


We have no idea about the true proportion p of satisfied customers, So we decide to 


use p*=0.5? = 9-5 as our guess. 


0.5(1-0.5)n<0.03196 V2” 1.96 


Divide both sides by 1.96. 


Square both sides. 
0.5(1-0.5) < n(n) 
0.5(1 — 0.5)<n(0.031.96)2 1.96 


Multiply both sides by n. 


).0° 


Divide both sides by Pea _ 


n>1067.111 n > 1067.111 


Make sure to follow the inequality when rounding your answer. 


The sample needs to include at least 1068 customers. 


FOR PRACTICE, TRY EXERCISE 49 


Why not round to the nearest whole number—in this case, 1067? Because a smaller sample 
size will result in a larger margin of error, possibly more than the desired 3 percentage points 


for the poll. In general, we round to the next highest integer when solving for sample size to 
make sure the margin of error is less than or equal to the desired value. 


CHECK YOUR UNDERSTANDING 


. 


y Refer to the preceding example about the company’s customer satisfaction survey. 


1. In the company’s prior-year survey, 80% of customers surveyed said they were satisfied. 
Using this value as a guess for pr, find the sample size needed for a margin of error of at 
most 3 percentage points with 95% confidence. How does this compare with the required 
sample size from the example? 


2. What if the company president demands 99% confidence instead of 95% confidence? 
Would this require a smaller or larger sample size, assuming everything else remains the 
same? Explain your answer. 


Section 8.2 Summary 


e The conditions for constructing a confidence interval about a population proportion p are as 
follows: 


= Random: The data come from a random sample from the population of interest. 
© 10%: When sampling without replacement, n < 0.10N. 


. Large Counts: Both np and n(1—pA)"P andn(1— Pp) are at least 10. That is, the number of 
successes and the number of failures in the sample are both at least 10. 


e When the conditions are met, the C% one-sample z interval for p is 


. [Pay 


pAtz*pA(1—pAyn> rue \ nT 


where z* is the critical value for the standard Normal curve with C% of its area between —z* 
and z*. 


e When we use the value of p/? to estimate the standard deviation of the sampling distribution 


i /p(l-p) 
Fi ie SE; =/— -- ; 


T 


of p\P, the result is the standard error of p: SEp\=p(1-p))n. 


When asked to construct and interpret a confidence interval, follow the four-step process: 
STATE: State the parameter you want to estimate and the confidence level. 
“i PLAN: Identify the appropriate inference method and check conditions. 


DO: If the conditions are met, perform calculations. 


CONCLUDE: Interpret your interval in the context of the problem. 
e The sample size needed to obtain a confidence interval with a maximum margin of error ME 
for a population proportion involves solving 
.. [bU-B) 
z*p/\(1-p/)n<ME Vn 


< ME 


for n, where p/P is a guessed value for the sample proportion, and z* is the critical value for 
the confidence level you want. Use p\=0.5? = 9-5 if you don’t have a good idea about the 
value of p/.P: 


8.2 Technology Corner | @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


17. Constructing a confidence interval for a population proportion 


Section 8.2 Exercises 


For Exercises 29 to 32, check whether each of the conditions is met for calculating a 
confidence interval for the population proportion p. 


29. pg 513 Rating school food Latoya wants to estimate the proportion of the seniors 
Ne 


at her boarding school who like the cafeteria food. She interviews an SRS of 50 of the 175 
seniors and finds that 14 think the cafeteria food is good. 


30. High tuition costs Glenn wonders what proportion of the students at his college believe 
that tuition is too high. He interviews an SRS of 50 of the 2400 students and finds 38 of 
those interviewed think tuition is too high. 


31. Salty chips A quality control inspector takes a random sample of 25 bags of potato chips 
from the thousands of bags filled in an hour. Of the bags selected, 3 had too much salt. 


32. Whelks and mussels The small round holes you often see in seashells were drilled by 
other sea creatures, who ate the former dwellers of the shells. Whelks often drill into 
mussels, but this behavior appears to be more or less common in different locations. 
Researchers collected whelk eggs from the coast of Oregon, raised the whelks in the 
laboratory, then put each whelk in a container with some delicious mussels. Only 9 of 98 


whelks drilled into a mussel.12 


33. The 10% condition When constructing a confidence interval for a population proportion, 
we check that the sample size is less than 10% of the population size. 


34. 


35. 


36. 


37. 


38. 


39. 


a. Why is it necessary to check this condition? 


b. What happens to the capture rate if this condition is violated? 
The Large Counts condition When constructing a confidence interval for a population 
proportion, we check that both np/ and n(1—-p4)"? andn(1—P) are at least 10. 


a. Why is it necessary to check this condition? 


b. What happens to the capture rate if this condition is violated? 


Pg 516 |») Selling online According to a recent Pew Research Center report, many 


American adults have made money by selling something online. In a random sample of 
4579 American adults, 914 reported that they earned money by selling something online 
in the previous year.1® Assume the conditions for inference are met. 


a. Determine the critical value z* for a 98% confidence interval for a proportion. 


b. Construct a 98% confidence interval for the proportion of all American adults who 
would report having earned money by selling something online in the previous year. 


c. Interpret the interval from part (b). 


Reporting cheating What proportion of students are willing to report cheating by other 
students? A student project put this question to an SRS of 172 undergraduates at a large 
university: “You witness two students cheating on a quiz. Do you go to the professor?” 
Only 19 answered “Yes.”42 Assume the conditions for inference are met. 


a. Determine the critical value z* for a 96% confidence interval for a proportion. 


b. Construct a 96% confidence interval for the proportion of all undergraduates at this 
university who would go to the professor. 


c. Interpret the interval from part (b). 


More online sales Refer to Exercise 35. Calculate and interpret the standard error of p/\? 
for these data. 


More cheating Refer to Exercise 36. Calculate and interpret the standard error of pip for 
these data. 


Going to the prom Tonya wants to estimate what proportion of her school’s seniors plan 
to attend the prom. She interviews an SRS of 50 of the 750 seniors in her school and finds 
that 36 plan to go to the prom. 


a. Identify the population and parameter of interest. 
b. Check conditions for constructing a confidence interval for the parameter. 


c. Construct a 90% confidence interval for p. 


d. Interpret the interval in context. 


40. Student government The student body president of a high school claims to know the 
names of at least 1000 of the 1800 students who attend the school. To test this claim, the 
student government advisor randomly selects 100 students and asks the president to 
identify each by name. The president successfully names only 46 of the students. 


a. Identify the population and parameter of interest. 
b. Check conditions for constructing a confidence interval for the parameter. 
c. Construct a 99% confidence interval for p. 


d. Interpret the interval in context. 


41. pg 517 |») Video games A Pew Research Center report on gamers and gaming 


estimated that 49% of U.S. adults play video games on a computer, TV, game console, or 
portable device such as a cell phone. This estimate was based on a random sample of 2001 
U.S. adults. Construct and interpret a 95% confidence interval for the proportion of all 


U.S. adults who play video games.48 


42. September 11 A recent study asked U.S. adults to name 10 historic events that occurred 
in their lifetime that have had the greatest impact on the country. The most frequently 
chosen answer was the September 11, 2001, terrorist attacks, which was included by 76% 
of the 2025 randomly selected U.S. adults. Construct and interpret a 95% confidence 
interval for the proportion of all U.S. adults who would include the 9/11 attacks on their 
list of 10 historic events. 


43. Age and video games Refer to Exercise 41. The study also estimated that 67% of adults 
aged 18—29 play video games, but only 25% of adults aged 65 and older play video 
games. 


a. Explain why you do not have enough information to give confidence intervals for these 
two age groups separately. 


b. Do you think a 95% confidence interval for adults aged 18-29 would have a larger or 
smaller margin of error than the estimate from Exercise 41? Explain your answer. 
44. Age and September 11 Refer to Exercise 42. The study also reported that 86% of 
millennials included 9/11 in their top-10 list and 70% of baby boomers included 9/11. 


a. Explain why you do not have enough information to give confidence intervals for 
millennials and baby boomers separately. 


b. Do you think a 95% confidence interval for baby boomers would have a larger or 
smaller margin of error than the estimate from Exercise 42? Explain your answer. 


45. Food fight A 2016 survey of 1480 randomly selected U.S. adults found that 55% of 
respondents agreed with the following statement: “Organic produce is better for health 


than conventionally grown produce.”!2 


a. Construct and interpret a 99% confidence interval for the proportion of all U.S. adults 
who think that organic produce is better for health than conventionally grown produce. 


b. Does the interval from part (a) provide convincing evidence that a majority of all U.S. 
adults think that organic produce is better for health? Explain your answer. 


46. Three branches According to a recent study by the Annenberg Foundation, only 36% of 
adults in the United States could name all three branches of government. This was based 
on a survey given to a random sample of 1416 U.S. adults.22 


a. Construct and interpret a 90% confidence interval for the proportion of all U.S. adults 
who could name all three branches of government. 


b. Does the interval from part (a) provide convincing evidence that less than half of all 
U.S. adults could name all three branches of government? Explain your answer. 


47. Prom totals Use your interval from Exercise 39 to construct and interpret a 90% 
confidence interval for the total number of seniors planning to go to the prom. 


48. Student body totals Use your interval from Exercise 40 to construct and interpret a 99% 
confidence interval for the total number of students at the school that the student body 
president can identify by name. Then use your interval to evaluate the president’s claim. 


49. pg 520 |») School vouchers A small pilot study estimated that 44% of all American 


adults agree that parents should be given vouchers that are good for education at any 
public or private school of their choice. 


a. How large a random sample is required to obtain a margin of error of at most 0.03 with 
99% confidence? Answer this question using the pilot study’s result as the guessed 
value for p/P. 


b. Answer the question in part (a) again, but this time use the conservative guess p4=0.5 
P = 0.5, By how much do the two sample sizes differ? 


50. Can you taste PTC? PTC is a substance that has a strong bitter taste for some people and 
is tasteless for others. The ability to taste PTC is inherited. About 75% of Italians can taste 
PTC, for example. You want to estimate the proportion of Americans who have at least 
one Italian grandparent and who can taste PTC. 


a. How large a sample must you test to estimate the proportion of PTC tasters within 0.04 
with 90% confidence? Answer this question using the 75% estimate as the guessed 
value for p/P. 


b. Answer the question in part (a) again, but this time use the conservative guess p4=0.5 
P = 0.5, By how much do the two sample sizes differ? 


51. Starting a nightclub A college student organization wants to start a nightclub for students 


under the age of 21. To assess support for this proposal, they will select an SRS of 
students and ask each respondent if he or she would patronize this type of establishment. 
What sample size is required to obtain a 90% confidence interval with a margin of error of 
at most 0.04? 


52. Election polling Gloria Chavez and Ronald Flynn are the candidates for mayor in a large 
city. We want to estimate the proportion p of all registered voters in the city who plan to 
vote for Chavez with 95% confidence and a margin of error no greater than 0.03. How 
large a random sample do we need? 


53. Teens and their TV sets According to a Gallup Poll report, 64% of teens aged 13 to 17 
have TVs in their rooms. Here is part of the footnote to this report: 


These results are based on telephone interviews with a randomly selected national sample 
of 1028 teenagers in the Gallup Poll Panel of households, aged 13 to 17. For results 
based on this sample, one can say ... that the maximum error attributable to sampling and 
other random effects is +3 percentage points. In addition to sampling error, question 
wording and practical difficulties in conducting surveys can introduce error or bias into 
the findings of public opinion polls.24 


a. We omitted the confidence level from the footnote. Use what you have learned to 
estimate the confidence level, assuming that Gallup took an SRS. 


b. Give an example of a “practical difficulty” that could lead to bias in this survey. 


54. Gambling and the NCAA Gambling is an issue of great concern to those involved in 
college athletics. Because of this concern, the National Collegiate Athletic Association 
(NCAA) surveyed randomly selected student athletes concerning their gambling-related 
behaviors.22 Of the 5594 Division I male athletes who responded to the survey, 3547 
reported participation in some gambling behavior. This includes playing cards, betting on 
games of skill, buying lottery tickets, betting on sports, and similar activities. A report of 
this study cited a 1% margin of error. 


a. The confidence level was not stated in the report. Use what you have learned to 
estimate the confidence level, assuming that the NCAA took an SRS. 


b. The study was designed to protect the anonymity of the student athletes who responded. 
As a result, it was not possible to calculate the number of students who were asked to 
respond but did not. How does this fact affect the way that you interpret the results? 


Multiple Choice Select the best answer for Exercises 55-58. 


55. A Gallup poll found that only 28% of American adults expect to inherit money or valuable 
possessions from a relative. The poll’s margin of error was +3 percentage points at a 95% 
confidence level. This means that 


a. the poll used a method that gets an answer within 3% of the truth about the population 
95% of the time. 


56. 


o7. 


58. 


b. the percent of all adults who expect an inheritance must be between 25% and 31%. 


c. if Gallup takes another poll on this issue, the results of the second poll will lie between 
25% and 31%. 


d. there’s a 95% chance that the percent of all adults who expect an inheritance is between 
25% and 31%. 


e. Gallup can be 95% confident that between 25% and 31% of the sample expect an 
inheritance. 


Refer to Exercise 55. Suppose that Gallup wanted to cut the margin of error in half from 3 
percentage points to 1.5 percentage points. How should they adjust their sample size? 


a. Multiply the sample size by 4. 
b. Multiply the sample size by 2. 
c. Multiply the sample size by 1/2. 
d. Multiply the sample size by 1/4. 


e. There is not enough information to answer this question. 


Most people can roll their tongues, but many can’t. The ability to roll the tongue is 
genetically determined. Suppose we are interested in determining what proportion of 
students can roll their tongues. We test a simple random sample of 400 students and find 
that 317 can roll their tongues. The margin of error for a 95% confidence interval for the 
true proportion of tongue rollers among students is closest to which of the following? 


a. 0.0008 
b. 0.02 
c. 0.03 
d. 0.04 
e. 0.05 


A newspaper reporter asked an SRS of 100 residents in a large city for their opinion about 
the mayor’s job performance. Using the results from the sample, the C% confidence 
interval for the proportion of all residents in the city who approve of the mayor’s job 
performance is 0.565 to 0.695. What is the value of C? 


a. 82 
b. 86 
c. 90 
d. 95 


e. 99 
Review and Recycle 


59. Oranges (6.1, 7.3) A home gardener likes to grow various kinds of citrus fruit. One of his 
mandarin orange trees produces oranges whose circumferences follow a Normal 
distribution with mean 21.1 cm and standard deviation 1.8 cm. 


a. What is the probability that a randomly selected orange from this tree has a 
circumference greater than 22 cm? 


b. What is the probability that a random sample of 20 oranges from this tree has a mean 
circumference greater than 22 cm? 


60. More oranges (1.2, 2.2) The gardener in the previous exercise randomly selects 20 
mandarin oranges from the tree and counts the number of seeds in each orange. Here are 


the data: 
3 4669 11 11 12 13 138 14 14 16 17 22 23 23 24 28 «30 


a. Graph the data using a dotplot. 


b. Based on your graph, is it plausible that the number of seeds from oranges on this tree 
follows a distribution that is approximately Normal? Explain your answer. 


SECTION 8.3 Estimating a Population Mean 


LEARNING TARGETS By the end of the section, you should be able to: 


e Determine the critical value for calculating a C% confidence interval for a 
population mean using a table or technology. 


State and check the Random, 10%, and Normal/Large Sample conditions for 


constructing a confidence interval for a population mean. 
Construct and interpret a confidence interval for a population mean. 


Determine the sample size required to obtain a C% confidence interval for a 
population mean with a specified margin of error. 


Inference about a population proportion usually arises when we study categorical variables. 
We learned how to construct and interpret confidence intervals for a population proportion p in 
Section 8.2. To estimate a population mean, we have to record values of a quantitative variable 
for a sample of individuals. It makes sense to try to estimate the mean amount of sleep that 
students at a large high school got last night but not their mean eye color! In this section, we’ll 
examine confidence intervals for a population mean p. 


The Problem of Unknown ao 


In Section 8.2, we used this formula for a confidence interval for a population proportion: 


, » [(PaB) 


+ 
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In more general terms, this is 


point estimate+ margin of error= statistic+ (critical value)(standard deviation of the statistic) 
point estimate -+ margin of error 


= statistic + (critical value)(standard deviation of the statistic) 


The formula sheet on the AP® Statistics exam includes the general formula stated here. However, 


some people prefer to use 
statistic + (critical value)(Standard error of the statistic) 
statistic + (critical value)(standard error of the statistic) 


because we are usually estimating the standard deviation with data from a sample. 


A confidence interval for a population mean has a formula with the same structure. Using 


a/./n 


the sample mean x = as the point estimate for the population mean p and o/n as the 


standard deviation of the sampling distribution of x ® gives 


Co 
zt 2*— 
x +z*on vn 
This interval is called a one-sample z interval for a population mean. Unfortunately, if we 
don’t know the true value of 1, we rarely know the true value of o either. We can use s, as an 


estimate for o, but things don’t work out as nicely as we might like. Let’s explore why this is 
true. 


prPLe> 


j ; sae te : (ee 
ACTIVITY Confidence interval BINGO! ¢€ 


In this activity, you will investigate the problem caused by replacing o with s, when 
calculating a confidence interval for p/, and how to fix it. 


photowind/Shutterstock 


A farmer wants to estimate the mean weight (in grams) of all tomatoes grown on her 
farm. To do so, she will select a random sample of 4 tomatoes, calculate the mean weight (in 
grams), and use the sample mean x @ to create a 99% confidence interval for the population 
mean p!. Suppose that the weights of all tomatoes on the farm are approximately Normally 
distributed with a mean of 100 grams and a standard deviation of 20 grams. 


Let’s use an applet to simulate taking an SRS of n = 4 tomatoes and calculating a 99% 
confidence interval for p using three different methods. 


Method 1 (assuming o is known) 


i a 


x +z*on=x +2.576204 vn v4 


1. Launch the Simulating Confidence Intervals for a Population Parameter applet at 
www.rossmanchance.com/applets. Choose “Means” from the drop-down menu and leave 
the other menus as “Normal” and “z with sigma.” Then enter p = 100, o = 20, n = 4, and 
intervals = 1. Set the confidence level = 99%, as shown in the screen shot. 


Means 


Normal 


z with sigma 


Intervals 1 


Sample 


Conflevel 99 


Recalculate 


2. Press “Sample.” The applet will select an SRS of n = 4 tomatoes and calculate a 
confidence interval for p. This interval will be displayed as a horizontal line. The vertical 
line identifies = 100. If the interval captures pz = 100, the interval will be green. If the 
interval misses pt = 100, the interval will be red. 


100 

Means v 
Normal v 
z with sigma v 

u | 100 

o 20 

n 4 

Intervals 1 


Sample 


Conf level 99 % 


Recalculate 


Intervals containing 
1/1 = 100.0% 
Running Total 
1/1 = 100.0% 


Sort 


Reset 


60 80 100 120 140 


3. Press “Sample” many times, shouting out “BINGO!” whenever you get an interval that 
misses p! = 100 (i.e., a red interval). Stop when your teacher calls time. 


4. How well did Method 1 work? Compare the running total in the lower left corner with the 
stated confidence level of 99%. 


Method 2 (using s, as an estimate for 0) 


Sa 8 
S42 — = 242576— 


x +z*sxn=x +2.576sx4 vn v4 
1. Press the “Reset” button in the lower left. Then, in the third drop-down menu, choose “z 
with s.” This replaces o with s, in the formula for the confidence interval. Keep everything 
else the same. 


2. Press “Sample” many times, shouting out “BINGO!” whenever you get an interval that 
misses p! = 100 (i.e., a red interval). Stop when your teacher calls time. 


3. Did Method 2 work as well as Method 1? Discuss with your classmates. 


4. Now change the number of intervals to 50 and press “Sample” 20 times, for a total of 
more than 1000 intervals. Compare the running total in the lower left corner with the 
stated confidence level of 99%. What do you notice about the length of the intervals that 
missed? 


To increase the capture rate of the intervals to 99%, we need to make the intervals 
longer. We can do this by using a different critical value, called a ¢t* critical value. You’ll 
learn how to calculate this number soon. 


Method 3 (using s, as an estimate for o and a ¢* critical value instead of a z* critical value) 


zi t*— = Fit —— 
x +t*sxn=x +???sx4 yn v4 


66499 
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1. Press the “Reset” button in the lower left. Then, in the third drop-down menu, choose 
and change “Intervals” to 1. Keep everything else the same. 


2. Press “Sample” many times, shouting out “BINGO!” whenever you get an interval that 
misses p! = 100 (i.e., a red interval). Stop when your teacher calls time. 

3. Did Method 3 work better than Method 2? How does it compare to Method 1? Discuss 
with your classmates. 

4. Now change the number of intervals to 50 and press “Sample” 20 times, for a total of at 
least 1000 intervals. Compare the running total in the lower left corner with the stated 
confidence level of 99%. What do you notice about the length of the intervals compared to 
Method 2? 


Figure 8.3 shows the results of repeatedly constructing confidence intervals using a z* 
critical value and the sample standard deviation s,, as described in Method 2 of the preceding 
activity. Of the 1000 intervals constructed, only 925 (that’s 92.5%) captured the population 
mean. That’s far below our desired 99% confidence level! 


u! 100 Even though ¥ wasn't far 
from y, this interval 
o 20 es missed because the value 
nl4 =— — | of s, was much smaller 
than o. 


Intervals 100 


Sample 


Conf level 99 


Recalculate 


Intervals containing p 
90/100 = 90.0% 
Running Total 
925/1000 = 92.5% 


Sort 


Reset 


FIGURE 8.3 Screen shot from the Simulating Confidence Intervals for a Population Parameter 
applet at www.rossmanchance.com/applets, showing the capture rate of a “99%” confidence 
interval when using a Z* critical value with the sample standard deviation sy. 


What went wrong? The intervals that missed (those in red) came from samples with small 
standard deviations s, and from samples in which the sample mean x @ was far from the 


population mean p. In those cases, multiplying gx/482/V4 by z* = 2.576 didn’t produce long 
enough intervals to reach p = 100. To achieve a 99% capture rate, we need to multiply by a 
larger critical value. But what critical value should we use? 


CALCULATING f* CRITICAL VALUES In the activity, you discovered that Method 3 
produces confidence intervals that capture p/ at the advertised confidence level. That is, when 


constructing “99%” confidence intervals using the formula x +t*sxn, vn’ about 99% of 
the intervals capture the population mean p. This interval is called a one-sample t interval for a 
population mean. 

The critical value in this interval is denoted t* because it comes from a t distribution, not 
the standard Normal distribution. The critical value t* has the same interpretation as z*: it 
measures how many standard deviations we need to extend from the point estimate to get the 
desired level of confidence. There is a different t distribution for each sample size. We specify 
a particular t distribution by giving its degrees of freedom (df). When we perform inference 


about a population mean p/ using a t distribution, the appropriate degrees of freedom are found 
by subtracting 1 from the sample size n, making df =n - 1. 

Figure 8.4 shows two different t distributions, along with the standard Normal distribution. 
Because the t distributions are wider than the standard Normal distribution, t* critical values 
will always be larger than z* critical values for a specified level of confidence. As you learned 
in the “Bingo” activity, we need to use a critical value larger than z* to compensate for the 
variability introduced by using the sample standard deviation s, as an estimate for the 
population standard deviation o. As the degrees of freedom increase, the ¢ distributions 
approach the standard Normal distribution. This makes sense because the value of s, will 


typically be closer to o as the sample size increases. 


ft distributions have 
more area in the tails 
than the standard 

Normal distribution. 


—- /,2 degrees 
of freedom 


wom t,9 degrees 
of freedom 

— Standard 

Normal 


() 


FIGURE 8.4 Density curves for the ft distributions with 2 and 9 degrees of freedom and the 
standard Normal distribution. All are symmetric with center 0. The ¢ distributions have more 
variability than the standard Normal distribution, but approach the standard Normal distribution 
as df increases. 


You will learn more about t distributions in Chapter 9. For now, we will focus on how to 
calculate the critical value t* for various sample sizes and confidence levels. 


Table B in the back of the book gives t* critical values for the t distributions. Each row in 
the table contains critical values for the t distribution whose degrees of freedom (df) appear at 
the left of the row. To make the table easy to use, several of the more common confidence 
levels C are given at the bottom of the table. 


The t distribution and the t inference procedures were invented by William S. Gosset (1876-1937). 


Gosset worked for the Guinness brewery, and his goal in life was to make better beer. He used his 
new t procedures to find the best varieties of barley and hops. Gosset’s statistical work helped 
him become head brewer. Because Gosset published under the pen name “Student,” you will 
often see the t distribution called “Student’s t” in his honor. 


THE ONE-SAMPLE t INTERVAL FOR A POPULATION MEAN 


When the conditions are met, a C% confidence interval for the unknown mean p is 


= sz 
ztt*— 


x +t*sxn Jn 


where ¢* is the critical value for the t distribution with n — 1 degrees of freedom (df) and C% 
of the area between —¢* and ¢*. 


When you use Table B to determine the correct value of t* for a given confidence interval, 
all you need to know are the confidence level C and the degrees of freedom (df). In the activity, 
we calculated 99% confidence intervals with n = 4, so df =4-1=3. 


Tail probability p 


df .02 01 .005 .0025 

1 15.89 31.82 63.66 127.3 

2 4.849 6.965 9.925 14.09 

3 3.482 4541 Sa 7.453 

6 2.054 2.326 2.576 2.807 
96% 98% 99% 99.5% 


Confidence level C 


According to the Table B excerpt in the margin, for 99% confidence and 3 degrees of 
freedom, t* = 5.841. That is, the interval should extend 5.841 standard deviations on both sides 
of the point estimate to have a capture rate of 99%. 


Unfortunately, Table B doesn’t include every possible df. If the correct df isn’t listed, use 
the greatest df available that is less than the correct df. “Rounding up” to a larger df will result 
in confidence intervals that are too narrow. The intervals won’t be wide enough to include the 
true population value as often as suggested by the confidence level. 


EXAMPLE | Finding f* critical values (») 
Using Table B 


PROBLEM: What critical value t* from Table B should be used in constructing a 
confidence interval for the population mean in each of the following settings? 


a. A 95% confidence interval based on an SRS of size n = 12 


b. A 90% confidence interval from a random sample of 48 observations 


SOLUTION: 


Gl ee =: OXON 


Tail probability p 


df 05 .025 .02 01 
10 1.812 2.228 2.359 2.764 
‘lal 1.796 Sz S9 2.328 2.718 
AD 1.782 2.179 2.303 2.681 
1.645 1.960 2.054 2.326 

90% 95% 96% 98% 


Confidence level C 


In Table B, we consult the row corresponding to df = 12 - 1 = 11. We move across 


that row to the entry that is directly above 95% confidence level on the bottom of 


the chart. 
6b. t* =1.684 
Tail probability p 
df 10 05 025 .02 
30 1.310 1.697 2.042 2 1a7 
40 1.303 Sees 7021 Doe 
50 1.299 1.676 2.009 2.109 
oo 1.282 1.645 1.960 2.054 


80% 90% 95% 96% 
Confidence level C 


With 48 observations, we want to find the f* critical value for df = 48 - 1 = 47 and 


90% confidence. There is no df = 47 row in Table B, so we use the more 
conservative df = 40. 
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The bottom row of Table B gives z* critical values. That’s because t distributions approach 
the standard Normal distribution as the degrees of freedom approach infinity. 


Because ¢t* is larger than z* for a given sample size and confidence level, intervals 
calculated with ¢* will have a higher capture rate than intervals using z*. When the conditions 
are met, the actual percent of intervals that contain the population mean should be equal to the 
stated confidence level. Using t* rather than z* solves the problem created by using the sample 
standard deviation s, as an estimate for the population standard deviation o. 


Technology will quickly produce t* critical values for any sample size. 


18. Technology Corner |USING INVERSE t @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


Most newer TI-84 calculators allow you to find critical values t* using the inverse t 
command. Let’s use the inverse t command to find the critical values in parts (a) and (b) of 
the example. 


e Press (DISTR) and choose invT(. 


e For part (a), we need to find the critical value for 95% confidence, so we want an area of 
0.025 in each tail. 
OS 2.55 or later: In the dialog box, enter these values: area: 0.025, df: 11, choose Paste, 
and then press [ENTER|. Note: For the inverse t command, the area always refers to area 
to the left. 
Older Os: Complete the command invT(0.025,11) and press [ENTER|, 


For part (b), we need 90% confidence, so we want an area of 0.05 in each tail. Use the 
command invT(0.05,47). 


invT(@.@25,11) 

-2. 200985143 
invT(@.@5.47) 

-1.677926664 


Note that the t critical values are t* = 2.201 and t* = 1.678, respectively. The critical value 
for part (b) is slightly smaller here, because we were able to use df = 47 rather than rounding 
down to df = 40. 


Now that you’ve learned how to calculate a t* critical value, it’s time to make a simple 
observation. Inference for proportions uses z; inference for means uses t. That’s one reason 
why distinguishing categorical from quantitative variables is so important. 


CHECK YOUR UNDERSTANDING 


q 


”, Use Table B to find the critical value t* that you would use for a confidence interval for 


a population mean p in each of the following settings. If possible, check your answer with 
technology. 


1. A 96% confidence interval based on a random sample of 22 observations 


2. A 99% confidence interval from an SRS of 71 observations 


Conditions for Estimating p 


As with proportions, you must check some important conditions before constructing a 
confidence interval for a population mean. The first two conditions should be familiar by now. 
The Random condition is crucial for doing inference. If the data don’t come from a random 
sample, you can’t draw conclusions about a larger population. When sampling without 
replacement, the 10% condition ensures that our formula for the standard deviation is 
approximately correct. 

The third condition is different, however. When calculating a confidence interval for a 
population proportion, we check the Large Counts condition to ensure it is appropriate to use 
the standard Normal distribution to calculate the z* critical value. When calculating a 
confidence interval for a population mean, we check the Normal/Large Sample condition to 
ensure it is appropriate to use a ¢ distribution to calculate the t* critical value. 

For the t* critical value to produce confidence intervals with a capture rate that is equal to 
the confidence level, the population distribution must be Normal. If we are told that the 
population is Normally distributed, then the condition has been met. Unfortunately, it is rare 
that we actually know a population distribution is Normal. When the population shape is 
unknown, there are two ways to satisfy the Normal/Large Sample condition. 


e If the sample size is small (n < 30), graph the sample data and ask, “Is it plausible that these 
data came from a Normal population?” If there is no strong skewness or outliers in the data, 
then the answer is “Yes.” Remember to include the graph of sample data when checking the 
Normal/Large Sample condition in this way. 

e If the sample size is large (n = 30), using a ¢* critical value will produce confidence intervals 
with a capture rate that is approximately equal to the confidence level, even when the 
population distribution is not Normal. 

Here is an image from the Simulating Confidence Intervals for a Population Parameter 
applet at www.rossmanchance.com/applets, showing the capture rate of a 95% one-sample t 
interval when selecting 1000 samples of size n = 50 from a population that is strongly skewed 
to the right (an exponential distribution). 

The capture rate isn’t quite 95%, but it’s fairly close. The agreement between the stated 
confidence level and actual capture rate will improve by increasing the sample size or by 
sampling from a population that is closer to Normal. 


Means 
Exponential 


t 


pu 0.5 


n 50 
Intervals 100 


Sample 


Conf level 95 


Recalculate 


Intervals containing py 
96/100 = 96.0% 
Running Total 
943/1000 = 94.3% 


Sort 


Reset 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A MEAN 


e Random: The data come from a random sample from the population of interest. 
© 10%: When sampling without replacement, n < 0.10N. 


Normal/Large Sample: The population has a Normal distribution or the sample size is 
large (n = 30). If the population distribution has unknown shape and n < 30, use a graph 
of the sample data to assess the Normality of the population. Do not use t procedures if 
the graph shows strong skewness or outliers. 


Here are some examples of checking the Normal/Large Sample condition. 


EXAMPLE |GPAs, wood, and SATs (>) 
Checking the Normal/Large Sample condition 


Ken Cavanagh/Alamy 


PROBLEM: Determine if the Normal/Large Sample condition is met in each of the 
following settings. 


a. To estimate the average GPA of students at your school, you randomly select 50 students. 
Here is a histogram of their GPAs: 


10) 


ox 


Frequency 


rm 


0) 
2.0 2.5 3.0 3.5 40 
GPA 


b. How much force does it take to pull wood apart? The stemplot shows the force (in 
pounds) required to pull a piece of Douglas fir apart for each of 20 randomly selected 
pieces. 


c. Suppose you want to estimate the mean SAT Math score at a large high school. Here is a 
boxplot of the SAT Math scores for a random sample of 20 students at the school: 


ee : Key: 31/3 = 313 pounds of 
force required to pull a 

25 piece of Douglas fir apart 

26/5 

27 

28/7 

29 

30/)259 

31/399 

32|033677 

33|0236 


400) S00 600 700 S00 
SAT Math score 


SOLUTION: 


a. Yes; the sample size is large (50=30)450 = 30). 


In addition, it is also plausible that the sample came from a Normal population 


because the histogram doesn’t have strong skewness or outliers. 


b. No; the ee 1s strongly skewed to the le left with possible low outliers and n= =20<30. 


n = 20 < 


It is not plausible that this sample came from a Normal population because of the 
strong skewness and outliers in the stemplot. 


c. Yes; even though io 0) — 20 < 30, the boxplot 1s only moderately skewed to the 
right and there are no outliers. 


It is plausible that this sample came from a Normal population because the boxplot 


doesn’t show strong skewness or outliers. 


FOR PRACTICE, TRY EXERCISE 63 


In the preceding example, we used a histogram, a stemplot, and a boxplot to address the 
Normal/Large Sample condition. You can also use dotplots and Normal probability plots to 
assess Normality. Each of these graphs has strengths and weaknesses, so we recommend 
following the advice of your teacher when choosing a graph to use. 


In some cases, it is challenging to determine if the skewness in a graph should be 


considered “strong.” One way to judge skewness is to compare the distance from the maximum 
to the median and from the median to the minimum. Let’s take a closer look at the stemplot 
from part (b) and the boxplot from part (c) in the preceding example. 


e Inthe stemplot that records the force required for pulling a piece of wood apart, maximum — 
median = 336 — 319.5 = 16.5 and median — minimum = 319.5 — 230 = 89.5. The half of the 
stemplot with smaller values is more than 5 times as long as the half of the stemplot with 
larger values. 


e Inthe SAT Math example, maximum - median * 775 — 525 = 250 and median — minimum ~ 
525 — 375 = 150. The right half of the boxplot (everything greater than the median, including 
the whisker) is less than twice as long as the left half. 


400 500 600 700 SOO 
SAT Math score 


The stemplot is quite a bit more skewed than the boxplot. Unfortunately, there is no 
accepted rule of thumb for identifying strong skewness. For that reason, we have chosen the 
data sets in examples and exercises to avoid borderline cases. 


AP® EXAM TIP 


If a question on the AP® Statistics exam asks you to construct and interpret a 


confidence interval, all the conditions should be met. However, you are still required 
to state the conditions and show evidence that they are met—including a graph if 
the sample size is small and the data are provided. 


Constructing a Confidence Interval for p 


When the conditions are met, a C% confidence interval for the unknown mean rp is 


Sa 
= +t*— 


x +t*sxn vn 


Because we don’t know the value of the population standard deviation o, we replace it with the 


sample standard deviation s,. The value sxnv" is called the standard error of the sample 
meanx =, or just the standard error of the mean: 


$x 


| 
SEx =sxn vn 


Like the standard deviation of x £, the standard error of x © describes how much the sample 
mean x = typically varies from the population mean p in repeated SRSs of size n. 


As with proportions, some books refer to the standard deviation of the sampling distribution of x — 
£ as the standard error and what we call the standard error of the mean as the estimated standard 
error. The standard error of the mean is often abbreviated SEM. 


The following example illustrates how to construct and interpret a confidence interval for a 
population mean. By now, you should recognize the four-step process. 


EXAMPLE |More books (>) f. 
4 STEP 
A one-sample t interval for p 


Niloo/Shutterstock.com 


PROBLEM: In the previous section, you learned that about 73% of American adults claim 
to have read a book in the previous 12 months. This was based on a 2016 Pew Research 
Center study that interviewed a random sample of 1520 American adults. The same study 
also reported that the average number of books read by the members of the sample 


(including those who reported reading 0 books) was x =12% = 12 books with a standard 
deviation of s, = 18 books.22 


a. Construct and interpret a 95% confidence interval for the mean number of books read by 
all Americans in the previous 12 months. 


b. In 2011, Pew reported that American adults read an average of 14 books in the previous 
12 months. Does your interval from part (a) provide convincing evidence that the 2016 
mean is different than 14 books? Explain your answer. 


SOLUTION: 


a. STATE: 95% CI for u = the true mean number of books read by all American adults 
in the previous 12 months. 


Make sure to do the four-step process! 


PLAN: One-sample t interval for tt 
© Random: Random sample of 1520 American adults / 
© 10%: 1520 ts less than 10% of all American adults. 4 


e Normal/Large Sample: The sample size 1s large (n=1520>30).\" = 1520 > 30). / 


DO: Using Table B: df = 1000 and t* = 1.962 
Using technology: df = 1519 and t* = 1.962 


12 + 1.962 —S— = 12+ 0.91 = (11.09, 12.91) 
12+1.962181520=12+0.91=(11.09, 12.91) V1520 


S. 
Et+t*— 


x +t*Sxn Vit 


CONCLUDE: We are 95% confident that the interval from 11.09 books to 12.91 books 
captures p= the true mean number of books read by all American adults in the previous 
12 months. 


b. Yes; because 14 is not in the confidence interval, 14 is not a plausible value for the 
mean number of books read by all American adults in the previous 12 months. 


Make sure your conclusion is in context, includes units (books), and is about a 


population mean. 
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In this example, the sample standard deviation (s, = 18) is greater than the sample mean 


(x =12@ = 12), If the population were Normally distributed, we would expect the minimum to 
be about 2 or 3 standard deviations below the mean. However, because the minimum of 0 
books is not even 1 standard deviation below the mean, we know the population distribution is 
skewed to the right. Furthermore, Pew reported that the median number of books read by the 
members of the sample was 4 books. This is quite a bit less than the mean, again suggesting 
that the population distribution is skewed to the right. Of course, because of the large sample 
size, we don’t need to worry about the shape of the population distribution when checking the 
Normal/Large Sample condition. 


Here is another example, this time with the actual data values and a smaller sample size. 


fi 


4 STEP 
Constructing a confidence interval for py 


EXAMPLE |Video screen tension (>) 


Pgiam/Getty Images 


PROBLEM: A manufacturer of high-resolution video terminals must control the tension on 
the mesh of fine wires that lies behind the surface of the viewing screen. Too much tension 
will tear the mesh, and too little will allow wrinkles. The tension is measured by an electrical 
device with output readings in millivolts (mV). Some variation is inherent in the production 
process. Here are the tension readings from a random sample of 20 screens from a single 
day’s production: 


269.5 297.0 269.6 283.3 304.8 280.4 233.5 257.4 317.5 327.4 
264.7 307.7 310.0 343.3 e2o.8 342.6 338.8 340.1 374.6 330.1 


Construct and interpret a 90% confidence interval for the mean tension p of all the screens 
produced on this day. 
SOLUTION: 


STATE: 90% CI for u = the true mean tension of all the video terminals produced this 
day. 


Remember to do the four-step process! 


PLAN: One-sample t interval for UL 
e Random: Random sample of 20 screens produced that day / 
© 10%: Assume that 20 is less than 10% of all video terminals produced that day. / 


e Normal/Large Sample: The dotplot does not show strong skewness OT outliers. / 


Screen tension (mv) 


Because there is no strong skewness or outliers in the sample, it is plausible that the 


population distribution of screen tension is Normal. 


DO: x7=306.32 mVandsx=36.21mVe = 306.32 mV and s, = 36.21 mV 
With df=19, t*=1.729306.32+1.72936.2120=306.32+14.00=(292.32, 320.32) 
With df = 19, t* = 1.729 
306.32 + 1.729=— 

V2 


= 306.32 + 14.00 = (292.32, 320.32) 


When raw data are provided, use your calculator to find the mean and standard 
deviation of the sample. 


a Sx 
xz + t* — 


— 


x +t*Sxn vn 


CONCLUDE: We are 90% confident that the interval from 292.32 MV to 320.32 mV 
captures = the true mean tension in the entire batch of video terminals produced that 


day. 


Make sure your conclusion is in context, includes units (mV), and is about a 


population mean. 
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| AP® EXAM TIP | 


It is not enough just to make a graph of the data on your calculator when assessing 
Normality. You must sketch the graph on your paper to receive credit. You don’t 
have to draw multiple graphs—any appropriate graph will do. 


As you probably guessed, your calculator will compute a one-sample t interval for a 
population mean from sample data or summary statistics. 


19. Technology Corner | CONSTRUCTING A CONFIDENCE 
INTERVAL FOR A POPULATION MEAN |») 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


Confidence intervals for a population mean using t distributions can be constructed on the 
TI-83/84, avoiding the use of Table B. Here is a brief summary of the techniques when you 
have the actual data values and when you have only numerical summaries: 


1. Using summary statistics (see the “More books” example, page 535) 
e Press |STAT|, arrow over to TESTS, and choose Tinterval.... 
e On the TInterval screen, adjust your settings as shown and choose Calculate. 


NORHAL FLOAT AUTO REAL RADIAN HP fl 


Tinterval 


Inpt:Data (11.094,12.906) 
x: 12 x=12 


Sx:18 Sx=18 
n: 1520 n=1520 
C-Level:@.95 

Calculate 


2. Using raw data (see the “Video screen tension” example, page 536) 


e Enter the 20 video screen tension readings in list L1. Proceed to the TInterval screen as 
in Step 1, but choose Data as the input method. Then adjust your settings as shown and 
calculate the interval. 


NORHAL FLOAT AUTO REAL RADIAN HP a] NORMAL FLOAT AUTO REAL RADIAN HP ol 


Tinterval TiInterval 
Inet: PEL) Stats (292.32,320.32) 
List:La x=306.32 
Frea:1 Sx=36. 20928349 
C-Level:0.9 n=20 
Calculate 


CHECK YOUR UNDERSTANDING 


\ 


y Biologists studying the healing of skin wounds measured the rate at which new cells 
closed a cut made in the skin of an anesthetized newt. Here are data from a random sample 


of 18 newts, measured in micrometers (millionths of a meter) per hour:24 


29 27 34 40 22 28 14 35 26 35 12 30 23 18 11 22 23 33 


Calculate and interpret a 95% confidence interval for the mean healing rate p. 


Choosing the Sample Size 


A wise user of statistics never plans data collection without thinking about the analysis at the 
same time. You can arrange to have both high confidence and a small margin of error by taking 
enough observations. When the population standard deviation o is unknown and conditions are 
met, the C% confidence interval for p is 
Se 
= +¢*— 
x itt*sxx vx 
where ¢* is the critical value for confidence level C and degrees of freedom df = n - 1. The 
margin of error (ME) of the confidence interval is 


ME = t* —~ 
ME=t*sxx va 


To determine the sample size for a desired margin of error, it makes sense to set the 
expression for ME less than or equal to the specified value and solve the inequality for n. There 
are two problems with this approach: 


.. We don’t know the sample standard deviation s, because we haven’t produced the data yet. 


’. The critical value t* depends on the sample size n that we choose. 
The second problem is more serious. To get the correct value of t*, we need to know the 
sample size. But that’s what we’re trying to find! There is no easy solution to this problem. 
We recommend that you come up with a reasonable estimate for the population standard 
deviation o from a similar study that was done in the past or from a small-scale pilot study. By 
pretending that o is known, we can use the one-sample z interval for p: 


Using the appropriate standard Normal critical value z* for confidence level C, we can solve 
for n using 


Here is a summary of this strategy. 


SAMPLE SIZE FOR A DESIRED MARGIN OF ERROR WHEN 
ESTIMATING p 


To determine the sample size n that will yield a C% confidence interval for a population 
mean with a specified margin of error ME: 


e Get a reasonable value for the population standard deviation o from an earlier or pilot 
study. 


e Find the critical value z* from a standard Normal curve for confidence level C. 


e Set the expression for the margin of error to be less than or equal to ME and solve for n: 


o 


= < ME 


z*on<ME n 


The procedure is best illustrated with an example. 


EXAMPLE |How many monkeys? (>) 


Determining sample size from margin of error 


Richard Nowitz/Medical Images USA 


PROBLEM: Researchers would like to estimate the mean cholesterol level p of a particular 
variety of monkey that is often used in laboratory experiments. They would like their 
estimate to be within 1 milligram per deciliter (mg/dl) of the true value of pi at a 95% 
confidence level. A previous study involving this variety of monkey suggests that the 
standard deviation of cholesterol level is about 5 mg/dl. What is the minimum number of 
monkeys the researchers will need to test to get a satisfactory estimate? 


SOLUTION: 
1.96 ey 
1.96-5<1/n 
(1.96-5)? <n 
1.965ns1 96.04 < n 


For 95% confidence, z* = 1.96. Use 0 = 5 as our best guess for the standard 


deviation. Set the expression for the margin of error to be at most 1 and solve for n. 


1.965 a | 

1.96-5<1/n 

(1.96-5)? <n 
1.96° 5<1n 96.04 < Tl 


Multiply both sides by nv 


1.96 ey 

1.96-5<1/n 

(1.96-5)? <n 
(1.96° 5)2<n 96.04 <n 


Square both sides. 


1.96 61 
1.96-5<1/n 
(1.96-5)? <n 


Remember that your answer should be an integer and it must follow the inequality 


sign. 


The researchers would need at least 97 monkeys to estimate the cholesterol levels to their 
satisfaction. 
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Taking observations costs time and money. The required sample size may be impossibly 
expensive. Notice that it is the size of the sample that determines the margin of error. The size 
of the population does not influence the sample size we need. This is true as long as the 
population is much larger than the sample. 


@ Although we assumed a value for the population standard deviation o to calculate 


the required sample size, you should not assume a value for o when calculating a 
confidence interval for p. When calculating a confidence interval for p, the sample size is 
known so there is no need to use an assumed value of o. 


There are other methods of determining sample size that do not require us to use a known 
value of the population standard deviation o. These methods are beyond the scope of this text. 
Our advice: consult with a statistician when planning your study! 


CHECK YOUR UNDERSTANDING 


A 


”, Administrators at your school want to estimate how much time students spend on 
homework, on average, during a typical week. They want to estimate p at the 90% 
confidence level with a margin of error of at most 30 minutes. A pilot study indicated that 
the standard deviation of time spent on homework per week is about 154 minutes. How 
many students need to be surveyed to meet the administrators’ goal? 


Section 8.3 Summary 


Confidence intervals for the mean p of a population are based on the sample mean x @. If 
we know o (which is very rare), we use a z* critical value and the standard Normal 
distribution to calculate a one-sample z interval for p. 


In practice, we usually don’t know o. Replacing the standard deviation of the sampling 
distribution of x (ox =o/n)® (7 = 7/V™) by the standard error of x (SEx =sx/n) 
(SE; = s/n) requires use of the ¢ distribution with n — 1 degrees of freedom (df) rather 

than the standard Normal distribution when calculating a confidence interval for a population 


mean. 

The conditions for constructing a confidence interval for a population mean are 

= Random: The data come from a random sample from the population of interest. 
© 10%: When sampling without replacement, n < 0.10N. 


« Normal/Large Sample: The population has a Normal distribution or the sample size is 
large (n = 30). If the population distribution has unknown shape and n < 30, use a graph of 
the sample data to assess the Normality of the population. Do not use t procedures if the 
graph shows strong skewness or outliers. 


When conditions are met, a C% confidence interval for the mean p is given by the one- 
sample t interval: 


ttt 
x -t*sxn Vv 


The critical value t* is chosen so that the t curve with n — 1 degrees of freedom has C% of 
the area between —¢t* and t*. Use Table B or technology to calculate t*. 


Follow the four-step process—State, Plan, Do, Conclude—whenever you are asked 
“G8 to construct and interpret a confidence interval for a population mean. Remember: 
inference for proportions uses z; inference for means uses t. 


The sample size needed to obtain a confidence interval with approximate margin of error 
ME for a population mean involves solving 


Co 


= < ME 


pa 


z*on<ME Vv! 


for n, where the standard deviation o is a reasonable value from a previous or pilot study, and 
z* is the critical value for the level of confidence we want. 


8.3 Technology Corners | @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


18. 
19. 
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Section 8.3 Exercises 


61. 


62. 


63. 


Pg 530 |») Critical values What critical value t* from Table B should be used for a 


confidence interval for the population mean in each of the following situations? 
a. A 95% confidence interval based on n = 10 randomly selected observations 
b. A 99% confidence interval from an SRS of 20 observations 


c. A 90% confidence interval based on a random sample of 77 individuals 

Critical values What critical value t* from Table B should be used for a confidence 
interval for the population mean in each of the following situations? 

a. A 90% confidence interval based on n = 12 randomly selected observations 


b. A 95% confidence interval from an SRS of 30 observations 


c. A 99% confidence interval based on a random sample of size 58 


Pg 533 |p») Weeds among the corn Velvetleaf is a particularly annoying weed in 


cornfields. It produces lots of seeds, and the seeds wait in the soil for years until 
conditions are right for sprouting. How many seeds do velvetleaf plants produce? The 
histogram shows the counts from a random sample of 28 plants that came up in a cornfield 
when no herbicide was used.22 Determine if the Normal/Large Sample condition is met in 
this context. 


Frequency 


Number of seeds (thousands) 


64. Judy is interested in the reading level of a medical journal. She records the length of a 


random sample of 100 words. The histogram displays the distribution of word length for 
her sample. Determine if the Normal/Large Sample condition is met in this context. 


Frequency 


2 4 6 & LO 12 


Word length 


65. Check them all Determine if the conditions are met for constructing a confidence interval 
for the population mean in each of the following settings. 


a. How much time do students at your school spend on the Internet? You collect data from 
the 32 members of your AP® Statistics class and calculate the mean amount of time 
that these students spent on the Internet yesterday. 


b. Is the real-estate market heating up? To estimate the mean sales price, a realtor in a 
large city randomly selected 100 home sales from the previous 6 months in her city. 
These sales prices are displayed in the boxplot. 
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66. Check them all Determine if the conditions are met for constructing a confidence interval 
for the population mean in each of the following settings. 


a. We want to estimate the average age at which U.S. presidents have died. So we obtain a 
list of all U.S. presidents who have died and their ages at death. 


b. Do teens text more than they call? To find out, an AP® Statistics class at a large high 
school collected data on the number of text messages and calls sent or received by each 
of 25 randomly selected students. The boxplot displays the difference (Texts — Calls) 
for each student. 


oe 
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Difference (Texts — Calls) 


67. 


68. 


69. 


70. 


71. 


72. 


Blood pressure A medical study finds that x =114.9% = 114.9 and s, = 9.3 for the seated 


systolic blood pressure of the 27 randomly selected adults. What is the standard error of 
the mean? Interpret this value in context. 


Travel time to work A study of commuting times reports the travel times to work of a 
random sample of 20 employed adults in New York State. The mean is x =31.25 
& = 31.25 minutes and the standard deviation is s, = 21.88 minutes. What is the standard 


error of the mean? Interpret this value in context. 


Pg 535 |») Bone loss by nursing mothers Breast-feeding mothers secrete calcium into 


their milk. Some of the calcium may come from their bones, so mothers may lose bone 
mineral. Researchers measured the percent change in bone mineral content (BMC) of the 
spines of 47 randomly selected mothers during three months of breast-feeding.2° The 
mean change in BMC was —3.587% and the standard deviation was 2.506%. 


a. Construct and interpret a 99% confidence interval to estimate the mean percent change 
in BMC in the population of breast-feeding mothers. 


b. Based on your interval from part (a), do these data give convincing evidence that 
nursing mothers lose bone mineral, on average? Explain your answer. 


Reading scores in Atlanta The Trial Urban District Assessment (TUDA) is a 
government-sponsored study of student achievement in large urban school districts. 
TUDA gives a reading test scored from 0 to 500. A score of 243 is a “basic” reading level 
and a score of 281 is “proficient.” Scores for a random sample of 1470 eighth-graders in 
Atlanta had a mean of 240 with standard deviation of 42.17.22 


a. Construct and interpret a 99% confidence interval for the mean reading test score of all 
Atlanta eighth-graders. 


b. Based on your interval from part (a), is there convincing evidence that the mean reading 
test score for all Atlanta eighth-graders is less than the basic level? Explain your 
answer. 


America’s favorite cookie Ann and Tori wanted to estimate the average weight of an 
Oreo cookie to determine if it was less than advertised (34 grams for 3 cookies). They 
selected a random sample of 36 cookies and found the weight of each cookie (in grams). 
The mean weight was x =11.3921% = 11.3921 grams with a standard deviation of s, = 


0.0817 grams.28 Construct and interpret a 90% confidence interval for the true mean 
weight of an Oreo cookie. 


Fruit fly thorax lengths Fruit flies are used frequently in genetic research because of 
their quick reproductive cycle. The length of the thorax (in millimeters) was measured for 
each fly in a random sample of 49 male fruit flies. The mean length was x =0.8004 mm , 
< = 0.8004 mm , with a standard deviation of s, = 0.0782 mm.?2 Construct and interpret a 


90% confidence interval for the true mean thorax length of a male fruit fly. 


7os 


74. 


7m 


76. 


77. 


78. 


Pg 536 |») Pepperoni pizza Melissa and Madeline love pepperoni pizza, but 
sometimes they are disappointed with the small number of pepperonis on their pizza. To 
investigate, they went to their favorite pizza restaurant at 10 random times during the 
week and ordered a large pepperoni pizza.22 Here are the number of pepperonis on each 
pizza: 


47 36 25 37 46 36 49 oe 32 34 
Construct and interpret a 95% confidence interval for the true mean number of pepperonis 
on a large pizza at this restaurant. 


Catching goldfish for school Carly and Maysem plan to be preschool teachers after they 
graduate from college. To prepare for snack time, they want to know the mean number of 
goldfish crackers in a bag of original-flavored goldfish. To estimate this value, they 
randomly selected 12 bags of original-flavored goldfish and counted the number of 
crackers in each bag.24 Here are their data: 


317 = 330 325 323 332 337 324 342 330 349 335 333 


Construct and interpret a 95% confidence interval for the true mean number of crackers in 
a bag of original- flavored goldfish. 


A plethora of pepperoni? Refer to Exercise 73. 


a. Explain why it was necessary to inspect a graph of the sample data when checking the 
Normal/Large Sample condition. 


b. According to the manager of the restaurant, there should be an average of 40 


pepperonis on a large pizza. Based on the interval, is there convincing evidence that the 
average number of pepperonis is less than 40? Explain your answer. 


A school of fish Refer to Exercise 74. 


a. Explain why it was necessary to inspect a graph of the sample data when checking the 
Normal/Large Sample condition. 


b. According to the packaging, there are supposed to be 330 goldfish in each bag of 


crackers. Based on the interval, is there convincing evidence that the average number of 
goldfish is less than 330? Explain your answer. 


Pg 539 |») Estimating BMI The body mass index (BMI) of all American young 


women is believed to follow a Normal distribution with a standard deviation of about 7.5. 
How large a sample would be needed to estimate the mean BMI p in this population to 
within +1 with 99% confidence? 


The SAT again High school students who take the SAT Math exam a second time 
generally score higher than on their first try. Past data suggest that the score increase has a 
standard deviation of about 50 points. How large a sample of high school students would 


be needed to estimate the mean change in SAT score to within 2 points with 95% 
confidence? 


79. Willows in Yellowstone Writers in some fields summarize data by giving x = and its 
standard error rather than x £ and s,. Biologists studying willow plants in Yellowstone 


National Park reported their results in a table with columns labeled x +SE®% + SE The 
table entry for the heights of willow plants (in centimeters) in one region of the park was 
61.55 + 19.03.22 The researchers measured a total of 23 plants. 


a. Find the sample standard deviation s, for these measurements. 


b. A hasty reader believes that the interval given in the table is a 95% confidence interval 
for the mean height of willow plants in this region of the park. Find the actual 
confidence level for the given interval. 


80. Blink When two lights close together blink alternately, we “see” one light moving back 
and forth if the time between blinks is short. What is the longest interval of time between 
blinks that preserves the illusion of motion? Ask subjects to turn a knob that slows the 
blinking until they “see” two lights rather than one light moving. A report gives the results 


in the form “mean plus or minus the standard error of the mean.”22 Data for 12 subjects 


are summarized as 251 + 45 (in milliseconds). 


a. Find the sample standard deviation s, for these measurements. 


b. A hasty reader believes that the interval given in the report is a 95% confidence interval 
for the population mean. Find the actual confidence level for the given interval. 


Multiple Choice Select the best answer for Exercises 81-84. 


81. One reason for using a ¢t distribution instead of the standard Normal distribution to find 
critical values when calculating a level C confidence interval for a population mean is that 


a. z can be used only for large samples. 

b. z requires that you know the population standard deviation o. 

c. z requires that you can regard your data as an SRS from the population. 
d. z requires that the sample size is less than 10% of the population size. 


e. az critical value will lead to a wider interval than a t critical value. 


82. You have an SRS of 23 observations from a large population. The distribution of sample 
values is roughly symmetric with no outliers. What critical value would you use to obtain 
a 98% confidence interval for the mean of the population? 


a. 2.177 
b. 2.183 


c. 2.326 
d. 2.500 
e. 2.508 


83. A quality control inspector will measure the salt content (in milligrams) in a random 
sample of bags of potato chips from an hour of production. Which of the following would 
result in the smallest margin of error in estimating the mean salt content p/? 


a. 90% confidence; n = 25 
b. 90% confidence; n = 50 
c. 95% confidence; n = 25 
d. 95% confidence; n = 50 


e. n= 100 at any confidence level 


84. Scientists collect data on the blood cholesterol levels (milligrams per deciliter of blood) of 
a random sample of 24 laboratory rats. A 95% confidence interval for the mean blood 
cholesterol level p is 80.2 to 89.8. Which of the following would cause the most worry 
about the validity of this interval? 


a. There is a clear outlier in the data. 

b. A stemplot of the data shows a mild right skew. 

c. You do not know the population standard deviation o. 
d. The population distribution is not exactly Normal. 


e. None of these are a problem when using a t interval. 
Recycle and Review 


85. Watching TV (6.1, 7.3) Choose a young person (aged 19 to 25) at random and ask, “In 
the past seven days, how many days did you watch television?” Call the response X for 
short. Here is the probability distribution for X:24 


Days 0 1 2 3 4 5 6 7 
Probability 0.04 0.03 0.06 0.08 0.09 0.08 0.05 ee? 
a. What is the probability that X = 7? Justify your answer. 


b. The mean of the random variable X is py = 5.44 and the standard deviation is oy = 


2.14. Interpret these values. 


c. Suppose that you asked 100 randomly selected young people (aged 19 to 25) to respond 
to the question and found that the mean x = of their responses was 4.96. Would this 


result surprise you? Justify your answer. 


86. Price cuts (4.2) Stores advertise price reductions to attract customers. What type of price 
cut is most attractive? Experiments with more than one factor allow insight into 
interactions between the factors. A study of the attractiveness of advertised price discounts 
had two factors: percent of all foods on sale (25%, 50%, 75%, or 100%) and whether the 
discount was stated precisely (e.g., as in “60% off”) or as a range (as in “40% to 70% 
off”). Subjects rated the attractiveness of the sale on a scale of 1 to 7. 


a. List the treatments for this experiment, assuming researchers will use all combinations 
of the two factors. 


b. Describe how you would randomly assign 200 volunteer subjects to treatments. 
c. Explain the purpose of the random assignment in part (b). 


d. The figure shows the mean ratings for the eight treatments formed from the two 
factors.22 Based on these results, write a careful description of how percent on sale and 
precise discount versus range of discounts influence the attractiveness of a sale. 
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MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Members at a popular fitness club currently pay a $40 per month membership fee. The 
owner of the club wants to raise the fee to $50 but is concerned that some members will 
leave the gym if the fee increases. To investigate, the owner plans to survey a random 
sample of the club members and construct a 95% confidence interval for the proportion of 
all members who would quit if the fee was raised to $50. 


Explain the meaning of “95% confidence” in the context of the study. 


After the owner conducted the survey, he calculated the confidence interval to be 
0.18 + 0.075. Interpret this interval in the context of the study. 


According to the club’s accountant, the fee increase will be worthwhile if fewer 
than 20% of the members quit. According to the interval from part (b), can the 
owner be confident that the fee increase will be worthwhile? Explain. 


One of the conditions for calculating the confidence interval in part (b) is that 
npA>10"P = 10 and n(1-p)=10.(1 — p) 2 10. Explain why it is necessary to 
check this condition. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 8 Review 


Section 8.1: Confidence Intervals: The Basics 


In this section, you learned that a point estimate is the single best guess for the value of a 
population parameter. You also learned that a confidence interval provides an interval of 
plausible values for a parameter. To interpret a confidence interval, say, “We are C% confident 
that the interval from ___ to__ captures the [parameter in context],” where C is the 
confidence level of the interval. 


The confidence level C describes the percentage of confidence intervals that we expect to 
capture the value of the parameter in repeated sampling. To interpret a C% confidence level, 
say, “If we took many samples and used them to construct C% confidence intervals, about C% 
of those intervals would capture the [parameter in context].” 


Confidence intervals are formed by including a margin of error on either side of the point 
estimate. The size of the margin of error is determined by several factors, including the 
confidence level C and the sample size n. Increasing the sample size n makes the standard 
deviation of our estimate smaller, decreasing the margin of error. Increasing the confidence 
level C makes the margin of error larger, to ensure that the capture rate of the interval increases 
to C%. Remember that the margin of error only accounts for sampling variability—it does not 
account for any bias in the data collection process. 


Section 8.2: Estimating a Population Proportion 


In this section, you learned how to construct and interpret confidence intervals for a population 
proportion. Several important conditions must be met for this type of confidence interval to be 
valid. First, the data used to calculate the interval must come from a random sample from the 
population of interest (the Random condition). When the sample is taken without replacement 
from the population, the sample size should be less than 10% of the population size (the 10% 
condition). Finally, the observed number of successes np”? and observed number of failures 
n(1-pryr( ~P) must both be at least 10 (the Large Counts condition). 


The formula for calculating a confidence interval for a population proportion is 


i ee Je ee 
pxrz \ —_—— 
p\+z*p\(1-p/)n . 


where p/P is the sample proportion, z* is the critical value, and n is the sample size. The value 
of z* is based on the confidence level C. To find z*, use Table A or technology to determine the 
values of z* and —z* that capture the middle C% of the standard Normal distribution. 

The four-step process (State, Plan, Do, Conclude) is perfectly suited for problems that ask 
you to construct and interpret a confidence interval. You should state the parameter you are 
estimating and the confidence level, plan your work by naming the type of interval you will use 


and checking the appropriate conditions, do the calculations, and make a conclusion in the 
context of the problem. You can use technology for the Do step, but make sure that you 
identify the procedure you are using and type in the values correctly. 

Finally, an important part of planning a study is determining the size of the sample to be 
selected. The necessary sample size is based on the confidence level, the proportion of 
successes, and the desired margin of error. To calculate the minimum sample size, solve the 
following inequality for n, where p/P is a guessed value for the sample proportion: 

p(1—p) 

n 


mr 
z*p\(1-p“)n<ME V 


< ME 


If you do not have an approximate value of pip from a previous study or a pilot study, use 
p\=0.5P = 9.5 to determine the sample size that will yield a value less than or equal to the 
desired margin of error. 


Section 8.3: Estimating a Population Mean 
In this section, you learned how to construct and interpret confidence intervals for a population 
mean. The Random and 10% conditions are the same as those for proportions. There’s one new 
condition for means: the population must be Normally distributed or the sample size must be at 
least 30 (the Normal/Large Sample condition). If the population distribution’s shape is 
unknown and the sample size is less than 30, graph the sample data and check for strong 
skewness or outliers. If there is no strong skewness or outliers, it is reasonable to assume that 
the population distribution is approximately Normal. 

The formula for calculating a confidence interval for a population mean is 

Ete 


x +t*sxn Jn 


where x = is the sample mean, ¢* is the critical value, s, is the sample standard deviation, and 


nis the sample size. We use at critical value instead of a z critical value when the population 
standard deviation is unknown—which is almost always the case. The value of t* is based on 
the confidence level C and the degrees of freedom (df = n — 1). To find t*, use Table B or 
technology to determine the values of t* and —t* that capture the middle C% of the appropriate 
t distribution. 

You also learned how to estimate the sample size when planning a study, as in Section 8.2. 
To calculate the minimum sample size, solve the following inequality for n, where o is a 
guessed value for the population standard deviation: 


Confidence interval for p Confidence interval for py 
Name (TI-83/84) One-sample z interval for p (1- One-sample t interval for y 
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random sample from the 

population of interest. 

o 10%: When sampling without 
replacement, n < 0.10N. 


Large Counts: Both np"? and 
n(1-p’)"(1 — P) are at least 10. 
That is, the number of Successes 
and the number of failures in the 
sample are both at least 10. 


(TInterval) 
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e Random: The data come from a e Random: The data come froma 


random sample from the 
population of interest. 


o 10%: When sampling without 
replacement, n < 0.10N. 


Normal/Large Sample: The 
population has a Normal 
distribution or the sample size is 
large (n = 30). If the population 
distribution has unknown shape 
and n < 30, use a graph of the 
sample data to assess the 
Normality of the population. Do 
not use t procedures if the graph 
shows strong skewness or 


What Did You Learn? 


Learning Target 


Identify an appropriate point 
estimator and calculate the value 
of a point estimate. 


Interpret a confidence interval in 
context. 


Determine the point estimate and 
margin of error from a confidence 
interval. 


Use a confidence interval to make 
a decision about the value of a 
parameter. 


Interpret a confidence level in 
context. 


Describe how the sample size and 
confidence level affect the margin 
of error. 


outliers. 
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for constructing a confidence 
interval for a population 
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calculating a C% confidence 
interval for a population proportion 
using a table or technology. 


Construct and interpret a 
confidence interval for a 
population proportion. 


Determine the sample size 
required to obtain a C% 
confidence interval fora 
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specified margin of error. 


Determine the critical value for 
calculating a C% confidence 
interval for a population mean 
using a table or technology. 
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confidence interval fora 
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Chapter 8 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R8.1 |) It’s critical Find the appropriate critical value for constructing a confidence interval 
in each of the following settings. 


a. Estimating a population proportion p at a 94% confidence level based on an SRS of 
size 125 


b. Estimating a population mean p at a 99% confidence level based on an SRS of size 58 

R8.2 |») Batteries A company that produces AA batteries tests the lifetime of a random 
sample of 30 batteries using a special device designed to imitate real-world use. Based on 
the testing, the company makes the following statement: “Our AA batteries last an 
average of 430 to 470 minutes, and our confidence in that interval is 95%.”2® 


a. Determine the point estimate, margin of error, standard error, and sample standard 
deviation. 


b. A reporter translates the statistical announcement into “plain English” as follows: 
“95% of this company’s AA batteries last between 430 and 470 minutes.” Comment 
on this interpretation. 


c. Your friend, who has just started studying statistics, claims that there is a 95% 
probability that the mean lifetime will fall between 430 and 470 minutes. Do you 
agree? Explain your reasoning. 


d. Give a statistically correct interpretation of the confidence level that could be 
published in a newspaper report. 


R8.3 p) We love football! A Gallup poll conducted telephone interviews with a random 


sample of adults aged 18 and older. Data were obtained for 1000 people. Of these, 370 
said that football is their favorite sport to watch on television. 


a. Define the parameter p in this setting. 


b. What point estimator will you use to estimate p? What is the value of the point 
estimate? 


c. Do you believe that the value of the point estimate is equal to the value of p? Explain 
your answer. 


R8.4 ») Running red lights A random digit dialing telephone survey of 880 drivers asked, 


“Recalling the last ten traffic lights you drove through, how many of them were red when 
you entered the intersections?” Of the 880 respondents, 171 admitted that at least one 


light had been red.24 
a. Construct and interpret a 95% confidence interval for the population proportion. 


b. Nonresponse is a practical problem for this survey—only 21.6% of calls that reached 
a live person were completed. Another practical problem is that people may not give 
truthful answers. What is the likely direction of the bias: Do you think more or fewer 
than 171 of the 880 respondents really ran a red light? Why? Are these sources of bias 
included in the margin of error? 
R8.5 \_») Engine parts A random sample of 16 of the more than 200 auto engine crankshafts 


produced in one day was selected. Here are measurements (in millimeters) of a critical 
component on these crankshafts: 


224.120 224.001 224.017 223.982 223.989 223.961 223.960 224.089 
223.987 223.976 223.902 223.980 224.098 224057 223.913 223.999 


a. Construct and interpret a 95% confidence interval for the mean length of this 
component on all the crankshafts produced on that day. 


b. The mean length is supposed to be p = 224 mm but can drift away from this target 
during production. Does your interval from part (a) suggest that the mean has drifted 
from 224 mm? Explain your answer. 


R8.6 ») Do you go to church? The Gallup Poll plans to ask a random sample of adults 


whether they attended a religious service in the last 7 days. How large a sample would be 
required to obtain a margin of error of at most 0.01 in a 99% confidence interval for the 
population proportion who would say that they attended a religious service? 

R8.7 \») Good wood? A lab supply company sells pieces of Douglas fir 4 inches long and 


1.5 inches square for force experiments in science classes. From experience, the strength 
of these pieces of wood follows a distribution with standard deviation 3000 pounds. You 
want to estimate the mean load needed to pull apart these pieces of wood to within 1000 
pounds with 95% confidence. How large a sample is needed? 


R8.8 |») It’s about ME Explain how each of the following would affect the margin of error 
of a confidence interval, if all other things remained the same. 
a. Increasing the confidence level 
b. Quadrupling the sample size 

R8.9 |p) t time When constructing confidence intervals for a population mean, we almost 


always use critical values from a t distribution rather than the standard Normal 
distribution. 


a. When is it necessary to use a ¢ critical value rather than a z critical value when 
constructing a confidence interval for a population mean? 


b. For a particular level of confidence, explain what happens to the ¢ critical values as 


the degrees of freedom increase. 


Chapter 8 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T8.1 The Gallup Poll interviews 1600 people. Of these, 18% say that they jog regularly. The 
news report adds: “The poll had a margin of error of plus or minus three percentage 
points at a 95% confidence level.” You can safely conclude that 


a. 


95% of all Gallup Poll samples like this one give answers within +3% of the true 
population value. 


. the percent of the population who jog is certain to be between 15% and 21%. 


95% of the population jog between 15% and 21% of the time. 


. we can be 95% confident that the sample proportion is captured by the confidence 


interval. 


if Gallup took many samples, 95% of them would find that 18% of the people in the 
sample jog. 


T8.2 The weights (in pounds) of three adult males are 160, 215, and 195. What is the 
standard error of the mean for these data? 


a. 190 
b. 

c 
d. 


e. 


27.84 
22.73 
16.07 
13.13 


T8.3 In preparing to construct a one-sample t interval for a population mean, suppose we are 
not sure if the population distribution is Normal. In which of the following 
circumstances would we not be safe constructing the interval based on an SRS of size 
14 from the population? 


a. 
b. 


Cc. 


d. 


e. 


A stemplot of the data is roughly bell-shaped. 
A histogram of the data shows slight skewness. 


A boxplot shows that the values above the median are much more variable than the 
values below the median. 


The sample standard deviation is large. 


The sample standard deviation is small. 


T8.4 Many television viewers express doubts about the validity of certain commercials. In 
an attempt to answer their critics, Timex Group USA wishes to estimate the true 


proportion p of all consumers who believe what is shown in Timex television 
commercials. What is the smallest number of consumers that Timex can survey to 
guarantee a margin of error of 0.05 or less at a 99% confidence level? 


a. 550 
b. 600 
c. 650 
d. 700 
e. 750 


T8.5 You want to compute a 90% confidence interval for the mean of a population with an 
unknown population standard deviation. The sample size is 30. What critical value 
should you use for this interval? 


a. 1.645 

b. 1.699 

c. 1.697 

d. 1.96 

e. 2.045 

T8.6 A radio talk show host with a large audience is interested in the proportion p of adults 

in his listening area who think the drinking age should be lowered to eighteen. To find 
this out, he poses the following question to his listeners: “Do you think that the 
drinking age should be reduced to eighteen in light of the fact that 18-year-olds are 
eligible for military service?” He asks listeners to go to his website and vote “Yes” if 
they agree the drinking age should be lowered and “No” if not. Of the 100 people who 
voted, 70 answered “Yes.” Which of the following conditions are violated? 

I. Random 

II. 10% 
Ill. Large Counts 

a. I only 

b. IL only 

c. I only 

d. I and II only 

e. I, Il, and III 

T8.7 A 90% confidence interval for the mean p of a population is computed from a random 


sample and is found to be 90 + 30. Which of the following could be the 95% 
confidence interval based on the same data? 


a. 90+21 


. 90 + 30 


90 + 39 


. 90 + 70 


Without knowing the sample size, any of the above answers could be the 95% 
confidence interval. 


T8.8 Suppose we want a 90% confidence interval for the average amount spent on books by 
freshmen in their first year at a major university. The interval is to have a margin of 
error of at most $2. Based on last year’s book sales, we estimate that the standard 
deviation of the amount spent will be close to $30. The number of observations 
required is closest to which of the following? 


a. 25 
b. 

C. 
d. 


e. 


30 

225 
609 
865 


T8.9 A telephone poll of an SRS of 1234 adults found that 62% are generally satisfied with 
their lives. The announced margin of error for the poll was 3%. Does the margin of 
error account for the fact that some adults do not have telephones? 


a. 


b. 


c 
d. 


e. 


Yes; the margin of error accounts for all sources of error in the poll. 


Yes; taking an SRS eliminates any possible bias in estimating the population 
proportion. 


Yes; the margin of error accounts for undercoverage but not nonresponse. 
No; the margin of error accounts for nonresponse but not undercoverage. 


No; the margin of error only accounts for sampling variability. 


T8.10 A Census Bureau report on the income of Americans says that with 90% confidence the 
median income of all U.S. households in a recent year was $57,005 with a margin of 
error of $742. Which of the following is the most appropriate conclusion? 


a. 90% of all households had incomes in the interval $57,005 + $742. 


. We can be sure that the median income for all households in the country lies in the 


interval $57,005 + $742. 


90% of the households in the sample interviewed by the Census Bureau had incomes 
in the interval $57,005 + $742. 


. The Census Bureau got the result $57,005 + $742 using a method that will capture 


the true median income 90% of the time when used repeatedly. 


90% of all possible samples of this same size would result in a sample median that 
falls within $742 of $57,005. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T8.11 The U.S. Forest Service is considering additional restrictions on the number of vehicles 
allowed to enter Yellowstone National Park. To assess public reaction, the service asks 
a random sample of 150 visitors if they favor the proposal. Of these, 89 say “Yes.” 


a. Construct and interpret a 99% confidence interval for the proportion of all visitors to 
Yellowstone who favor the restrictions. 


b. Based on your work in part (a), is there convincing evidence that more than half of 
all visitors to Yellowstone National Park favor the proposal? Justify your answer. 


T8.12 Many people have asked the question, but few have been patient enough to collect the 
data. How many licks does it take to get to the center of a Tootsie Pop? After some 
intense research, a researcher revealed a 95% confidence interval for the mean number 
of licks to be 317.64 licks to 394.56 licks.28 


a. Interpret the confidence interval. 
b. Calculate the point estimate and margin of error used to construct the interval. 


c. Name two things the researcher could do to decrease the margin of error. Discuss a 
drawback of each. 


T8.13 A milk processor monitors the number of bacteria per milliliter in raw milk received at 
the factory. A random sample of 10 one-milliliter specimens of milk supplied by one 
producer gives the following data: 


5370 4890 5100 4500 5260 5150 4900 4760 4700 4870 


Construct and interpret a 90% confidence interval for the population mean p. 


Chapter 9 Testing a Claim 
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Chapter 9 Review Exercises 
Chapter 9 AP® Statistics Practice Test 


INTRODUCTION 


Confidence intervals are one of the two most common methods of statistical inference. You 
can use a confidence interval to estimate a population parameter, like the proportion of all U.S. 
adults who exercise regularly or the true mean amount of time that students at a large university 
spent on social media yesterday. 


What if we want to test a claim about a parameter? For instance, the U.S. Bureau of Labor 
Statistics claims that the national unemployment rate in March 2017 was 4.5%. A citizens’ 
group suspects that the actual rate was higher. The second common method of inference, called 
a significance test, allows us to weigh the evidence in favor of or against a particular claim. 


DEFINITION Significance test 


A significance test is a formal procedure for using observed data to decide 
between two competing claims (called hypotheses). The claims are usually 
statements about a parameter, like the population proportion p or the population 
mean Uw. 


A significance test is sometimes referred to as a test of significance, a hypothesis test, or a test 


of hypotheses. 


Here is an activity that illustrates the reasoning of a significance test. 


ACTIVITY I’m a great free-throw shooter! 


In this activity, you and your classmates will perform a simulation to test a claim about a 
population proportion. 

A basketball player claims to make 80% of the free throws that he attempts. That is, he 
claims p=0.80? = 9.80, where p is the true proportion of free throws he will make in the long 
run. We suspect that he is exaggerating and that p<0.80P < 9-80, 

Suppose the player shoots 50 free throws and makes 32 of them. His sample proportion 
of made shots is p\=32/50=0.64? = 32/50 = 0.64 This result gives some evidence that the 
player really makes less than 80% of his free throws in the long run. But do we have 
convincing evidence that p<0.80?P < 9-80? Or is it plausible that an 80% shooter will have a 
performance this poor by chance alone? You can use a simulation to find out. 


otnaydur/Shutterstock 


1. Using the pie chart provided by your teacher, label the 80% region “made shot” and the 
20% region “missed shot.” Straighten out one of the ends of a paper clip so that there is a 
loop on one side and a pointer on the other. On a flat surface, place a pencil through the 
loop, and put the tip of the pencil on the center of the pie chart. Then flick the paper clip 
and see where the pointed end lands: made shot or missed shot. 


2. Flick the paper clip a total of 50 times, and count the number of times that the pointed end 
lands in the “made shot” region. 


3. Compute the sample proportion p/? of made shots in your simulation from Step 2. Plot 
this value on the class dotplot drawn by your teacher. 


4. Repeat Steps 2 and 3 as needed to get at least 40 trials of the simulation for your class. 


5. Based on the class’s simulation results, how likely is it for an 80% shooter to make 64% 
or less when he shoots 50 free throws? 


6. Based on your answer to Question 5, does the observed pA=0.64P = 0.64 result give 
convincing evidence that the player is exaggerating? Or is it plausible that an 80% shooter 
can have a performance this poor by chance alone? 


In the activity, the shooter made only 32 of 50 free-throw attempts (p=0.64)(P = 0.64). 


There are two possible explanations for why he made less than 80% of his shots: 


(p = 0.80) and his bad performance happened by chance 


. The player’s claim is true (p=0.80) 
alone. 


p < 0.80) 


’. The player’s claim is false (p<0.80)! . That is, the population proportion is less than 


0.80 so the sample result is not an unlikely outcome. 

If explanation 1 is plausible, then we don’t have convincing evidence that the shooter is 
exaggerating—his poor performance could have occurred purely by chance. However, if it is 
unlikely for an 80% shooter to get a proportion of 0.64 or less in 50 attempts, then we can rule 
out explanation 1. 

We used software to simulate 400 sets of 50 shots, assuming that the player is really an 
80% shooter. Figure 9.1 shows a dotplot of the results. Each dot on the graph represents the 
sample proportion p/? of made shots in one set of 50 attempts. 


In 400 sets of 50 shots, 
there were only 3 sets 
when our shooter made 
less than or equal Lo 
the observed p = 0.64. 

a 
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FIGURE 9.1 Dotplot of the simulated sampling distribution of p? , the proportion of free throws 
made by an 80% shooter in a sample of 50 shots. 


sample proportion of made shots 


The simulation shows that it would be very unlikely for an 80% free-throw shooter to make 
32 or fewer out of 50 free throws (p<0.64)(P < 0.64) just by chance. This gives us convincing 
evidence that the player is less than an 80% shooter. 

Section 9.1 focuses on the underlying logic of significance tests. Once the foundation is 
laid, we consider the implications of using these tests to make decisions—about everything 
from free-throw shooting to the effectiveness of a new drug. In Section 9.2, we present the 
details of performing a test about a population proportion. Section 9.3 shows how to test a 
claim about a population mean. Along the way, we examine the connection between confidence 
intervals and significance tests. 


SECTION 9.1 Significance Tests: The Basics 


LEARNING TARGETS By the end of the section, you should be able to: 


State appropriate hypotheses for a significance test about a population 
parameter. 


Interpret a P-value in context. 


Make an appropriate conclusion for a significance test. 


Interpret a Type | error and a Type II error in context. Give a consequence of 
each error in a given setting. 


Ass noted in the definition, a significance test starts with a careful statement of the claims we 
want to compare. Let’s take a closer look at how to state these claims. 


Stating Hypotheses 


In the free-throw shooter activity, the player claims that his long-run proportion of made free 
throws is p=0.80P = 9-80. This is the claim we seek evidence against. We call it the null 
hypothesis, abbreviated HO. Usually, the null hypothesis is a statement of “no difference.” 
For the free-throw shooter, no difference from what he claimed gives HO:p=0.80/40 : p = 0.80, 


Remember: The null hypothesis is the dull hypothesis! 


The claim we hope or suspect to be true instead of the null hypothesis is called the 
alternative hypothesis. We abbreviate the alternative hypothesis as Ha/2a. In this case, we 


suspect the player might be exaggerating, so our alternative hypothesis is Ha: p<0.80 
H, : p< 0.80. 


DEFINITION Null hypothesis HO, Alternative hypothesis Ha/« 


The claim that we weigh evidence against in a significance test is called the null 
hypothesis (HO).(#o) - 


The claim that we are trying to find evidence for is the alternative hypothesis 
(Ha).(#fa) - 


Some people insist that all three possibilities—greater than, less than, and equal to—should be 


accounted for in the hypotheses. For the free-throw shooter example, because the alternative 
hypothesis is Ha: p<0.80,4a > p< 0.80, they would write the null hypothesis as H0:p20.80. 


Hy : p = 0.80. tn spite of the mathematical appeal of covering all three cases, we use only the 
value p=0.80P = 0.80 in our calculations. So we'll stick with HO:p=0.80./40 : pp = 0.80. 


In the free-throw shooter example, our hypotheses are 
HO: p=0.80/40 : p= 0.80 
Ha: p<0.804a : p < 0.80 


where p is the true proportion of free throws he will make in the long run. The alternative 


hypothesis is one-sided(p<0.80)(P < 0.80) because we suspect the player makes less than 80% 
of his free throws. If you suspect that the true value of a parameter may be either greater than 
or less than the null value, use a two-sided alternative hypothesis. 


DEFINITION One-sided and two-sided alternative hypothesis 


The alternative hypothesis is one-sided if it states that a parameter is greater 
than the null value or if it states that the parameter is /ess than the null value. 


The alternative hypothesis is two-sided if it states that the parameter is different 
from the null value (it could be either greater than or less than). 


The null hypothesis has the form HO:parameter=null value/4o : parameter = null value ~ 


one-sided alternative hypothesis has one of the forms Ha: parameter<null value 
H, : parameter < null value oy Ha: parameter>null value/a : parameter > null value A two- 


sided alternative hypothesis has the form Ha: parameter4null value/4« : parameter # null value 
. To determine the correct form of Ha/a, read the problem carefully. 


It is common to refer to a significance test with a one-sided alternative hypothesis as a one- 
sided test or one-tailed test and to a test with a two-sided alternative hypothesis as a two-sided 
test or two-tailed test. 


EXAMPLE | Juicy pineapples (>) 
Stating hypotheses 


PROBLEM: At the Hawaii Pineapple Company, managers are interested in the size of the 
pineapples grown in the company’s fields. Last year, the mean weight of the pineapples 
harvested from one large field was 31 ounces. A different irrigation system was installed in 
this field after the growing season. Managers wonder if this change will affect the mean 
weight of pineapples grown in the field this year. 


State appropriate hypotheses for performing a significance test. Be sure to define the 
parameter of interest. 


SOLUTION: 
Ho:p=3140 ‘p= 31 
Ha:p431 4a 7px 31 
where p = the true mean weight of all pineapples grown in the field this year. 


Because managers wonder if the mean weight of this year’s pineapples will differ from 


last year’s mean weight of 31 ounces, the alternative hypothesis is two-sided. 


FOR PRACTICE, TRY EXERCISE 1 


@ The hypotheses should express the belief or suspicion we have before we see the 
data. It is cheating to look at the data first and then frame the alternative hypothesis to fit what 
the data show. For example, the data for the pineapple study showed that x _=31.935¢ = 31.935 
ounces for a random sample of 50 pineapples grown in the field this year. You should not 
change the alternative hypothesis to Ha:p>314a >> 31 after looking at the data. 


AP® EXAM TIP 


Hypotheses always refer to a population, not to a sample. Be sure to state HOMo 
and Hae in terms of population parameters. It is never correct to write a 


hypothesis about a sample statistic, such as HO: p*=0.80/40 : P = 0.80 or Ha:x 431 
A, :% #31. 


CHECK YOUR UNDERSTANDING 


Sy 


y For each of the following settings, state appropriate hypotheses for performing a 
significance test. Be sure to define the parameter of interest. 


1. According to the National Sleep Foundation, 85% of teens are getting too little sleep on 
school nights. Jannie wonders whether this result holds in her large high school. She asks 
an SRS of 100 students at the school how much sleep they get on a typical night. In all, 75 
of the students are getting less than the recommended amount of sleep. 


2. As part of its marketing campaign for the 2010 census, the U.S. Census Bureau advertised 
“10 questions, 10 minutes—that’s all it takes.” On the census form itself, we read, “The 
U.S. Census Bureau estimates that, for the average household, this form will take about 10 
minutes to complete, including the time for reviewing the instructions and answers.” We 
suspect that the time it takes to complete the form may be longer than advertised. 


Interpreting P-values 


It may seem strange to you that we state a null hypothesis and then try to find evidence against 
it. Maybe it would help to think about how a criminal trial works in the United States. The 
defendant is “innocent until proven guilty.” That is, the null hypothesis is innocence and the 
prosecution must offer convincing evidence against this hypothesis and in favor of the 
alternative hypothesis: guilt. That’s exactly how significance tests work, although in statistics 
we deal with evidence provided by data and use a probability to say how strong the evidence is. 

In the free-throw shooter activity at the beginning of the chapter, a player who claimed to 
make 80% of his free throws made only p\=32/50=0.64P = 32/50 = 0.64 in a random sample 
of 50 free throws. This is evidence against the null hypothesis that p=0.80? = 9-89 and in favor 
of the alternative hypothesis p<0.80? < 9-80, But is the evidence convincing? To answer this 
question, we want to know how likely it is for an 80% shooter to make 64% or less by chance 
alone in a random sample of 50 attempts. This probability is called a P-value. 


DEFINITION P-value 


The P-value of a test is the probability of getting evidence for the alternative 
hypothesis Ha/« as strong or stronger than the observed evidence when the null 
hypothesis HO/2 is true. 


In 400 sets of 50 shots, 
there were only 3 sets 


when our shooter made 
less than or equal to 
the observed p = 0.64. 


3 * «e i s «6 
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p = sample proportion of made shots 


Small P-values give convincing evidence for Ha/4« because they say that the observed 
result is unlikely to occur when HO/“0 is true. Large P-values fail to give convincing evidence 
for Ha/fa because they say that the observed result is likely to occur by chance alone when HO 
Ao is true. 

We used simulation to estimate the P-value for our free-throw shooter: 3/400=0.0075 
3/400 = 0.0075 How do we interpret this P-value? Assuming that the player makes 80% of his 
free throws in the long run, there is about a 0.0075 probability of getting a sample proportion of 
0.64 or less just by chance in a set of 50 shots. 


We’ll show you how to calculate P-values later. For now, let’s focus on interpreting them. 


EXAMPLE | Healthy bones (>) 
Interpreting a P-value 


: Freshest tasti 
within 5 days after 


| Nutrition Facts | 
Serving Size 1 cup (240mL) 


servings Per Container about 8 me 


Calories 130 Calories from Fat 45 
en enna 


Cholesterol 20m 
Sodium 120mg 
Potassium 360mg 


ee VitaminA 10% « Vitamin C 0% 
Calcium 30% = Iron 0% 
Vitamin 0 275% 
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PROBLEM: Calcium is a vital nutrient for healthy bones and teeth. The National Institutes 
of Health (NIH) recommends a calcium intake of 1300 milligrams (mg) per day for 
teenagers. The NIH is concerned that teenagers aren’t getting enough calcium, on average. Is 
this true? Researchers decide to perform a test of 
Ho : wp = 1300 
HO: p=1300Ha: p1<1300 Ma : # < 1300 


where i is the true mean daily calcium intake in the population of teenagers. They ask a 
random sample of 20 teens to record their food and drink consumption for 1 day. The 
researchers then compute the calcium intake for each student. Data analysis reveals that 

x =1198% = 1198 mg and sx=4115x = 411 mg. Researchers performed a significance test 
and obtained a P-value of 0.1404. 


a. Explain what it would mean for the null hypothesis to be true in this setting. 


b. Interpret the P-value. 
SOLUTION: 


a. aa Ho:w=1300!to : pp = 1300 j, true, then the mean daily calcium intake in the 
population of teenagers 1s 1300 mg. 


b. Assuming that the mean daily calcium intake in the teen population is 1300 mg, there 
is a 0.1404 probability of. getting a sample mean of 1198 mg or less just by chance in a 
random sample of 20 teens. 


FOR PRACTICE, TRY EXERCISE 9 


Remember: A P-value measures the strength of evidence for the alternative hypothesis (and 
against the null hypothesis). In the preceding example, the sample mean was x _=1198% = 1198 
mg. This result gives some evidence for Ha: u<13004 a? # < 1300 because 1198<1300 
1198 < 1300. The P-value is the probability of getting evidence for Ha/« as strong or stronger 
than the observed result when HO/0 is true. We can write the P-value as a conditional 
probability. For the “Healthy bones” example, the P-value =P(x <1198| p=1300)=0.1404 
= P(% < 1198) « = 1300) = 0.1404 


When Ha/fe is two-sided (parameter~null value), (Parameter # nullvalue) , values of the 


sample statistic less than or greater than the null value both count as evidence for Ha/a. 
Suppose we want to perform a test of HO: p=0.5440 > p = 0.5 versus Ha: p#0.5/4a ‘pF 0.5 
based on an SRS with a sample proportion of p\=0.65? = 9-65, This result gives some 
evidence for Ha: p#0.544a ‘PF 9.5 because 0.65~0.59-65 # 9-5 Ip this case, evidence for Ha 
Ho as strong or stronger than the observed result includes any value of pAP greater than or 
eet to 0.65 as well as any value of pp less than or equal to 0.35. Why? Because p4=0.35 

= 0.35 is just as different from the null value of p=0.5? = 9-5 as pA=0. 65P = 0.65. For this 


scenario, the P-value is equal to the conditional probability P(p4<0.35 or pA>0.65] p=0.5) 
P (p < 0.35 or p > 0.65| p = 0.5). 


Making Conclusions 


The final step in performing a significance test is to draw a conclusion about the competing 
claims being tested. We make a decision based on the strength of the evidence in favor of the 
alternative hypothesis (and against the null hypothesis) as measured by the P-value. 
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e If the observed result is unlikely to occur by chance alone when H0/40 is true (small P- 
value), we will “reject HO/0.” 


e If the observed result is not unlikely to occur by chance alone when H0/40 is true (large P- 
value), we will “fail to reject HOM0 .” 


This wording may seem unusual at first, but it’s consistent with what happens in a criminal 
trial. Once the jury has weighed the evidence against the null hypothesis of innocence, they 
return one of two verdicts: “guilty” (reject HO) or “not guilty” (fail to reject HOM0). A not- 
guilty verdict doesn’t guarantee that the defendant is innocent, just that there’s not convincing 
evidence of guilt. Likewise, a fail-to-reject HO0 decision in a significance test doesn’t 
guarantee that HO/20 is true. 


HOW TO MAKE A CONCLUSION IN A SIGNIFICANCE TEST 


e If the P-value is small, reject HOM and conclude that there is convincing evidence for 
Hale (in context). 


e If the P-value is not small, fail to reject HO“0 and conclude that there is not convincing 
evidence for Ha/e (in context). 


In the free-throw shooter activity, the estimated P-value was 0.0075. Because the P-value is 
small, we reject HO: p=0.800 : p = 9-80. We have convincing evidence that the player makes 
fewer than 80% of his free throws in the long run. 

For the teen calcium study, the P-value was 0.1404. Because the P-value is not small, we 
fail to reject HO: u=13004 0: # = 1300 We don’t have convincing evidence that teens are 
getting less than 1300 mg of calcium per day, on average. 


How small does a P-value have to be for us to reject HO“0? In Chapter 4 , we suggested 


that you use a boundary of 5% when determining whether a result is statistically significant. 
That is equivalent to saying, “View a P-value less than 0.05 as small.” Choosing this boundary 
value means we require evidence for Ha/« so strong that it would happen less than 5% of the 
time just by chance when HO/40 is true. 

Sometimes it may be preferable to use a different boundary value—like 0.01 or 0.10— 
when drawing a conclusion in a significance test. We will explain why shortly. The chosen 
boundary value is called the significance level. We denote it by a@, the Greek letter alpha. 


DEFINITION Significance level 


The significance level ae is the value that we use as a boundary for deciding 
whether an observed result is unlikely to happen by chance alone when the null 
hypothesis is true. 


When we use a fixed significance level a@ to draw a conclusion in a significance test, here 
are the two possibilities: 


P-value<a — reject HO — convincing evidence for Ha (in context) 
P-value < a — reject Hj — convincing evidence for H, (in context) 


P-value>a — fail to reject HO — not convincing evidence for Ha (in context) 
P-value > a — fail to reject Hy — not convincing evidence for H, (in context) 


If the P-value is less than a@, we say that the result is “statistically significant at the o=a = 
___ level.” 

Significance at the a=0.05a = 0.05 level is often expressed by the statement “The results 
were significant (P<0.05)\? < 0.05) » Here, P stands for the P-value. The P-value is more 
informative than a statement of significance because it describes the strength of evidence for 
the alternative hypothesis. For example, both an observed result with P=0.03P = 0.03 and an 
observed result with P=0.0003P = 0.0003 are significant at the a=0.05e = 0.05 level. But the 
P-value of 0.0003 gives much stronger evidence against HO and in favor of Ha‘ than the 
P-value of 0.03. 


EXAMPLE | Better batteries (») 
Making conclusions 


DSGpro/Getty Images 


PROBLEM: A company has developed a new deluxe AAA battery that is supposed to last 
longer than its regular AAA battery.4 However, these new batteries are more expensive to 
produce, so the company would like to be convinced that they really do last longer. Based on 
years of experience, the company knows that its regular AAA batteries last for 30 hours of 
continuous use, on average. The company selects an SRS of 15 deluxe AAA batteries and 
uses them continuously until they are completely drained. The sample mean lifetime is 

x =33.93% = 33.93 hours. A significance test is performed using the hypotheses 


HO: p=3040 : 4» = 30 

Ha: u>30Ha > > 30 
where pl is the true mean lifetime (in hours) of the deluxe AAA batteries. The resulting P- 
value is 0.0717. What conclusion would you make at the a=0.05a = 0.05 level? 
SOLUTION: 


Because the P-value of 0.0717>a=0.05,0-:0717 > a = 0.05, we fail to reject Hoo. We don’t 
have convincing evidence that the true mean lifetime of the company’s deluxe AAA 
batteries is greater than 30 hours. 


What does the P-value of 0.0717 tell us? Assuming HO:p=3040 : # = 30 js true, there 


is a 0.0717 probability of getting a sample mean of 33.9 hours or more in a random 
sample of 15 batteries. 


FOR PRACTICE, TRY EXERCISE 15 


Beginning users of significance tests generally find it easier to compare a P-value to a 
significance level than to interpret the P-value correctly in context. For that reason, we will 
include stating a significance level as a required part of every significance test. We’ll also ask 
you to explain what a P-value means in a variety of settings. Just remember that the P-value 


measures the strength of evidence for the alternative hypothesis and against the null hypothesis. 


Be careful how you write conclusions when the P-value is large. Don’t conclude that the 
null hypothesis is true just because we didn’t find convincing evidence for the alternative 
hypothesis. For example, it would be incorrect to conclude that the company’s deluxe AAA 
batteries last exactly 30 hours, on average. We found some evidence that the new batteries last 
longer, but the evidence wasn’t convincing enough to reject HO/0. Oo Never “accept H04o” 


or conclude that H0O/0 is true! That would be like the jury in a criminal trial declaring the 
defendant “innocent” when they really mean “not guilty 


39 
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This is consistent with what you learned in Chapter 8: a confidence interval that includes the 
value 30 would include many other plausible values for p also. 


AP® EXAM TIP 


We recommend that you follow the two-sentence structure from the example when 
writing the conclusion to a significance test. The first sentence should give a 


decision about the null hypothesis—reject HO40 or fail to reject HO/40—based on 
an explicit comparison of the P-value to a stated significance level. The second 
sentence should provide a statement about whether or not there is convincing 
evidence for Ha/e in the context of the problem. 


In practice, the most commonly used significance level is a=0.050 = 0.05, This is mainly 
due to Sir Ronald A. Fisher, a famous statistician who worked on agricultural experiments in 
England during the early twentieth century. Fisher was the first to suggest deliberately using 
random assignment in an experiment. In a paper published in 1926, Fisher wrote that it is 
convenient to draw the line at about the level at which we can say: “Either there is something in 
the treatment, or a coincidence has occurred such as does not occur more than once in twenty 


trials.”2 


When a researcher plans to draw a conclusion based on a significance level, og a@ should 


be stated before the data are produced. Otherwise, a deceptive user of statistics might choose 
aa after the data have been analyzed in an attempt to manipulate the conclusion. This is just as 
inappropriate as choosing an alternative hypothesis after looking at the data. 


Type I and Type Il Errors 


When we draw a conclusion from a significance test, we hope our conclusion will be correct. 
But sometimes it will be wrong. There are two types of mistakes we can make: a Type I error 


or a Type II error. 


DEFINITION Type! error, Type Il error 


A Type l error occurs if a test rejects HO40 when HO40 is true. That is, the test 
finds convincing evidence that Ha/Zc is true when it really isn’t. 


A Type Il error occurs if a test fails to reject HO/40 when Ha/« is true. That is, the 
test does not find convincing evidence that Ha/« is true when it really is. 


The possible outcomes of a significance test are summarized in Figure 9.2. 


Truth about the population 


Hy true H, true 


; . Correct 
‘ . Reject Hy, | Type Lerror sroncliiic 
Conclusion conciusion 
based on 
sample ee ; Correct mn 
P Fail to reject Hy coc nist he Iype HH error 


FIGURE 9.2 The two types of errors in significance tests. 


If HO/0 is true: 
¢ Our conclusion is correct if we don’t find convincing evidence that Hae is true. 


e We make a Type | error if we find convincing evidence that Hae is true. 


If Ha/e is true: 
e Our conclusion is correct if we find convincing evidence that Ha/o is true. 


e We make a Type II error if we do not find convincing evidence that Hae is true. 


Only one error is possible at a time, depending on the conclusion we make. 


Here’s a helpful reminder to keep the two types of errors straight. “Fail to” goes with Type II. 


It is important to be able to describe Type I and Type II errors in the context of a problem. 
Considering the consequences of each of these types of error is also important, as the following 
example shows. 


EXAMPLE | Perfect potatoes (>) 
Type | and Type Il errors 


Steve Cukrov/Alamy Stock Photo 


PROBLEM: A potato chip producer and its main supplier agree that each shipment of 
potatoes must meet certain quality standards. If the producer determines that more than 8% 
of the potatoes in the shipment have “blemishes,” the truck will be sent away to get another 
load of potatoes from the supplier. Otherwise, the entire truckload will be used to make 
potato chips. To make the decision, a supervisor will inspect a random sample of 500 
potatoes from the shipment. The producer will then perform a test at the a=0.05a = 0.05 
significance level of 


HO: p=0.08440 : p = 0.08 
Ha: p>0.08 4a :p> 0.08 


where p = the true proportion of potatoes with blemishes in a given truckload. Describe a 
Type I and a Type II error in this setting, and give a possible consequence of each. 


SOLUTION: 


TT ype 1 error: The producer finds convincing evidence that more than 8% of the potatoes in 
the shipment have blemishes, when the true proportion is really 0.08. 

Consequence: The potato-chip producer sends away the truckload of acceptable potatoes, 
wasting time and depriving the supplier of money. 

TT ype 1] error: The producer does not find convincing evidence that more than 8% of the 
potatoes in the shipment have blemishes, when the true proportion ts greater than 0.08. 
Consequence: More potato chips are made with blemished potatoes, which may upset 
customers and lead to decreased sales. 


FOR PRACTICE, TRY EXERCISE 23 


Which is more serious: a Type I error or a Type II error? That depends on the situation. For 
the potato-chip producer, a Type II error seems more serious because it may lead to lower- 
quality potato chips and decreased sales. 

The most common significance levels are a=0.05a = 0.05, a=0.01a = 0.01, and a=0.10 
a = 0.10, Which is the best choice for a given significance test? That depends on whether a 
Type I error or a Type II error is more serious. 


In the “Perfect potatoes” example, a Type I error occurs if the true proportion of blemished 
potatoes in a shipment is p=0.08? = 9-08 but we get a value of the sample proportion p/P? 
small enough to yield a P-value less than a=0.05e = 0.05, When H0/40 is true, this will 
happen 5% of the time just by chance. In other words, P(Type I error)=a= @. 


TYPE | ERROR PROBABILITY 


The probability of making a Type I error in a significance test is equal to the significance 
level aa. 


We can decrease the probability of making a Type I error in a significance test by using a 
smaller significance level. For instance, the potato-chip producer could use a=0.010 = 0.01 
instead of a=0.05a = 0.05, But there is a trade-off between P(Type I error) and P(Type II 
error): as one increases, the other decreases. If we make it more difficult to reject HOM0 by 
decreasing o@, we increase the probability that we will not find convincing evidence for Ha/o 
when it is true. That’s why it is important to consider the possible consequences of each type of 
error before choosing a significance level. 


CHECK YOUR UNDERSTANDING 


i. 


,, The manager of a fast-food restaurant wants to reduce the proportion of drive-thru 
customers who have to wait longer than 2 minutes to receive their food after placing an 
order. Based on store records, the proportion of customers who had to wait longer than 2 
minutes was p=0.63? = 9-63, To reduce this proportion, the manager assigns an additional 


employee to drive-thru orders. During the next month, the manager collects a random sample 


Rs. 144 — 
of 250 drive-thru times and finds that p\=144250=0.576" ~ 250 ~ 0.576 the manager then 


performs a test of the following hypotheses at the a=0.10e = 0.10 significance level: 
HO:p=0.63 4o : p = 0.63 
Ha:p<0.63 Ha : p < 0.63 


where p=? ~ the true proportion of drive-thru customers who have to wait longer than 2 
minutes to receive their food. 


1. Describe a Type I error and a Type II error in this setting. 
2. Which type of error is more serious in this case? Justify your answer. 


3. Based on your answer to Question 3, do you agree with the company’s choice of a=0.10? 
a = 0.10? Why or why not? 


4. The P-value of the manager’s test is 0.0385. Interpret the P-value. 


Section 9.1 Summary 


e A significance test is a procedure for using observed data to decide between two competing 
claims, called hypotheses. The hypotheses are often statements about a parameter, like the 
population proportion p or the population mean p. 


e The claim that we weigh evidence against in a significance test is called the null hypothesis 
(Ho)\4o), The null hypothesis has the form HO: parameter=null value. 
Hp : parameter = null value. 

e The claim about the population that we are trying to find evidence for is the alternative 
hypothesis (Ha)| 4a ). 
: A one-sided alternative hypothesis has the form Ha:parameter<null value 


H, : parameter < null value oy Ha:parameter>null value/fa : parameter > null value. 


. A two-sided alternative hypothesis has the form Ha:parameter“null value 
H,, : parameter # null value. 


° Often, HO/0 is a statement of no change or no difference. The alternative hypothesis states 
what we hope or suspect is true. 


e The P-value of a test is the probability of getting evidence for the alternative hypothesis Ha 
Ho that is as strong as or stronger than the observed evidence when the null hypothesis HO 
Ao is true. 


e Small P-values are evidence against the null hypothesis and for the alternative hypothesis 
because they say that the observed result is unlikely to occur when HO/40 is true. To 
determine if a P-value should be considered small, we compare it to the significance level a 
Q, 

e We make a conclusion in a significance test based on the P-value. 


» If P-value <a:< @ : Reject HO0 and conclude that there is convincing evidence for Ha/fa 
(in context). 


» If P-value >a:2 © : Fail to reject HOM0 and conclude that there is not convincing evidence 
for Ha/fe (in context). 


e When we make a conclusion in a significance test, there are two kinds of mistakes we can 
make. 


: A Type I error occurs if we reject Hoo when it is, in fact, true. In other words, the data 
give convincing evidence for Ha/4« when the null hypothesis is correct. 


. A Type II error occurs if we fail to reject HO/40 when Ha/q is true. In other words, the 
data don’t give convincing evidence for Ha/4«, even though the alternative hypothesis is 


correct. 


The probability of making a Type I error is equal to the significance level aa. There is a 
trade-off between P(Type I error) and P(Type II error): as one increases, the other decreases. 
So it is important to consider the possible consequences of each type of error before 
choosing a significance level. 


Section 9.1 Exercises 


In Exercises 1-6, state appropriate hypotheses for performing a significance test. Be sure to 
define the parameter of interest. 


1. 


pg 555 |») No homework? Mr. Tabor believes that less than 75% of the students at his 


school completed their math homework last night. The math teachers inspect the 
homework assignments from a random sample of 50 students at the school. 


Don’t argue! A Gallup poll report revealed that 72% of teens said they seldom or never 
argue with their friends.2 Yvonne wonders whether this result holds true in her large high 
school, so she surveys a random sample of 150 students at her school. 


How much juice? One company’s bottles of grapefruit juice are filled by a machine that 
is set to dispense an average of 180 milliliters (ml) of liquid. A quality-control inspector 
must check that the machine is working properly. The inspector takes a random sample of 
AO bottles and measures the volume of liquid in each bottle. 


Attitudes The Survey of Study Habits and Attitudes (SSHA) is a psychological test that 
measures students’ attitudes toward school and study habits. Scores range from 0 to 200. 
Higher scores indicate better attitudes and study habits. The mean score for U.S. college 
students is about 115. A teacher suspects that older students have better attitudes toward 
school, on average. She gives the SSHA to an SRS of 45 of the over 1000 students at her 
college who are at least 30 years of age. 


Cold cabin? During the winter months, the temperatures at the Starneses’ Colorado cabin 
can stay well below freezing (32°F32°F or 0°CO°C) for weeks at a time. To prevent the 
pipes from freezing, Mrs. Starnes sets the thermostat at 50°F.50°F. The manufacturer 
claims that the thermostat allows variation in home temperature of o=3°F.¢ = 3 F. Mrs. 
Starnes suspects that the manufacturer is overstating the consistency of the thermostat. 


Ski jump When ski jumpers take off, the distance they fly varies considerably depending 
on their speed, skill, and wind conditions. Event organizers must position the landing area 
to allow for differences in the distances that the athletes fly. For a particular competition, 
the organizers estimate that the variation in distance flown by the athletes will be o=10 

o = 10 meters. An experienced jumper thinks that the organizers are underestimating the 
variation. 


In Exercises 7 and 8, explain what’s wrong with the stated hypotheses. Then give correct 
hypotheses. 


7. Stating hypotheses 


a. A change is made that should improve student satisfaction with the parking situation at 
a local high school. Before the change, 37% of students approve of the parking that’s 
provided. The null hypothesis HO: p>0.37/40 : Pp > 9.37 is tested against the alternative 
Ha: p=0.37Ha : p = 0.37, 


b. A researcher suspects that the mean birth weights of babies whose mothers did not see a 
doctor before delivery is less than 3000 grams. The researcher states the hypotheses as 
Ho: x =3000 grams/to : = 3000 grams 
Ha: x <3000 grams/a : & < 3000 grams 
8. Stating hypotheses 


a. A change is made that should improve student satisfaction with the parking situation at 
your school. Before the change, 37% of students approve of the parking that’s 
provided. The null hypothesis HO: p\=0.3740 : P = 9.37 is tested against the alternative 
Ha: p>0.37Ha : p > 0.37, 


b. A researcher suspects that the mean birth weights of babies whose mothers did not see a 
doctor before delivery is less than 3000 grams. The researcher states the hypotheses as 
HO: 1=3000 grams/to :pf= 3000 grams 
Ha: <2999 grams/ta : pe < 2999 grams 
9. PQ 557 |») No homework? Refer to Exercise 1. The math teachers inspect the 


homework assignments from a random sample of 50 students at the school. Only 68% of 
the students completed their math homework. A significance test yields a P-value of 
0.1265. 


a. Explain what it would mean for the null hypothesis to be true in this setting. 


b. Interpret the P-value. 


10. Attitudes Refer to Exercise 4. In the study of older students’ attitudes, the sample mean 
SSHA score was 125.7 and the sample standard deviation was 29.8. A significance test 
yields a P-value of 0.0101. 


a. Explain what it would mean for the null hypothesis to be true in this setting. 


b. Interpret the P-value. 


11. How much juice? Refer to Exercise 3. The mean amount of liquid in the bottles is 179.6 
ml and the standard deviation is 1.3 ml. A significance test yields a P-value of 0.0589. 
Interpret the P-value. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21, 


Don’t argue Refer to Exercise 2. Yvonne finds that 96 of the 150 students (64%) say they 
rarely or never argue with friends. A significance test yields a P-value of 0.0291. Interpret 
the P-value. 


Interpreting a P-value A student performs a test of HO: u=1004% 0: # = 100 versus Ha: 
u>100% a? #> 100 and gets a P-value of 0.044. The student says, “There is a 0.044 
probability of getting the sample result I did by chance alone.” Explain why the student’s 
explanation is wrong. 


Interpreting a P-value A student performs a test of HO: p=0.3/40 : P = 9.3 versus 

Ha: p<0.34 : Pp < 9.3 and gets a P-value of 0.22. The student says, “This means there is 
about a 22% chance that the null hypothesis is true.” Explain why the student’s 
explanation is wrong. 


pg 559 |») No homework Refer to Exercises 1 and 9. What conclusion would you 
make at the a=0.05@ = 0.05 level? 


Attitudes Refer to Exercises 4 and 10. What conclusion would you make at the a=0.05 
a = 0.05 level? 


How much juice? Refer to Exercises 3 and 11. 
a. What conclusion would you make at the a=0.10a@ = 0.10 level? 


b. Would your conclusion from part (a) change if a 5% significance level was used 
instead? Explain your reasoning. 


Don’t argue Refer to Exercises 2 and 12. 
a. What conclusion would you make at the a=0.010 = 0.01 level? 


b. Would your conclusion from part (a) change if a 5% significance level was used 
instead? Explain your reasoning. 


Making conclusions A student performs a test of HO: p=0.75/40 : P = 9-75 versus 

Ha: p<0.754a * P < 9.75 at the a=0.05a = 0.05 significance level and gets a P-value of 
0.22. The student writes: “Because the P-value is large, we accept HO“0. The data 
provide convincing evidence that the null hypothesis is true.” Explain what is wrong with 
this conclusion 


Making conclusions A student performs a test of HO: y=124o > = 12 versus Ha: x12 
Hq: “#12 at the a=0.05a = 0.05 significance level and gets a P-value of 0.01. The 
student writes: “Because the P-value is small, we reject HO“. The data prove that Ha/fo 
is true.” Explain what is wrong with this conclusion. 


Heavy bread? The mean weight of loaves of bread produced at the bakery where you 
work is supposed to be 1 pound. You are the supervisor of quality control at the bakery, 
and you are concerned that new employees are producing loaves that are too light. 
Suppose you weigh an SRS of bread loaves and find that the mean weight is 0.975 pound. 


22; 


23. 


24. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameter of interest. 


b. Explain why there is some evidence for the alternative hypothesis. 
c. The P-value for the test in part (a) is 0.0806. Interpret the P-value. 


d. What conclusion would you make at the a=0.010 = 0.01 significance level? 


Philly fanatics? Nationally, the proportion of red cars on the road is 0.12. A statistically 

minded fan of the Philadelphia Phillies (whose team color is red) wonders if Phillies fans 
are more likely to drive red cars. One day during a home game, he takes a random sample 
of 210 cars parked at Citizens Bank Park (the Phillies home field), and counts 35 red cars. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameter of interest. 


b. Explain why there is some evidence for the alternative hypothesis. 
c. The P-value for the test in (a) is 0.0187. Interpret the P-value. 
d. What conclusion would you make at the a=0.05e = 0.05 significance level? 


561 Opening a restaurant You are thinking about opening a restaurant and are 
pg vol pe) Up s g p g 


searching for a good location. From research you have done, you know that the mean 
income of those living near the restaurant must be over $85,000 to support the type of 
upscale restaurant you wish to open. You decide to take a simple random sample of 50 
people living near one potential location. Based on the mean income of this sample, you 
will perform a test of 


H0:1=$85,000 Ho : # =$85, 000 
Ha:p>$85,000 Ha : # >$85, 000 


where 1 is the true mean income in the population of people who live near the restaurant.4 
Describe a Type I error and a Type II error in this setting, and give a possible consequence 
of each. 


Reality TV Television networks rely heavily on ratings of TV shows when deciding 
whether to renew a show for another season. Suppose a network has decided that 
“Miniature Golf with the Stars” will only be renewed if it can be established that more 
than 12% of U.S. adults watch the show. A polling company asks a random sample of 
2000 U.S. adults if they watch “Miniature Golf with the Stars.” The network uses the data 
to perform a test of 


HO:p=0.12 Ho : p = 0.12 
Ha:p>0.12 H, :p> 0.12 


where p is the true proportion of all U.S. adults who watch the show. Describe a Type I 


2a. 


26. 


Zi, 


error and a Type II error in this setting, and give a possible consequence of each. 


Awful accidents Slow response times by paramedics, firefighters, and policemen can 
have serious consequences for accident victims. In the case of life-threatening injuries, 
victims generally need medical attention within 8 minutes of the accident. Several cities 
have begun to monitor emergency response times. In one such city, emergency personnel 
took more than 8 minutes to arrive on 22% of all calls involving life-threatening injuries 
last year. The city manager shares this information and encourages these first responders 
to “do better.” After 6 months, the city manager selects an SRS of 400 calls involving life- 
threatening injuries and examines the response times. She then performs a test at the 
a=0.05a = 0.05 level of HO: p=0.220 : p = 0.22 versus Ha: p<0.22Ma : p < 0.22, where 
p is the true proportion of calls involving life-threatening injuries during this 6-month 
period for which emergency personnel took more than 8 minutes to arrive. 


a. Describe a Type I error and a Type II error in this setting. 
b. Which type of error is more serious in this case? Justify your answer. 


c. Based on your answer to part (b), do you agree with the manager’s choice of a=0.05? 
a = 0.05? Why or why not? 


Clean water The Environmental Protection Agency (EPA) has determined that safe 
drinking water should contain at most 1.3 mg/liter of copper, on average. A water supply 
company is testing water from a new source and collects water in small bottles at each of 
30 randomly selected locations. The company performs a test at the a=0.05a = 0.05 
significance level of HO: p=1.3/40 * # = 1-3 versus Ha: p>1.340 : # > 1:3, where p is the 
true mean copper content of the water from the new source. 


a. Describe a Type I error and a Type II error in this setting. 
b. Which type of error is more serious in this case? Justify your answer. 


c. Based on your answer to part (b), do you agree with the company’s choice of a=0.05? 
a = 0.05? Why or why not? 


More lefties? In the population of people in the United States, about 10% are left-handed. 
After bumping elbows at lunch with several left-handed students, Simon wondered if more 
than 10% of students at his school are left-handed. To investigate, he selected an SRS of 
50 students and found 8 lefties (p\=8/50=0.16)\P = 8/50 = 0.16). 


To determine if these data provide convincing evidence that more than 10% of the 
students at Simon’s school are left-handed, 200 trials of a simulation were conducted. 
Each dot in the graph shows the proportion of students that are left-handed in a random 
sample of 50 students, assuming that each student has a 10% chance of being left handed. 


vil. 
>. OE er YF 


0.00 0.05 0.10 0.15 0.20 


p = simulated sample proportion of lefties 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameter of interest. 


b. Use the simulation results to estimate the P-value of the test in part (a). Interpret the P- 
value. 


c. What conclusion would you make? 


28. Who wrote this poem? Statistics can help decide the authorship of literary works. 
Sonnets by a well-known poet contain an average of 1=6.94 — 6.9 new words (words not 
used in the poet’s other works) and a standard deviation of o=2.7¢ = 2.7 words, and the 
number of new words is approximately Normally distributed. Scholars expect sonnets by 
other authors to contain more new words than this famous poet’s works. A new 
manuscript has been discovered with many new sonnets, and scholars are debating 
whether it is this poet’s work. They take a random sample of five sonnets from the new 
manuscript and count the number of new words in each one. The mean number of new 
words in these five sonnets is x =9.2% = 9.2, 

The following dotplot shows the results of simulating 200 random samples of size 5 
from a Normal distribution with a mean of 6.9 and a standard deviation of 2.7, and 
calculating the mean for each sample. 
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xX = simulated mean number of new words 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 


parameter of interest. 


b. Use the simulation results to estimate the P-value of the test in part (a). Interpret the P- 


value. 


c. What conclusion would you make? 


Multiple Choice Select the best answer for Exercises 29-32. 


29. 


30. 


a. 


Experiments on learning in animals sometimes measure how long it takes mice to find 
their way through a maze. The mean time is 18 seconds for one particular maze. A 
researcher thinks that a loud noise will cause the mice to complete the maze faster. She 
measures how long each of 10 mice takes with a loud noise as stimulus. The appropriate 
hypotheses for the significance test are 


a. HO: p=18; Ha: p#18/40 * # = 18; Ha: w x 18, 
b. HO: p=18; Ha: p>180 : w= 18; Ha sw > 18, 
c. HO: p<18;Ha: p=1840 : # < 18; Ha: w= 18, 
d. HO: 1=18; Ha: p<180 : w= 18; Ha 2 < 18, 


e. HO: x =18;Ha:x <18/o : @ = 18; MH, : F< 18 


Exercises 30-32 refer to the following setting. Members of the city council want to know 
if a majority of city residents supports a 1% increase in the sales tax to fund road repairs. 
To investigate, they survey a random sample of 300 city residents and use the results to 
test the following hypotheses: 


HO: p=0.5040 : p = 0.50 
Ha: p>0.504a : p > 0.50 


where p is the proportion of all city residents who support a 1% increase in the sales tax to 
fund road repairs. 


A Type I error in the context of this study occurs if the city council 


finds convincing evidence that a majority of residents supports the tax increase, when 
in reality there isn’t convincing evidence that a majority supports the increase. 


. finds convincing evidence that a majority of residents supports the tax increase, when 
in reality at most 50% of city residents support the increase. 


c. finds convincing evidence that a majority of residents supports the tax increase, when 
in reality more than 50% of city residents do support the increase. 


. does not find convincing evidence that a majority of residents supports the tax increase, 
when in reality more than 50% of city residents do support the increase. 


e. does not find convincing evidence that a majority of residents supports the tax increase, 
when in reality at most 50% of city residents do support the increase. 


31. In the sample, p\=158/300=0.527P = 158/300 = 0.527 The resulting P-value is 0.18. 
What is the correct interpretation of this P-value? 


a. Only 18% of the city residents support the tax increase. 
b. There is an 18% chance that the majority of residents supports the tax increase. 


c. Assuming that 50% of residents support the tax increase, there is an 18% probability 
that the sample proportion would be 0.527 or greater by chance alone. 


d. Assuming that more than 50% of residents support the tax increase, there is an 18% 
probability that the sample proportion would be 0.527 or greater by chance alone. 


e. Assuming that 50% of residents support the tax increase, there is an 18% chance that 
the null hypothesis is true by chance alone. 


32. Based on the P-value in Exercise 31, which of the following would be the most 
appropriate conclusion? 


a. Because the P-value is large, we reject HO/40. We have convincing evidence that more 
than 50% of city residents support the tax increase. 


b. Because the P-value is large, we fail to reject HOM#0. We have convincing evidence that 
more than 50% of city residents support the tax increase. 


c. Because the P-value is large, we reject HO/0. We have convincing evidence that at 
most 50% of city residents support the tax increase. 


d. Because the P-value is large, we fail to reject HO. We have convincing evidence that 
at most 50% of city residents support the tax increase. 


e. Because the P-value is large, we fail to reject HO0. We do not have convincing 
evidence that more than 50% of city residents support the tax increase. 


Recycle and Review 


33. Women in math (5.3) Of the 24,611 degrees in mathematics given by U.S. colleges and 
universities in a recent year, 70% were bachelor’s degrees, 24% were master’s degrees, 
and the rest were doctorates. Moreover, women earned 43% of the bachelor’s degrees, 


41% of the master’s degrees, and 29% of the doctorates.2 


a. How many of the mathematics degrees given that year were earned by women? Justify 
your answer. 


b. Suppose we randomly select a person who earned a mathematics degree in this recent 
year. Are the events “Degree earned by a woman” and “Degree was a bachelor’s 


degree” independent? Justify your answer using appropriate probabilities. 


c. If you choose 2 of the 24,611 mathematics degrees at random, what is the probability 
that at least 1 of the 2 degrees was earned by a woman? Show your work. 


34. Explaining confidence (8.2) Here is an explanation from a newspaper concerning one of 
its opinion polls. Explain what is wrong with the following statement. 


For a poll of 1600 adults, the variation due to sampling error is no more than 3 
percentage points either way. The error margin is said to be valid at the 95% 
confidence level. This means that, if the same questions were repeated in 20 polls, the 
results of at least 19 surveys would be within 3 percentage points of the results of this 
survey. 


SECTION 9.2 Tests About a Population Proportion 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Large Counts conditions for performing 
a Significance test about a population proportion. 


e Calculate the standardized test statistic and P-value for a test about a population 
proportion. 


e Perform a significance test about a population proportion. 


A significance test can be used to test a claim about a population parameter. Section 9.1 
presented the reasoning of significance tests, including the idea of a P-value. This section 
describes how to perform a significance test about a population proportion. 


Performing a Significance Test About p 


In Section 9.1, we met a basketball player who claimed to make 80% of his free throws. We 
thought that he might be exaggerating. In a sample of 50 free throws, the player made only 32. 
His sample proportion of made free throws was therefore 


Viorika/Getty Images 


32 
p =—= 0.64 
p4=3250=0.64 = 50 
This result is much lower than what he claimed. Does it provide convincing evidence against 
the player’s claim? To find out, we need to perform a significance test of 


HO: p=0.80/40 : p = 0.80 
Ha: p<0.80/a : p < 0.80 


where p = the true proportion of free throws that the shooter makes in the long run. 


CHECKING CONDITIONS In Chapter 8, we introduced conditions that should be met before 
we construct a confidence interval for a population proportion p. We called them Random, 
10%, and Large Counts. These same conditions must be verified before carrying out a 
significance test. 


The Large Counts condition for proportions requires that both np and n(1—p)” (1—P) beat 
least 10. When constructing a confidence interval for p, we use the sample proportion p/P in 
place of the unknown p to check this condition. As we discussed in Section 8.2, npA>10 
np > 10 and n(1—p4)=10” (1—p) > 10 are the symbolic equivalent of saying that the observed 
counts of successes and failures in the sample are both at least 10. Because we assume HO/40 is 
true when performing a significance test, we use the parameter value specified by the null 
hypothesis (denoted p0?? ) when checking the Large Counts condition. In this case, the Large 
Counts condition says that the expected count of successes np0’P0 and of failures n(1—p0) 
n(1— Po) are both at least 10. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE TEST ABOUT A 
PROPORTION 


e Random: The data come from a random sample from the population of interest. 
o 10%: When sampling without replacement, n<0.10Nn < 0.10N, 


e Large Counts: Both np0”?0 and n(1—p0)” (1 — Po) are at least 10. 


If the data come from a convenience sample or a voluntary response sample, there’s no 
point carrying out a significance test for p. The same is true if there are other sources of bias 
during data collection. If the Large Counts condition is violated, a P-value calculated from a 
Normal distribution will not be accurate. 

Let’s check the conditions for performing a significance test of the virtual basketball 
player’s claim that p=0.80P = 9.80, 


e Random: The 50 shots can be viewed as a random sample from the population of all possible 
shots that the shooter takes. 
o 10%: We’re not sampling without replacement from a finite population (because the 
player can keep on shooting), so we don’t check the 10% condition. 
e Large Counts: Assuming H0/20 is true, p=0.80P — 0.80. Then np0=(50)(0.80)=40 
npo = (50) (0.80) = 40 and n(1—p0)=(50)(0.20)=10" (1 — Po) = (50) (0.20) = 10 are both at 
least 10, so this condition is met. 


EXAMPLE |Get a job! (>) 
Checking conditions 


ArtWell/Shutterstock.com 


PROBLEM: According to the U.S. Census Bureau, the proportion of students in high school 
who have a part-time job is 0.25. An administrator at a local high school suspects that the 
proportion of students at her school who have a part-time job is less than the national figure. 
She would like to carry out a test at the a=0.05a = 0.05 significance level of 


HO: p=0.254o : p = 0.25 
Ha: p<0.254a [p< 0.25 


where p=? ~ the true proportion of all students at the school who have a part-time job. 


The administrator selects a random sample of 200 students from the school and finds that 
39 of them have a part-time job. Check if the conditions for performing the significance test 
are met. 


SOLUTION: 
e Random? Random sample of 200 students from the school. / 
© 10%: 200 is less than 10% of students at a large high school. 


© Large Counts? npo = 200(0.25) = soz10"P0 ~ 200 (0.25) = 50 > 10 and n(1 
= 200 (1 — 0.25) = 150 > 10/7 


— po) = 200(1 - 


0.25) = 150210" (1 — po) 


Be sure to use Po, not pr? , when checking the Large Counts condition! 
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CALCULATIONS: STANDARDIZED TEST STATISTIC AND P-VALUE For the free- 


~ _ 32 _ 
throw shooter, the sample proportion of made shots was p\=3250=0.64? ~ 50 sd 


Because this result is less than 0.80, there is some evidence against HO: p=0.80/0 : p = 0.80 
and in favor of Ha: p<0.80/4e : Pp < 9.80, But do we have convincing evidence that the player is 


exaggerating? To answer this question, we have to know how likely it is to get a sample 
proportion of 0.64 or less by chance alone when the null hypothesis is true. In other words, we 
are looking for a P-value. 

Suppose for now that the null hypothesis HO: p=0.80/0 : p = 9.80 is true. Consider the 
sample proportion p/? of made free throws in a random sample of size n=50n = 50. You 
learned in Section 7.2 that the sampling distribution of pp will have mean 


upA=p=0.804 = p = 0.80 


and standard deviation 


[pQ—p) —_‘[0.80(0.20) _ 


oS = \ — \ e 0.0566 
op/=p(1—-p)n=0.80(0.20)50=0.0566 n 50 


Because the Large Counts condition is met, the sampling distribution of p’? will be 
approximately Normal. Figure 9.3 displays this distribution. We have added the player’s 


- _. 32 _ 
sample result, p\=3250=0.64? «50 no 


(1.6302 (1.6868 (}.7434 (.8000 O.8566 0.9132 0.9698 
p = sample proportion of 
p=0.64 made free throws 
FIGURE 9.3 Normal distribution that models the sampling distribution of the sample proportion 
p’*P of made shots in random samples of 50 free throws by an 80% shooter. 


To assess how far the statistic (p/=0.64)\? = 9-64) 


(p0=0.80) (Po ~ 0.80) we standardize the statistic: 


is from the null value of the parameter 


| Po (1—p9) 
z=p\-p0 p 0( 1- p 0)n=0.64-0.800.0566=-2.83 VY  ® 


This value is called the standardized test statistic. 


DEFINITION Standardized test statistic 


A standardized test statistic measures how far a sample statistic is from what 
we would expect if the null hypothesis HO//0 were true, in standard deviation 
units. That is, 


standardized test statistic=statistic — parameterstandard deviation of statistic 
: sg statistic — parameter 

standardized test statistic = _-—_——__—__\! 
standard deviation of statistic 


Many people refer to the value of z as the test statistic. Some also write the formula as 


test statistic=statistic — null valuestandard error of statistic 


= statistic — nullvalue 
teststatistic = -—_—_—_—_—__—_—__———_ 
standarderrorofstatistic 


We chose to use the terminology from the AP® Statistics exam formula sheet. 


The standardized test statistic says how far the sample result is from the null value, and in 
what direction, on a standardized scale. In this case, the sample proportion p\=0.64P = 9.64 of 
made free throws is 2.83 standard deviations less than the null value of p=0.80P = 9-80, 


You can use the standardized test statistic to find the P-value for a significance test. In this 
case, the P-value is the probability of getting a sample proportion less than or equal to pA=0.64 
p = 0.64 by chance alone when HO: p=0.80/0 : p = 9-80 js true. The shaded area in Figure 
9.4(a) shows this probability. Figure 9.4(b) shows the corresponding area to the left of z=—-2.83 
z =~2.83 in the standard Normal distribution. 


Standard 
Normal curve 


P-value = 0.0023 P-value = 0.0023 


T of | ay ol T 


0.6302) 0.6868 0.7434 O.8000 0.8566 0.9132 0.9698 —3 =? = 0 l 2 3 


p = sample proportion of z=-2.83 z-score 
p = 0.64 made free throws 
FIGURE 9.4 The shaded area shows the P-value for the player’s sample proportion of made 


shots (a) on the Normal distribution that models the sampling distribution of p\P from Figure 9.3 
and (b) on the standard Normal curve. 


We can find the P-value using Table A or technology. Table A gives P(z<—2.83)=0.0023 


P(z <—2.83) = 0.0023 The TI-83/84 command normalcdf(lower: -1000~1000, upper:—2.83 
~2.83  mean:0, SD:1) also gives a P-value of 0.0023. 


Note that the calculated P-value of 0.0023 is even smaller than the estimated P-value of 0.0075 
from our earlier simulation. 


If HO/Z0 is true and the player makes 80% of his free throws in the long run, there’s only 
about a 0.0023 probability that he would make 32 or fewer of 50 shots by chance alone. This 
small probability confirms our earlier decision to reject HO“0 and gives convincing evidence 
that the player is exaggerating. 


EXAMPLE | Part-time jobs (>) 
Calculating the standardized test statistic and P-value 


Andersen Ross/Exactostock-1555 
/Superstock 


PROBLEM: In the preceding example, an administrator at a local high school decided to 
perform a test at the a=0.05a = 0.05 significance level of 

Ha: p<0.254a : p < 0.25 
where p = the true proportion of all students at the school who have a part-time job. The 
administrator selects a random sample of 200 students from the school and finds that 39 of 


them have a part-time job. We already confirmed that the conditions for performing a 
significance test are met. 


a. Explain why the sample result gives some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 
SOLUTION: 


a. The sample proportion of students at this school with a part-time job 1s pr = 39200 = 
“ 39 


0.195 == 90: 0.529) which is less than the national proportion of p= coer — 0.25 (as 
suggested by Hatta), 


0,195—0.25 
z= —— = - 1.80 


/ 0.25(0.75) 


b. z= 0.195 - 0.250.25(0.75)200=-180 VV ™ 


standardized test statistic=statistic 


—parameterstandard deviation of statisticz=p/—popo(1—po)n 


g = 3 statistic — parameter 
standardized test statistic = 


standard deviation of statistic 


Standard 
Normal curve 


=3 “#] =] 0 | 2 3 


z=-1.80 
Using Table A: P(z < -1.80) = 0.0359! (2 < —1.80) = 0.0359 
Using Technology: normalcdf (lower: 1000, 1000, upper:—1.80, 1.80, mean: 0, SD: 
1)=0.0359= 0.0359 
c. Because the P-value of 0.0359<« = 0.05,0-0359 < a = 0.05, we reject H,. We have 


convincing evidence that the true proportion of all students at this large high school 
who have a part-time job is less than 0.25. 
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Remember that there are two possible explanations for why the sample proportion of 
students who have part-time jobs (p\=39/200=0.195)(P = 39/200 = 0.195) in the example is 
less than p=0.25? = 9-25, The first explanation is that the true proportion of all students at the 
school who have part-time jobs is 0.25 and that we got a sample proportion this small due to 
sampling variability. The second explanation is that the true proportion of students at the school 
with part-time jobs is less than 0.25. We ruled out the first explanation due to the small P-value 
(0.0359) and settled on the second explanation. Of course, it is possible that we made a Type I 
error by rejecting HO/“0 when it is true. 


Putting It All Together: One-Sample z Test for p 


To perform a significance test, we state hypotheses, check conditions, calculate a standardized 
test statistic and P-value, and draw a conclusion in the context of the problem. The four-step 
process is ideal for organizing our work. 


SIGNIFICANCE TESTS: A FOUR-STEP PROCESS 


State: State the hypotheses you want to test and the significance level, and define 
— any parameters you use. 


Plan: Identify the appropriate inference method and check conditions. 


Do: If the conditions are met, perform calculations. 
e Give the sample statistic(s). 
e Calculate the standardized test statistic. 


e Find the P-value. 


Conclude: Make a conclusion about the hypotheses in the context of the problem. 


We have shown you how to complete each of the four steps in a test about a population 
proportion. Let’s reflect on how the pieces fit together in a test of HO: p=p0/0 : P = Po. When 
the conditions are met, the sampling distribution of p/? is approximately Normal with 


mean wp=p and _ standard deviation op=p(1—p)n 


j 
ee 2 ue | p(1—p) 
mean ft, =p and standard deviation o, = \ 


n 


For confidence intervals, we substitute p/? for p in the standard deviation formula to obtain the 
standard error. When performing a significance test, however, we start by assuming that the 
null hypothesis HO: p=p0/0 : P = Po is true. We use this null value when calculating the 
standard deviation. 


If we standardize the statistic p/P by subtracting its mean and dividing by its standard 
deviation, we get the standardized test statistic. There are three conditions that must be met for 


this formula to be valid—one for each of the three components in the formula. 


oan P — po 
——————— 
Ci |po(1 — po) 
n 


. The Random condition helps ensure that p\—poP ~ Po is a good estimate for the difference 
between the true value of p and the null value of po. 


’. The Large Counts condition allows us to use a Normal distribution to model the sampling 
distribution of p/?. When this condition is met and HO/“0 is true, the standardized test 
Statistic z has approximately the standard Normal distribution. Then we can obtain P-values 
from this distribution using Table A or technology. 


3. The 10% condition allows us to use the familiar formula for the standard deviation of the 
sampling distribution of p\? (with pOP9 replacing p) when we are sampling without 
replacement from a finite population. 


Here is a summary of the details for a one-sample z test for a proportion. 


ONE-SAMPLE z TEST FOR A PROPORTION 


Suppose the conditions are met. To perform a test of HO: p=p040 : P = Po, compute the 
standardized test statistic 
P—P0 
= 
Po (1—Po) 
z=p/—p0p0(1-p0)n me 

Find the P-value by calculating the probability of getting a z statistic this large or larger in 
the direction specified by the alternative hypothesis Ha/4« using a standard Normal 
distribution. 


Ha: p > Py Ay: p< py H,:p # py 


The AP® Statistics course outline calls this test a Jarge-sample test for a proportion because it is 
based on a Normal approximation to the sampling distribution of p“? that becomes more 
accurate as the sample size increases. 


Here is an example of the one-sample z test for p in action. 


EXAMPLE | One potato, two potato (>) 
Significance test for a proportion 


imagestock/Getty Images 


PROBLEM: Recall that the potato-chip producer we met in Section 9.1 and its 
He = main supplier agree that each shipment of potatoes must meet certain quality 
standards. If the producer finds convincing evidence that more than 8% of the 
potatoes in the shipment have “blemishes,” the truck will be sent away to get another load of 
potatoes from the supplier. Otherwise, the entire truckload will be used to make potato chips. 
The potato-chip producer has just received a truckload of potatoes from the supplier. A 
supervisor selects a random sample of 500 potatoes from the truck. An inspection reveals 
that 47 of the potatoes have blemishes. Is there convincing evidence at the a=0.100 = 0.10 
level that more than 8% of the potatoes in the shipment have blemishes? 


SOLUTION: 

STATE: We want to test 
HO: p = 0.08/40 : p = 0.08 
Ha: p> 0.08. :p > 0.08 


where (be the true proportion of potatoes in this shipment with blemishes, using a = 0:10. 
a = 0.10. 


7 STATE: State the hypotheses you want to test and the significance level, and 


‘EZ define any parameters you use. 


PLAN: One-sample zZ test ‘for fp 


© Random: Random sample of 500 potatoes from the shipment. / 


© 10%: It’s reasonable to assume that 500 < 10% of all potatoes in the shipment. / 


500 (0.08) = 40 > 10 


© Large Counts: 500(0.08) = 40210 and 500(0.92) = 460210 


500 (0.92) = 460 > 107 


PLAN: Identify the appropriate inference method and check conditions. 


DO: 
p = — = 0.094 


@ p= 47500 = 0.094 500 


0,094—0.08 . 
eS SO 


/ 0.08(0.92) 


®@ z= 0.094 — 0.080.08(0.92)500 = 1.15 \ ~ 500 


e P-value: 


Standard 
Normal curve 


= 1:15 
Using Table A: P(z>1.15) = 1 — 0.8749 = 01251 (z 2 1.15) = 1 — 0.8749 = 0.1251 


Using technology: normalcdf(lower:1.1 5, upper:1000, mean:o, SD) = 0.1251 


DO: If the conditions are met, perform calculations: 


e Give the sample statistic(s). 


e Calculate the standardized test statistic. 


e Find the P-value. 


The sample result gives some evidence in favor of Hz because 0.094 + 0.08. 


CONCLUDE: Because our P-value of 0.1251>a = 0.10,0-1251 > a= 0.10, we fail to reject 
H,. There is not convincing evidence that the true proportion of blemished potatoes in the 


shipment 1S greater than 0.08. 


Conclude: Make a conclusion about the hypotheses in the context of the problem. 
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AP® EXAM TIP 


When a significance test leads to a fail to reject HO“0 decision, as in the preceding 
example, be sure to interpret the results as “We don’t have convincing evidence for 


Hae.” Saying anything that sounds like you believe HO/o is (or might be) true will 
lead to a loss of credit. For instance, it would be wrong to conclude, “There is 
convincing evidence that the true proportion of blemished potatoes is 0.08.” And 
don’t write responses as text messages, like “FTR the HO/o .” 


The preceding example reminds us why significance tests are important. The sample 
proportion of blemished potatoes was p\=47/500=0.094? = 47/500 = 0.094 This result gave 
some evidence against HO/0 and in favor of Ha/«. To see whether such an outcome is 
unlikely to occur by chance alone when H0/20 is true, we had to carry out a significance test. 
The P-value told us that a sample proportion this large or larger would occur in about 12.5% of 
all random samples of 500 potatoes when H0/40 is true. So we can’t rule out sampling 
variability as a plausible explanation for getting a sample proportion of p\=0.094P = 9-094, of 
course, we could have made a Type II error in this case by failing to reject HO0 when 
Ha: p>0.084a : p > 0.08 is true. 


WHAT HAPPENS WHEN THE DATA DON’T SUPPORT Hae? Suppose the supervisor 
had inspected a random sample of 500 potatoes from the shipment and found 33 with 
blemishes. This yields a sample proportion of p\=33/500=0.066? = 33/500 = 0.066 This 
sample doesn’t give any evidence to support the alternative hypothesis Ha: p>0.08 

H, : p > 0.081 Don’t continue with the significance test. The conclusion is clear: we should fail 
to reject HO: p=0.08/40 : p = 9.08. This truckload of potatoes will be used by the potato-chip 
producer. 


If you weren’t paying attention, you might end up carrying out the test. Let’s see what 
would happen. The corresponding standardized test statistic is 
p—p 0.066—0.08 
St _ i 
| PO (1—pg) / 0.08(0.92) 


™ V~ 500 
2=p/—p0p0(1—p0)n=0.066—-0.080.08(0.92)500=-1.15 


&N 


What’s the P-value? It’s the probability of getting a z statistic this large or larger in the 
direction specified by Ha, P(z>-1.15) 4a »P(z>-1.15) Figure 9.5 shows this P-value as an 
area under the standard Normal curve. Using Table A or technology, the P-value is 
1-0.1251=0.87491 — 0.1251 = 0.8749. There’s about an 87.5% chance of getting a sample 


proportion as large as or larger than p\=0.066P = 9.066 if p=0.08P = 9-08. As a result, we 
would fail to reject HOM#0. Same conclusion, but with lots of unnecessary work! 


Standard P-value = 0.8749 


Normal curve 


z=-115 
FIGURE 9.5 The P-value for the one-sided test of Hg: p = 0.08 versus H,;: p > 0.08. 


Always check to see whether the data give evidence against HO/0 in the direction specified 
by Ha/ta before you do calculations. 


CHECK YOUR UNDERSTANDING 


i. 


,, According to the National Campaign to Prevent Teen and Unplanned Pregnancy, 20% of 
teens aged 13 to 19 say that they have electronically sent or posted sexually suggestive 
images of themselves.® The counselor at a large high school worries that the actual figure 
might be higher at her school. To find out, she administers an anonymous survey to a 
random sample of 250 of the school’s 2800 students. All 250 respond, and 63 admit to 
sending or posting sexual images. Carry out a significance test at the a=0.05a = 0.05 
significance level. 


Your calculator will handle the “Do” part of the four-step process, as this Technology 
Corner illustrates. However, be sure to read the AP® Exam Tip that follows. 


20. Technology Corner |PERFORMING A ONE-SAMPLE Z TEST 
FOR A PROPORTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to test a claim about a population proportion. We’ |! demonstrate 
using the preceding example. In a random sample of size n = 500, the supervisor found X = 


47 potatoes with blemishes. To perform a significance test: 
e Press [STAT], then choose TESTS and 1-PropZTest. 


e On the 1-PropZTest screen, enter the values shown: p0=0.08?0 = 0.08 | x=472 = 47, and 
n=500n = 500, Specify the alternative hypothesis as “prop >p0> P0.” Note: x is the 
number of successes and n is the number of trials. Both must be whole numbers! 


e If you select “Calculate” and press [ENTER|, you will see that the standardized test 
statistic is z=1.152 = 1.15 and the P-value is 0.1243. 


e If you select the “Draw” option, you will see the screen shown on the right. Compare 
these results with those in the example on page 574. 


Pe:0.88 

x:47 

n: 580 

ProP:#poe <Pe 
Color: i= 


Calculate Draw 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Prop>@. 08 
z=1.153915828 
P=0, 1242673934 
b=0.094 

n=50@ 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


ropzTest 
1539 


ro 
we 


P=8,1243 


AP® EXAM TIP 


You can use your calculator to carry out the mechanics of a significance test on the 
AP® Statistics exam. But there’s a risk involved. If you give just the calculator 


answer with no work, and one or more of your values are incorrect, you will 
probably get no credit for the “Do” step. If you opt for the calculator-only method, be 
sure to name the procedure (one-sample z test for a proportion) and to report the 
standardized test statistic (z=1.15)(* = 1-15) and P-value (0.1243). 


Two-Sided Tests 


The free-throw shooter, part-time job, and blemished-potato settings involved one-sided tests. 
The P-value in a one-sided test about a proportion is the area in one tail of a standard Normal 
distribution—the tail specified by Ha/4«. In a two-sided test, the alternative hypothesis has the 
form Ha: p~po4. a: P * Po, The P-value in such a test is the probability of getting a sample 
proportion as far as or farther from p0?P0 in either direction than the observed value of p/?. As 
a result, you have to find the area in both tails of a standard Normal distribution to get the P- 
value. The following example shows how this process works. 


EXAMPLE |Nonsmokers (>) 
A two-sided test 


snowmeo/Getty Images 


PROBLEM: According to the Centers for Disease Control and Prevention (CDC) website, 
68% of high school students have never smoked a cigarette.2 Statistics class project, 
Yanhong surveys a simple random sample of 150 students from her school. She gets 
responses from all 150 students, and 90 say that they have never smoked a cigarette. Is there 
convincing evidence that the CDC’s claim does not hold true at Yanhong’s school? 


SOLUTION: 


STATE: We want to test 
‘Ho: p = 0.68 : p = 0.68 


Ha: p,0.68Ha : p, 0.68 
where pe the proportion of all students in Yanhong’s school who would say they have 
never smoked a cigarette. We'll use « = 0.05.0 = 0.05. 


Follow the four-step process! 


PLAN: One-sample z test ‘for Pf 
e Random: Yanhong surveyed an SRS of 150 students from her school. / 


Oo 10%: 150 students is less than 10% of all students ata large high school. JY 


150 (0.68) = 102 > 10 


© Large Counts: 150(0.68) = 102210 and 150(0.32) = 48210 


150 (0.32) = 48 > 10/7 


DO: 
p = = = 0.60 

© pr =90150=0.60' — 150 ~ 

o= aes. = — 2.10 

/ 0.68(0.32) 
© z= 0.60 — 0.680.68(0.32)150=-2.10 VY 
e P-value: 
Standard 


Normal curve 


Using Table A: P(z< ~2.10P (2 S —2.10 or 759.10) = 2(0.0179) = 0.0358 
z > 2.10) = 2(0.0179) = 0.0358 


Using technology: normalcdf(lower: —1000, upper: —2.10, mean:o, SD:1) x 2= 0.0357 
2 = 0.0357 


The sample result gives some evidence in favor of Hz because 0.60 # 0.68. 


The TI-83/84’s 1-PropZTest gives z = -2.10 and P-value = 0.0357. 


CONCLUDE: Because our P-value of 0.0357<a =0.05,0-0357 <a = 0.05, we reject Holo, 
We have convincing evidence that the proportion of all students at Yanhong’s school who 
would say they have never smoked a cigarette differs from the CDC’s claim of 0.68. 
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AP® EXAM TIP 


When making a conclusion in a significance test, be sure that you are describing the 
parameter and not the statistic. In the preceding example, it’s wrong to say that we 
have convincing evidence that the proportion of students at Yanhong’s school who 
said they have never smoked differs from the CDC’s claim of 0.68. The “proportion 
who said they have never smoked” is the sample proportion, which is known to be 
0.60. The test gives convincing evidence that the proportion of all students at 
Yanhong’s school who would say they have never smoked a cigarette differs from 
0.68. 


WHY CONFIDENCE INTERVALS GIVE MORE INFORMATION The result of a 
significance test begins with a decision to reject HO“0 or fail to reject HO“. In Yanhong’s 
smoking study, for instance, the data led us to reject HO: p=0.68/40 : p = 9.68 because we 
found convincing evidence that the proportion of students at her school who would say they 
have never smoked cigarettes differs from the CDC’s claim. We’re left wondering what the 
actual proportion p may be. A confidence interval can shed some light on this issue. You 
learned how to calculate a confidence interval for a population proportion in Section 8.2. 


A 95% confidence interval for p is 


0.60+1.960.60(1—0.60)150=0.60+40.078=(0.522,0.678) 
/0.60(1—0.60) 


0.60 + 1.964/ = 0.60 + 0.078 = (0.522,0.678 
60 \ 150 8 = (0.522, ) 


This interval gives the values for p that are plausible based on the sample data. We would 
not be surprised if the proportion of all students at Yanhong’s school who would say they have 
never smoked cigarettes was any value between 0.522 and 0.678. However, we would be 
surprised if the true proportion was 0.68 because this value is not contained in the confidence 
interval. Figure 9.6 gives computer output from Minitab software that includes both the results 
of the significance test and the confidence interval. 


—. Session 


Test and Cl for One Proportion 
Test of p = 0.68 vs p # 0.68 


> 


FIGURE 9.6 Minitab output for the two-sided significance test at a = 0.05 and a 95% confidence 
interval for Yanhong’s smoking study. 

There is a link between confidence intervals and two-sided tests. The 95% confidence 
interval (0.522, 0.678) gives an approximate set of p0’sP0 § that would not be rejected by a 
two-sided test at the a=0.05a = 0.05 significance level. Any pOP° value outside the interval 


would be rejected as implausible. 
With proportions, the link isn’t perfect because the standard error used for the confidence 
interval is based on the sample proportion p/?, while the denominator of the standardized test 


Statistic is based on the value pOP° from the null hypothesis. 


Standardized test statistic: z ee se: 
Te 
| Po (1—Po) 
Standardized test statistic: z=p/—p0p0(1—p0)n \ i 
1—p 


Confidence interval: p + z* Y 
Tl 


Confidence interval: p\+z*p/(1-p)n 
The big idea is still worth considering: a two-sided test at significance level a and a 100(1-a), 
100(1—@), confidence interval (a 95% confidence interval if a=0.05e = 0.05) give similar 
information about the population parameter. There is a connection between one-sided tests and 
confidence intervals, but it is beyond the scope of this course. 


CHECK YOUR UNDERSTANDING 


hy 


v According to the National Institute for Occupational Safety and Health, job stress poses 
a major threat to the health of workers. A news report claims that 75% of restaurant 
employees feel that work stress has a negative impact on their personal lives.2 Managers of a 
large restaurant chain wonder whether this claim is valid for their employees. A random 
sample of 100 employees finds that 68 answer “Yes” when asked, “Does work stress have a 
negative impact on your personal life?” 

1. Do these data provide convincing evidence at the a=0.10e = 0.10 significance level that 


the proportion of all employees in this chain who would say “Yes” differs from 0.75? 


2. The figure shows Minitab output from a significance test and confidence interval for the 
restaurant worker data. Explain how the confidence interval is consistent with, but gives 
more information than, the test. 


_! Session 


Test and Cl for One Proportion 


Section 9.2 Summary 


¢ The conditions for performing a significance test of HO: p=p00 : P = Po are 
= Random: The data come from a random sample from the population of interest. 
» 10%: When sampling without replacement, n<0.10Nn < 0.10N, 
. Large Counts: Both np0”?0 and n(1—p0)" (1 — Po) are at least 10. 
e The standardized test statistic for a one-sample z test for a proportion is 


P—Po 


& 


| Po (1—Po) 
z=p\—p0po(1-p0)n = Y 
e When the Large Counts condition is met, the standardized test statistic has approximately a 
standard Normal distribution. You can use Table A or technology to find the P-value. 


e Follow the four-step process when you perform a significance test: 


State: State the hypotheses you want to test and the significance level, and define any 
parameters you use. 


Plan: Identify the appropriate inference method and check conditions. 


Do: If the conditions are met, perform calculations: 
: Give the sample statistic(s). 
=: Calculate the standardized test statistic. 


. Find the P-value. 


Conclude: Make a conclusion about the hypotheses in the context of the problem. 


e Confidence intervals provide additional information that significance tests do not—namely, a 


set of plausible values for the population proportion p. A two-sided test of HO: p=p0 
Ho : P = Po at significance level a usually gives the same conclusion as a 100(1-a)% 


100 (1 — &) % confidence interval. 


9.2 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 
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Section 9.2 Exercises 


35. PG 569 ») Home computers Jason reads a report that says 80% of U.S. high school 
Ne] 


students have a computer at home. He believes the proportion is smaller than 0.80 at his 
large rural high school. Jason chooses an SRS of 60 students and finds that 41 have a 
computer at home. He would like to carry out a test at the a=0.05a = 0.05 significance 
level of HO: p=0.80/0 : p = 9.80 versus Ha:p<0.80/a : p < 9.80) where p = the true 
proportion of all students at Jason’s high school who have a computer at home. Check if 
the conditions for performing the significance test are met. 


36. Walking to school A recent report claimed that 13% of students typically walk to school.2 
DeAnna thinks that the proportion is higher than 0.13 at her large elementary school. She 
surveys arandom sample of 100 students and finds that 17 typically walk to school. 
DeAnna would like to carry out a test at the a=0.05@ = 0.05 significance level of 
HO: p=0.13440 > p= 0.13 versus Ha: p>0.134a (p> 0.13) where p = the true proportion 
of all students at her elementary school who typically walk to school. Check if the 
conditions for performing the significance test are met. 


37. Fair coin? You want to determine if a coin is fair. So you toss it 10 times and record the 
proportion of tosses that land “heads.” You would like to perform a test of HO: p=0.5 
Hy : p = 9-5 versus Ha: p#0.5/4a : p#0.5 where p = the proportion of all tosses of the 
coin that would land “heads.” Check if the conditions for performing the significance test 
are met. 


38. Fire the coach! A college president says, “More than two-thirds of the alumni support my 
firing of Coach Boggs.” The president’s statement is based on 200 emails he has received 
from alumni in the past three months. The college’s athletic director wants to perform a 
test of HO: p=2/340 >: P= 2/3 versus Ha: p>2/34a ‘p> 2/3 where p = the true 
proportion of the college’s alumni who favor firing the coach. Check if the conditions for 
performing the significance test are met. 


39. 


40. 


41. 


42. 


43. 


44. 


pg 571 |») Home computers Refer to Exercise 35. 


a. Explain why the sample result gives some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


Walking to school Refer to Exercise 36. 
a. Explain why the sample result gives some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


Significance tests A test of HO: p=0.5/40 : P = 9-5 versus Ha: p>0.54a : P > 9-5 based on 
a sample of size 200 yields the standardized test statistic z=2.192 = 2.19. Assume that the 
conditions for performing inference are met. 


a. Find and interpret the P-value. 


b. What conclusion would you make at the a=0.01@ = 0.01 significance level? Would 
your conclusion change if you used a=0.05e@ = 0.05 instead? Explain your reasoning. 


c. Determine the value of pA=P = the sample proportion of successes. 


Significance tests A test of HO: p=0.6540 : p = 9.65 against Ha: p<0.654a : p < 0.65 
based on a sample of size 400 yields the standardized test statistic z=-1.782 =~—1-78. 


a. Find and interpret the P-value. 


b. What conclusion would you make at the a=0.10a@ = 0.10 significance level? Would 
your conclusion change if you used a=0.05e@ = 0.05 instead? Explain your reasoning. 


c. Determine the value of pA=P = the sample proportion of successes. 


pg 574 |») Bullies in middle school A media report claims that more than 75% of 


middle school students engage in bullying behavior. A University of Illinois study on 
aggressive behavior surveyed a random sample of 558 middle school students. When 
asked to describe their behavior in the last 30 days, 445 students admitted that they had 
engaged in physical aggression, social ridicule, teasing, name-calling, and issuing threats 
—all of which would be classified as bullying.42 Do these data provide convincing 
evidence at the a=0.05a = 0.05 significance level that the media report’s claim is correct? 


Watching grass grow The germination rate of seeds is defined as the proportion of seeds 
that sprout and grow when properly planted and watered. A certain variety of grass seed 
usually has a germination rate of 0.80. A company wants to see if spraying the seeds with 
a chemical that is known to increase germination rates in other species will increase the 
germination rate of this variety of grass. The company researchers spray a random sample 


45. 


46. 


47. 


48. 


49. 


of 400 grass seeds with the chemical, and 339 of the seeds germinate. Do these data 
provide convincing evidence at the a=0.05a@ = 0.05 significance level that the chemical is 
effective for this variety of grass? 


Better parking A local high school makes a change that should improve student 
satisfaction with the parking situation. Before the change, 37% of the school’s students 
approved of the parking that was provided. After the change, the principal surveys an SRS 
of 200 from the more than 2500 students at the school. In all, 83 students say that they 
approve of the new parking arrangement. The principal cites this as evidence that the 
change was effective. 


a. Describe a Type I error and a Type II error in this setting, and give a possible 
consequence of each. 


b. Is there convincing evidence that the principal’s claim is true? 


Side effects A drug manufacturer claims that less than 10% of patients who take its new 
drug for treating Alzheimer’s disease will experience nausea. To test this claim, 
researchers conduct an experiment. They give the new drug to a random sample of 300 out 
of 5000 Alzheimer’s patients whose families have given informed consent for the patients 
to participate in the study. In all, 25 of the subjects experience nausea. 


a. Describe a Type I error and a Type II error in this setting, and give a possible 
consequence of each. 


b. Do these data provide convincing evidence for the drug manufacturer’s claim? 


Cell-phone passwords A consumer organization suspects that less than half of parents 
know their child’s cell-phone password. The Pew Research Center asked a random sample 
of parents if they knew their child’s cell-phone password. Of the 1060 parents surveyed, 
551 reported that they knew the password. Explain why it isn’t necessary to carry out a 
significance test in this setting. 


Proposition X A political organization wants to determine if there is convincing evidence 
that a majority of registered voters in a large city favor Proposition X. In an SRS of 1000 
registered voters, 482 favor the proposition. Explain why it isn’t necessary to carry out a 
significance test in this setting. 


Mendel and the peas Gregor Mendel (1822-1884), an Austrian monk, is considered the 
father of genetics. Mendel studied the inheritance of various traits in pea plants. One such 
trait is whether the pea is smooth or wrinkled. Mendel predicted a ratio of 3 smooth peas 
for every 1 wrinkled pea. In one experiment, he observed 423 smooth and 133 wrinkled 
peas. Assume that the conditions for inference are met. 


a. State appropriate hypotheses for testing Mendel’s claim about the true proportion of 
smooth peas. 


b. Calculate the standardized test statistic and P-value. 


c. Interpret the P-value. What conclusion would you make? 


50. Spinning heads? When a fair coin is flipped, we all know that the probability the coin 
lands on heads is 0.50. However, what if a coin is spun? According to the article “Euro 
Coin Accused of Unfair Flipping” in the New Scientist, two Polish math professors and 
their students spun a Belgian euro coin 250 times. It landed heads 140 times. One of the 
professors concluded that the coin was minted asymmetrically. A representative from the 
Belgian mint indicated the result was just chance. Assume that the conditions for inference 
are met. 


a. State appropriate hypotheses for testing these competing claims about the true 
proportion of spins that will land on heads. 


b. Calculate the standardized test statistic and P-value. 
c. Interpret the P-value. What conclusion would you make? 


51. pg 577 |») Teen drivers A state’s Division of Motor Vehicles (DMV) claims that 60% 


of all teens pass their driving test on the first attempt. An investigative reporter examines 
an SRS of the DMV records for 125 teens; 86 of them passed the test on their first try. Is 
there convincing evidence at the a=0.05@ = 0.05 significance level that the DMV’s claim 
is incorrect? 


52. We want to be rich In a recent year, 73% of first-year college students responding to a 
national survey identified “being very well-off financially” as an important personal goal. 
A state university finds that 132 of an SRS of 200 of its first-year students say that this 
goal is important. Is there convincing evidence at the a=0.05a = 0.05 significance level 
that the proportion of all first-year students at this university who think being very well- 
off is important differs from the national value of 73%? 


53. Teen drivers Refer to Exercise 51. 


a. Construct and interpret a 95% confidence interval for the true proportion p of all teens 
in the state who passed their driving test on the first attempt. Assume that the 
conditions for inference are met. 


b. Explain why the interval in part (a) provides more information than the test in Exercise 
ol. 


54. We want to be rich Refer to Exercise 52. 


a. Construct and interpret a 95% confidence interval for the true proportion p of all first- 
year students at the university who would identify being very well-off as an important 
personal goal. Assume that the conditions for inference are met. 


b. Explain why the interval in part (a) provides more information than the test in Exercise 
oz. 


55. Do you Tweet? The Pew Internet and American Life Project asked a random sample of 


56. 


o7. 


58. 


U.S. adults, “Do you ever ... use Twitter or another service to share updates about 
yourself or to see updates about others?” According to Pew, the resulting 95% confidence 
interval is (0.123, 0.177).44 Based on the confidence interval, is there convincing evidence 
that the true proportion of U.S. adults who would say they use Twitter or another service 
to share updates differs from 0.17? Explain your reasoning. 


Losing weight A Gallup poll found that 59% of the people in its sample said “Yes” when 
asked, “Would you like to lose weight?” Gallup announced: “For results based on the total 
sample of national adults, one can say with 95% confidence that the margin of (sampling) 
error is +3+3 percentage points.”!2 Based on the confidence interval, is there convincing 
evidence that the true proportion of U.S. adults who would say they want to lose weight 
differs from 0.55? Explain your reasoning. 


Reporting cheating What proportion of students are willing to report cheating by other 
students? A student project put this question to an SRS of 172 undergraduates at a large 
university: “You witness two students cheating on a quiz. Do you go to the professor?” 
The Minitab output shows the results of a significance test and a 95% confidence interval 


based on the survey data.42 


& Session 


Test and Cl for One Proportion 


a. Define the parameter of interest. 
b. Check that the conditions for performing the significance test are met in this case. 
c. Interpret the P-value. 


d. Do these data give convincing evidence that the population proportion differs from 


0.15? Justify your answer with appropriate evidence. 


Teens and sex The Gallup Youth Survey asked a random sample of U.S. teens aged 13 to 
17 whether they thought that young people should wait until marriage to have sex.14 The 
Minitab output shows the results of a significance test and a 95% confidence interval 
based on the survey data. 


@ Session 


Test and CI for One Proportion 


;} p not = 0.5 


N Sample p 95% C Z-Value P-Ve 


439 0.560364 (0.513935, 0.606794) 2453 


a. Define the parameter of interest. 
b. Check that the conditions for performing the significance test are met in this case. 
c. Interpret the P-value. 


d. Do these data give convincing evidence that the actual population proportion differs 
from 0.5? Justify your answer with appropriate evidence. 


Multiple Choice Select the best answer for Exercises 59-62. 


59. After once again losing a football game to the archrival, a college’s alumni association 
conducted a survey to see if alumni were in favor of firing the coach. An SRS of 100 
alumni from the population of all living alumni was taken, and 64 of the alumni in the 
sample were in favor of firing the coach. Suppose you wish to see if a majority of all 
living alumni is in favor of firing the coach. The appropriate standardized test statistic is 


a. 
0.64—0.5 


0.64(0.36 
z=0.64-0.50.64(0.36)100 100 


0.64—0.5 


0.64(0.36 
t=0.64—0.50.64(0.36)100 100 


, __0-64-0.5 
0.5(0.5 
z=0.64-0.50.5(0.5)100 100 


0.64—0.5 


==. 


/0.64(0.36) 
7=0.64-0.50.64(0.36)64 V 64 


—_—:-— 


/0.5(0.5) 
2=0.5-0.640.5(0.5)100  \ 100 


60. Which of choices (a) through (d) is not a condition for performing a significance test about 
a population proportion p? 


a. The data should come from a random sample from the population of interest. 
b. Both np0”Po and n(1—-p0)" (1— Po) should be at least 10. 


c. If you are sampling without replacement from a finite population, then you should 
sample less than 10% of the population. 


d. The population distribution should be approximately Normal, unless the sample size is 
large. 


e. All of the above are conditions for performing a significance test about a population 
proportion. 


61. The standardized test statistic for a test of HO: p=0.4/40 : p = 9-4 versus Ha: p~0.4 
Hy: p# 0.4 is 7=7 432 = 2.43, This test is 


a. not significant at either a=0.05e = 0.05 or a=0.01a = 0.01, 
b. significant at a=0.05a = 0.05, but not at a=0.01a = 0.01, 
c. significant at o=0.010 = 0.01, but not at a=0.05a = 0.05, 
d. significant at both a=0.05a = 0.05 and a=0.01a0 = 0.01, 


e. inconclusive because we don’t know the value of p/P. 


62. Which of the following 95% confidence intervals would lead us to reject HO: p=0.30 
Hp : p = 0.30 in favor of Ha: p#0.304e : P # 9-30 at the 5% significance level? 


a. (0.19, 0.27) 
b. (0.24, 0.30) 
c. (0.27, 0.31) 
d. (0.29, 0.38) 


e. None of these 
Recycle and Review 


63. Packaging DVDs (6.2, 5.3) A manufacturer of digital video discs (DVDs) wants to be 
sure that the DVDs will fit inside the plastic cases used as packaging. Both the cases and 
the DVDs are circular. According to the supplier, the diameters of the plastic cases vary 
Normally with mean p=5.3/ = 5-3 inches and standard deviation o=0.01¢ = 0.01 inch. 
The DVD manufacturer produces DVDs with mean diameter =5.26/ = 5-26 inches. 
Their diameters follow a Normal distribution with o=0.02¢ = 0.02 inch. 


a. Let X =the diameter of a randomly selected case and Y = the diameter of a randomly 
selected DVD. Describe the shape, center, and variability of the distribution of the 
random variable X-Y*X — Y. What is the importance of this random variable to the 
DVD manufacturer? 


b. Calculate the probability that a randomly selected DVD will fit inside a randomly 
selected case. 


c. The production process runs in batches of 100 DVDs. If each of these DVDs is paired 
with a randomly chosen plastic case, find the probability that all the DVDs fit in their 
cases. 


64. Cash to find work? (4.2) Will cash bonuses speed the return to work of unemployed 
people? The Illinois Department of Employment Security designed an experiment to find 
out. The subjects were 10,065 people aged 20 to 54 who were filing claims for 
unemployment insurance. Some were offered $500 if they found a job within 11 weeks 
and held it for at least 4 months. Others could tell potential employers that the state would 


pay the employer $500 for hiring them. A control group got neither kind of bonus.42 


a. Describe a completely randomized design for this experiment. 
b. Explain how you would use a random number generator to assign the treatments. 


c. What is the purpose of the control group in this setting? 


SECTION 9.3 Tests About a Population Mean 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Normal/Large Sample conditions for 
performing a significance test about a population mean. 


Calculate the standardized test statistic and P-value for a test about a population 
mean. 


Perform a significance test about a population mean. 


Use a confidence interval to make a conclusion for a two-sided test about a 
population parameter. 


Interpret the power of a significance test and describe what factors affect the 
power of a test. 


Y ou learned how to construct a confidence interval for a population mean in Section 8.3. Now 
we’ll examine the details of testing a claim about a population mean pF. 


Carrying Out a Significance Test for pu 


In an earlier example, a company claimed to have developed a deluxe AAA battery that lasts 
longer than its regular AAA batteries. Based on years of experience, the company knows that 
its regular AAA batteries last for 30 hours of continuous use, on average. To test the 
company’s claim, we want to perform a test at the a=0.05a = 0.05 significance level of 


HO: p=3040: pt = 30 
Ha: u>30Ha: pt > 30 


where pF is the true mean lifetime (in hours) of the deluxe AAA batteries. Our next step is to 
check that the conditions for performing this significance test are met. 


CHECKING CONDITIONS In Chapter 8, we introduced conditions that should be met before 
we construct a confidence interval for a population mean. We called them Random, 10%, and 
Normal/Large Sample. These same conditions must be verified before performing a 
significance test about a population mean. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE TEST ABOUT A 
MEAN 


e Random: The data come from a random sample from the population of interest. 
o 10%: When sampling without replacement, n<0.10Nn < 0.10N, 


e Normal/Large Sample: The population has a Normal distribution or the sample size is 
large (n>30)(" > 30) Tf the population distribution has unknown shape and n<30n < 30, 
use a graph of the sample data to assess the Normality of the population. Do not use t 
procedures if the graph shows strong skewness or outliers. 


AP® EXAM TIP 


It is not enough just to make a graph of the data on your calculator when assessing 


Normality. You must sketch the graph on your paper and make an appropriate 
comment about it to receive credit. You don’t have to draw more than one graph—a 
single appropriate graph will do. 


If the sample size is large (n>30)(" = 30) the Normal/Large Sample condition is met. This 
condition is more difficult to check if the sample size is small (n<30)(" < 30) In that case, we 
have to examine a graph of the sample data to see if it is reasonable to believe that the 


population distribution is Normal. 


EXAMPLE | Better batteries (>) 
Checking conditions 


zentilia/Getty Images 


PROBLEM: Here are the lifetimes (in hours) of the 15 deluxe AAA batteries from the 
company’s simple random sample: 


17 32 ee 45 30 36 1 27 
37 47 35 33 44 22 oe 


Check if the conditions for performing the significance test are met. 


SOLUTION: 


© Random: SRS of 15 deluxe AAA batteries. / 
© 10%: Assume that 15 is less than 10% of all the company’s deluxe AAA batteries. / 
e Normal/Large Sample: The dotplot does not show strong skewness or outliers. / 


» & @ 060 eee eee ¢ 
15 20 25 30 35 40 45 50 55 
Battery life (h) 


Because the graph shows that there are no outliers or strong skewness in the sample, 


it is plausible that the population distribution of deluxe AAA battery lifetimes is Normal. 


FOR PRACTICE, TRY EXERCISE 65 


We used a dotplot to check the Normal/Large Sample condition in the example because it is 
an easy graph to make by hand. You can also make a stemplot, histogram, boxplot, or Normal 
probability plot to check this condition. Figure 9.7 shows all four of these graphs for the battery 
lifetime data. The stemplot, histogram, and boxplot are roughly symmetric and have no 
outliers. The Normal probability plot is fairly linear, as we would expect if the data came from 
a Normally distributed population of battery lifetimes. 
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FIGURE 9.7 (a) A stemplot, (b) a histogram, (c) a boxplot, and (d) a Normal probability plot of 


the lifetimes of a simple random sample of 15 deluxe AAA batteries. None of the graphs shows 
any strong skewness or outliers, so it is plausible that the population distribution of deluxe AAA 
battery lifetimes is Normal. 


CALCULATIONS: STANDARDIZED TEST STATISTIC AND P-VALUE In the “Better 
batteries” example, the sample mean lifetime for the SRS of 15 deluxe batteries is x =33.93 

@ = 33.93 hours and the standard deviation is sx=9.825* = 9-82 hours. Because the sample 
mean of 33.93 hours is greater than 30 hours, we have some evidence against HO: p=30 

Ho : # = 30 and in favor of Ha: u>30Aa :# > 30 But do we have convincing evidence that the 
true mean lifetime of the company’s new AAA batteries is greater than 30 hours? To answer 
this question, we have to know how likely it is to get a sample mean of 33.93 hours or more by 
chance alone when the null hypothesis is true. As with proportions, we will calculate a 
standardized test statistic and a P-value to find out. 


When performing a significance test, we do calculations assuming that the null hypothesis 
HO4o is true. The standardized test statistic measures how far the sample result diverges from 
the null parameter value, in standardized units. As before, 


standardized test statistic=statistic - parameterstandard deviation of statistic 
. oh oe statistic — parameter 
standardized test statistic ={———_—_—_— 
standard deviation of statistic 

For a test of HO: p=0/0 : # = 0, our statistic is the sample mean x . The standard 
deviation of the sampling distribution of x = is 


oO 
o> =— 
ox =on vn 
In an ideal world, our standardized test statistic would be 
t—Ug 
LZ 
z=x —y0on vn 


When the Normal/Large Sample condition is met, the standardized test statistic z can be 
modeled by the standard Normal distribution. We could then find the P-value from a standard 
Normal distribution if the Normal/Large Sample condition is met. 


There are only a few real-world situations in which we might know the population standard 


deviation o. If we do, then we can calculate P-values using Table A or technology. The TI-83/84’s 
Z-Test option in the TESTS menu is designed for this special situation. 


Because the population standard deviation o is almost always unknown, we use the sample 
standard deviation sx*= in its place. The resulting standardized test statistic has the standard 
error of x £ in the denominator and is denoted by t (you will see why shortly). So the formula 
becomes 


t=x —yOsxn Mi 


The standardized test statistic for the “Better batteries” example is therefore 


_ 33.93-30 
9.82 


t=33.93-309.8215=1.55 V15 


= 1.55 


When the Normal/Large Sample condition is met, the standardized test statistic 


t=x —pO0sxn 


can be modeled by at distribution. As you leamed in Section 8.3, we specify a particular t 
distribution by giving its degrees of freedom (df). When we perform inference about a 
population mean J using a t distribution, the appropriate degrees of freedom are found by 
subtracting 1 from the sample size n, making df=n-—14f = n — 1, 


DEFINITION t distribution 


A t distribution is described by a symmetric, single-peaked, bell-shaped density 
curve. Any t distribution is completely specified by its degrees of freedom (df). 
When performing inference about a population mean based on a random sample 
of size n when the population standard deviation o7 is unknown, use at 
distribution with df=n-1¢f =" — 1, 


Figure 9.8 compares the density curves of the standard Normal distribution and the t 
distributions with 2 and 9 degrees of freedom. The figure illustrates these facts about the t 
distributions: 


t distributions have 
more area in the tails 
than the standard 

Normal distribution. 


—-— 1,2 degrees 


of freedom 


~~~ 1,9 degrees 
of freedom 

— Standard 

Normal 
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FIGURE 9.8 Density curves for the ft distributions with 2 and 9 degrees of freedom and the 
standard Normal distribution. All are symmetric with center 0. The ¢ distributions have more 
variability and a slightly different shape than the standard Normal distribution. 


e The ¢ distributions are similar in shape to the standard Normal distribution. They are 
symmetric about 0, single-peaked, and bell-shaped. 


e The ¢t distributions have more variability than the standard Normal distribution. It is more 
likely to get an extremely large value of t (say, greater than 3) than an extremely large value 
of z because the t distributions have more area in the tails of the distribution. 


e As the degrees of freedom increase, the t distributions approach the standard Normal 
distribution. 


We can use Table B to find a P-value from the appropriate t distribution when performing a 
test about a population mean. In the “Better batteries” example, we are testing 


HO: p=30/40: # = 30 
Ha: p>30Ha: p> 30 


where p=! ~ the true mean lifetime (in hours) of the company’s deluxe AAA batteries. An 
SRS of n=15n = 15 batteries yielded an average lifetime of x =33.93% = 33.93 hours and a 
standard deviation of sx=9.82%* = 9-82 hours. The P-value is the probability of getting a result 
as large as or larger than x =33.93% = 33.93 just by chance when HO: p=30/40 : # = 39 js true. 
In symbols, P-value=P(x >33.93|1=30)” —value = P(z > 33.93|4 = 30) Earlier, we calculated 
the standardized test statistic to be t=1.55t = 1.55, So we estimate the P-value by finding 
P(t=1.55)? (t = 1-55) in a ¢ distribution with df=15-1=144f = 15 — 1 = 14. The shaded area in 
Figure 9.9 shows this probability. 


t distribution j 
with df=14 / \ 


/ \ P-value 
/ . PUELSS) 


t= 555 


FIGURE 9.9 The shaded area shows the P-value for the “Better batteries” example as the area 
to the right of t=1.55¢ = 1.55 in a t distribution with 14 degrees of freedom. 


We can find this P-value using Table B. Go to the df=14df = 14 row. The t statistic falls 
between the values 1.345 and 1.761. If you look at the top of the corresponding columns in 
Table B, you’ll find that the “Upper-tail probability p” is between 0.10 and 0.05. (See the 
excerpt from Table B.) Because we are looking for P(t21.55)P (t> 1.55) this is the probability 
we seek. That is, the P-value for this test is between 0.05 and 0.10. 


Upper-tail probability p 


df 0.10 0.05 0.025 
13 1.350 1.771 2.160 
14 1.345 1760 2.145 
15 1.341 1.753 2.134 

80% 90% 95% 


Confidence level C 


As you can see, Table B gives an interval of possible P-values for a significance test. We 
can still draw a conclusion from the test in much the same way as if we had a single 
probability. Let’s illustrate using the “Better batteries” example. Because the P-value of 
between 0.05 and 0.100.10 is greater than a=0.05a = 0.05, we fail to reject HO40. We don’t 
have convincing evidence that the true mean lifetime of the company’s deluxe AAA batteries is 
greater than 30 hours. 

Table B has two other limitations for finding P-values. 


e The table shows probabilities for only positive values of t. To find a P-value for a negative 
value of t, we use the symmetry of the ¢ distributions. For example, P(t=1.55)=P(t<—1.55) 
P(t > 1.55) = P(t < —1.55) when using a ¢ distribution with a particular df. 


e The table includes probabilities only for t distributions with degrees of freedom from 1 to 30 


and then skips to df=40, 50, 60, 80, 100df = 40, 50, 60, 80, 100° and 1000. (The bottom row 
gives probabilities for df=oodf = 00, which corresponds to the standard Normal distribution.) 
og If the df you need isn’t provided in Table B, use the next lower df that is available. 
It’s not fair “rounding up” to a larger df, which is like pretending that your sample size is 
larger than it really is. Doing so would give you a smaller P-value than is true and would 
make you more likely to incorrectly reject HO/0 when it’s true (i.e., make a Type I error). 
Of course, “rounding down” to a smaller df will give you a larger P-value than is true, which 
makes you more likely to commit a Type II error! 


The next example shows how to deal with both of these issues. 


EXAMPLE | Two-sided tests (>) 
Calculating the standardized test statistic and P-value 


PROBLEM: Suppose you want to perform a test of HO: p=5/20 : # = 95 versus Ha: HA~5 
Ha: # ¥ 5 at the a=0.01a = 0.01 significance level. A random sample of size n=37n = 37 
from the population of interest yields x =4.81% = 4.81 and sx=0.36552 = 9.365, Assume 
that the conditions for carrying out the test are met. 


a. Explain why the sample result gives some evidence for the alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 
SOLUTION: 


a. The sample mean ts x7 = 4.81% = 4.81, which is not equal to 5 (as suggested by Hatta), 


t= SS = = 3.17 


standardized test statistic=statistic — parameterstandard deviation of statistict=x —pOsxn 
“ = 3% statistic — parameter 

standardized test statistic ={-—-—_——_____ 

standard deviation of statistic 


= — #0 
$y 


Jn 


t 


t distribution 
with df = 36 


Upper-tail probability p 


df 0.005 0.0025 0.001 
29 2.756 3.038 3.396 
30 2.750 3.030 3.385 
40 2.704 PRT 3.307 

99% 99.5% 99.8% 


Confidence level C 
Using Table B: Because df = B= | = 364 f = 37 —1= 36 js not available on the table, use df 
es 304 f =30 
P(t>3.17)P (t > 3.17) §5 between 0.001 and 0.0025. 


(Peete or t>3.17) (t <—3.17ort > 3.17) 
2 (0.0025) = 0.005 


is between 2(0.001) = 0.0022 (9-001) = 0.002 gn 


2(0.0025) = 0.005 
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Given the limitations of Table B, our advice is to use technology to find P-values when 
carrying out a significance test about a population mean. 


21. Technology Corner | COMPUTING P-VALUES FROM t 
DISTRIBUTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


You can use the tcdf command on the TI-83/84 to calculate areas under a t distribution 
curve. The syntax is tcdf(lower bound, upper bound, df). 


Let’s use the tcdf command to compute the P-values from the last two examples. 


Better batteries: To find P(t > 1.55)” (t > 1.55) 


e Press (DISTR) and choose tcdf(.OS 2.55 or later: In the dialog box, enter 
these values: lower: 1.55, upper: 1000, df:14, choose Paste, and then press [ENTER|. 
Older OS: Complete the command tcdf(1.55,1000,14) and press [ENTER], 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


tcdf (1.55,1900,14) 
@.9717235647 


Two-sided test: To find the P-value for the two-sided test with df=36df = 36 and t=—3.17 
t = —3.17, use the command tcdf (lower: -1000~1900, upper: -3.17—3-17, df:36) and 
multiply the result by 2. 


tcdf ( -1000,. -3.17,36)%*2 
@.903198986S5 


eee eee ee eee ee eee eee eee ee ee eee ee eee ee eee) 


CHECK YOUR UNDERSTANDING 


fy 


y The makers of Aspro brand aspirin want to be sure that their tablets contain the right 
amount of active ingredient (acetylsalicylic acid). So they inspect a random sample of 30 
tablets from a batch in production. When the production process is working properly, Aspro 
tablets contain an average of p=3204 ~ 320 milligrams (mg) of active ingredient. The 
amount of active ingredient in the 30 selected tablets has mean 319 mg and standard 


deviation 3 mg. 

1. State appropriate hypotheses for a significance test in this setting. 
2. Check that the conditions are met for carrying out the test. 

3. Calculate the standardized test statistic and P-value. 
4 


. What conclusion would you make? 


Putting It All Together: One-Sample t Test for py 


We have shown you how to complete each of the four steps in a test about a population mean. 
Let’s reflect on how the pieces fit together in a test of HO: p=p0/40 : # = Ho, We start by 
assuming that the null hypothesis is true. Because we usually don’t know the population 
standard deviation o, we use the sample standard deviation sx*z to estimate it. If we 
standardize the statistic x % by subtracting its mean and dividing by its standard error, we get 
the standardized test statistic. There are three conditions that must be met for this formula to be 
valid—one for each of the three components in the formula. 


. The Random condition helps ensure that x —07 ~ 0 is a good estimate for the difference 
between the true value of p and the null value p0M0. 

’. The Normal/Large Sample condition allows us to model the distribution of the standardized 
test statistic t using a ¢ distribution with n-1” ~ 1 degrees of freedom. Then we can obtain P- 
values from this distribution using Table B or technology. 


}. The 10% condition allows us to use the familiar formula for the standard deviation of the 
sampling distribution of x % (with sx$z replacing o7) when we are sampling without 
replacement from a finite population. 


Here is a summary of the details for a one-sample t test for a mean. 


ONE-SAMPLE t TEST FOR A MEAN 


Suppose the conditions are met. To test the hypothesis HO: p=po/t 0+ # = #0, compute the 
standardized test statistic 


rp 
t= 0 


Sx 


t=x —pOsxn vn 


Find the P-value by calculating the probability of getting a t statistic this large or larger in 
the direction specified by the alternative hypothesis Ha/« in a t distribution with df=n-1 
df=n-—1 


H,: > Ly Hy: < wy H,: UF Uy 


Now we are ready to test a claim about a population mean. Once again, we follow the four- 


step process. 


EXAMPLE | Healthy streams (>) 
Performing a significance test about py 


blackestockphoto/Getty Images 


PROBLEM: The level of dissolved oxygen (DO) in a stream or river is an 
45=4 important indicator of the water’s ability to support aquatic life. A researcher 
measures the DO level at 15 randomly chosen locations along a stream. Here are the 


results in milligrams per liter (mg/l): 


4.53 5.04 S20 5.20 4.13 5:50 4.83 4.40 
5.42 6.38 4.01 4.66 2.87 Sata Seale) 


An average dissolved oxygen level below 5 mg/l] puts aquatic life at risk. 


a. Do the data provide convincing evidence at the a=0.05e = 0.05 significance level that 
aquatic life in this stream is at risk? 


b. Given your conclusion in part (a), which kind of mistake—a Type I error or a Type II 
error—could you have made? Explain what this mistake would mean in context. 


SOLUTION: 


a. STATE: We want to test 
Ho: u=s Ho Sh 
Ha: p<sla? <5 


where ps5 9 the true mean dissolved oxygen (DO) level in the stream, using a =5 0.05. 
a = 50.05. 


Follow the four-step process! 


PLAN: One-sample t test for pH 


There are an infinite number of possible locations along the stream, so it isn’t 


necessary to check the 10% condition. 


© Random: The researcher measured the DO level at 1 Ts randomly chosen locations. 4 


® Normal/Large Sample: The histogram looks roughly symmetric and shows no outliers. 


DO level (mg/) 


Because the histogram shows no strong skewness or outliers in the sample, it is 


plausible that the population distribution of dissolved oxygen levels in the stream is 
Normal. 


Enter the data into your calculator to make a graph and to calculate 1-Var Stats. 


DO: 


pe SS ge 


0.9396 


@ t = 4.771 — 50.939615 =-0.94 VIB 
e P-value:df=15-1=14P value: df= 15—-1=14 


The sample result gives some evidence in favor of Hz because x = 4.771 <5. 


Using Table B: P-value ts between 0.15 and 0.20. 
Using technology: tedf (lower: -1000~1000 | upper: -0.94—9.94 df: 14) = 0.1816 


CONCLUDE: Because the P-value of 0.1816>0 = 0.05,0-1816 > a = 0.05, we fail to reject 
Hoo. We don’t have convincing evidence that the true mean DO level in the stream is less 


than 5 mg/ (, 


b. Because we failed to reject Hoo in part (a), we could have made a T ype JI error 
(failing to reject Holo when Hate is true). af we did, then the true mean dissolved 
oxygen level pl in the stream is less than 5 mg, but we didn’t find convincing evidence 
with our significance test. That would imply aquatic life in this stream ts at risk, but 
we weren't able to detect that fact. 


t distribution 
with df = 14 
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Because the t procedures are so common, all statistical software packages will do the 
calculations for you. Figure 9.10 shows the output from Minitab for the one-sample t test in the 
preceding example. Note that the results match! 


@ Session 


One-Sample T: DO (mg/L) 


— + 
Test of mu 


Variable N Mean StDev 
(mg/L) 5 4.771 0.940 


FIGURE 9.10 Minitab output for the one-sample t test from the dissolved oxygen example. 


You can also use your calculator to carry out a one-sample t test. But be sure to read the 
AP® Exam Tip at the end of the Technology Corner below. 


22. Technology Corner | PERFORMING A ONE-SAMPLE t TEST 
FOR A MEAN (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


You can perform a one-sample ¢ test using either raw data or summary statistics on the TI- 
83/84. Let’s use the calculator to carry out the test of HO: p=5/4o : # = 5 versus Ha: <5 
Hq : # <5 from the dissolved oxygen example. Start by entering the sample data in L1. 
Then, to do the test: 


e Press|STAT|, choose TESTS and T-Test. 
e Adjust your settings as shown. 


If you select “Calculate,” the screen below left appears. 
The standardized test statistic is t=-0.94¢ =—9.94 and the P-value is 0.1809. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Inet :BEbe) Stats 
He:5 
List:Li 


Frea:1 
LU: Ue He 
Color: BLUE 


Calculate Draw 


If you specify “Draw,” you see a t distribution curve (df= 14)(df = 14) with the lower tail 


shaded, along with the standardized test statistic and P-value. 


u<S 


t= -@. 9425562016 
P=@. 1809448972 
x=4. 771333333 
Sx=0. 9395961645 
n=15 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


ts 
t="6.9426 P=0.1809 


Note: If you are given summary statistics instead of the original data, you would select the 
option “Stats” instead of “Data” in the first screen and enter the summary statistics. 


AP® EXAM TIP 


Remember: If you give just calculator results with no work, and one or more values 


are wrong, you probably won't get any credit for the “Do” step. If you opt for the 
calculator- only method, name the procedure (one-sample t test for 7) and report 
the standardized test statistic (t=-0.94),(¢ =—9-94) ; degrees of freedom (df=14), 
(df= 14), and P-value (0.1809). 


CHECK YOUR UNDERSTANDING 


. 


”, A teacher suspects that students at his school are getting less than the recommended 8 
hours of sleep a night, on average. To test his belief, the teacher asks a random sample of 28 
students, “How much sleep did you get last night?” Here are the data (in hours): 


9 6 8 6 8 8 6 6.5 6 7 a 4 3 4 
5 6 Bal 6 3 6 6 10 7 8 4.5 9 it vi 


Do these data provide convincing evidence at the a=0.05e = 0.05 significance level in 
support of the teacher’s suspicion? 


Two-Sided Tests and Confidence Intervals 


You learned in Section 9.2 that a confidence interval gives more information than a 
significance test does—it provides the entire set of plausible values for the parameter based on 
the data. The connection between two-sided tests and confidence intervals is even stronger for 


means than it was for proportions. That’s because both inference methods for means use the 
standard error of x @ in the calculations: 


standardized test statistic:t=x —yOsxn confidence interval:x +t*sxn 
t—p = Sx 

confidence interval : % + t* — 
Vi 7 


standardized test statistic : t = 


Jn 
The link between two-sided tests and confidence intervals for a population mean allows us 
to make a conclusion directly from a confidence interval. 


e If a95% confidence interval for p“ does not capture the null value p00, we can reject HO: 
u=p040 : # = Ho in a two-sided test at the a=0.05a = 0.05 significance level. 


e If a 95% confidence interval for np“ captures the null value 10°, then we should fail to 
reject HO: p=p0/0 : # = #0 in a two-sided test at the a=0.05a = 0.05 significance level. 


The same logic applies for other confidence levels, but only for a two-sided test. 


EXAMPLE | Juicy pineapples (>) 
Confidence intervals and two-sided tests 


Maks Narodenko 
/Shutterstock.com 


PROBLEM: At the Hawaii Pineapple Company, the mean weight of the pineapples 
harvested from one large field was 31 ounces last year. A different irrigation system was 
installed in this field after the growing season. Managers wonder if this change will affect 
the mean weight of future pineapples grown in the field. To find out, they select and weigh a 
random sample of 50 pineapples from this year’s crop. 


a. State an appropriate pair of hypotheses for a significance test in this setting. Be sure to 
define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. A 95% confidence interval for the mean weight of all pineapples grown in the field this 
year is (31.255, 32.616). Based on this interval, what conclusion would you make for a 


test of the hypotheses in part (a) at the a=0.05a = 0.05 significance level? 


d. Can we conclude that the different irrigation system caused a change in the mean weight 
of pineapples produced? Explain your answer. 


SOLUTION: 


a. We want to test 
Ho: = 310 : p= 3l 
Ha: ux31 Ha > wx 3l 


where p = the true mean weight (in ounces) of all pineapples grown in the field this 
year. 


© Random: Random sample of 50 pineapples from this year’s crop. / 
° 10%: Tt is safe to assume that 50<10,20 < 10, of all pineapples ina large field. / 
O Normal/Large Sample:n = 50>30,.n = 50 > 30.4 
c. The 95% confidence interval does not include 31 as a plausible value, so we would reject 


Holo. We have convincing evidence that the true mean weight of all pineapples grown 
this year is not 31 ounces. 


d. No: this was not a randomized comparative experiment, so we cannot infer causation. 
It is possible that other things besides the irrigation system changed from last year’s 
i ‘d J y ‘d yy 
growing season. Maybe the weather was different this year, and that’s why the 
pineapples have a different mean weight than last year. 


Recall from Chapter 4 that only well-designed experiments allow us to establish 


cause-and-effect conclusions. 
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Minitab output for a significance test and confidence interval based on the pineapple data is 
shown here. Notice that the weights of the 50 randomly selected pineapples from this year’s 
crop had a mean of x _=31.935@ = 31.935 ounces and a standard deviation of sx=2.394 
8x = 2.394 ounces. The standardized test statistic is 

EQ 31.935—31 


t= ——_—__* = 2.76 
‘Sx 2.39 


t=x —p0sxn=31.935-312.39450=2.76 Vv” 50 


The corresponding P-value from a t distribution with 49 degrees of freedom is 0.008. Because 
the P-value of 0.008<a=0.050.008 < a = 0.05, we would reject HO/0. This is consistent with 


our decision based on the 95% confidence interval in the example. 


& Session 


One-Sample T 


: Weight (oz) 
f mu = 31 vs not = 31 


OL RW = Al 


Would a 99% confidence interval for p/“ include 31 ounces as a plausible value for the 
parameter? Only if a two-sided test would fail to reject HO: y=3140 7M=31 ata 1% 
significance level. Because the P-value of 0.008<a=0.019.008 < a = 0.01, we would reject HO 
Ho. We once again have convincing evidence that the mean weight of all pineapples produced 
this year is not 31 ounces. So the 99% confidence interval would not contain 31. You can check 
that the interval is (31.028, 32.842). 


Think About It 


IS THERE A CONNECTION BETWEEN ONE-SIDED TESTS AND CONFIDENCE 
INTERVALS FOR A POPULATION MEAN? As you might expect, the answer is yes. 
But the link is more complicated. Consider a one-sided test of HO: u=1040 : # = 10 versus 
Ha: p>104e : # > 10 based on an SRS of 30 observations. With df=30-1=29 

df = 30 — 1 = 29 Table B says that the test will reject HOM0 at a=0.05a = 0.05 if the 
standardized test statistic t is greater than 1.699. For this to happen, the sample mean x & 
would have to exceed 0=10H0 = 10 by more than 1.699 standardized units. 

Table B also shows that t*=1.699t" = 1.699 is the critical value for a 90% confidence 
interval. That is, a 90% confidence interval will extend 1.699 standardized units on either 
side of the sample mean x £. If x % exceeds 10 by more than 1.699 standardized units, the 
resulting interval will not include 10. And the one-sided test will reject HO: p=10 
Ho : # = 10. There’s the link: our one-sided test at a=0.05a = 0.05 gives the same 
conclusion about HO/0 as a 90% confidence interval for p/. 


CHECK YOUR UNDERSTANDING 


i. 


y According to the National Center for Health Statistics, the mean systolic blood pressure 
for males 35 to 44 years of age is 128. The health director of a large company wonders if this 
national average holds for the company’s middle-aged male employees. So the director 
examines the medical records of a random sample of 72 male employees in this age group 
and records each of their systolic blood pressure readings. 


1. State an appropriate pair of hypotheses for a significance test in this setting. Be sure to 
define the parameter of interest. 


2. Check conditions for performing the test in Question 1. 


3. A 95% confidence interval for the mean systolic blood pressure of all 35- to 44-year-old 
male employees at this company is (126.43, 133.43). Based on this interval, what 
conclusion would you make for a test of the hypotheses in Question 1 at the a=0.05 
a = 0.05 significance level? 


The Power of a Test 


Researchers often perform a significance test in hopes of finding convincing evidence for the 
alternative hypothesis. Why? Because Ha/e states the claim about the population parameter 
that they believe, suppose, or suspect is true. For instance, a drug manufacturer claims that less 
than 10% of patients who take its new drug for treating Alzheimer’s disease will experience 
nausea. To test this claim, researchers want to carry out a test of 


HO: p=0.1040 : p = 9.10 
Ha: p<0.104a :p< 0.10 


where p = the true proportion of patients like the ones in the study who would experience 
nausea when taking the new Alzheimer’s drug. They plan to give the new drug to a random 
sample of Alzheimer’s patients whose families have given informed consent for the patients to 
participate in the study. 

Suppose that the true proportion of subjects like the ones in the Alzheimer’s study who 
would experience nausea after taking the new drug is p=0.08? = 9-98. This means that the 
alternative hypothesis Ha: p<0.104a : P < 9-10 js true. Researchers would make a Type II 
error if they failed to find convincing evidence for Ha/4« based on the sample data. How likely 
is the significance test to avoid a Type II error in this case? We refer to this probability as the 
power of the test. 


DEFINITION Power 


The power of a test is the probability that the test will find convincing evidence for 
H. when a specific alternative value of the parameter is true. 


The power of a test is a conditional probability: power = P(reject HO0| parameter = some 
specific alternative value). In other words, power is the probability that we find convincing 
evidence the alternative hypothesis is true, given that the alternative hypothesis really is true. 
To interpret the power of a test in a given setting, just interpret the relevant conditional 
probability. 

Let’s return to the Alzheimer’s study. Suppose the researchers decide to perform a test of 


HO: p=0.10/0 : p = 9.10 versus Ha: p<0.104a : Pp < 9-10 at the a=0.05a = 0.05 significance 
level based on data from a random sample of 300 Alzheimer’s patients. Advanced calculations 
reveal that the power of this test to detect p=0.08? = 9-08 is 0.29. Interpretation: If the true 
proportion of Alzheimer’s patients like these who would experience side effects when taking 
the new drug is p=0.08? = 9-08 there is a 0.29 probability that the researchers will find 
convincing evidence for Ha: p<0.10/4a : p < 0.10, 


EXAMPLE |Can we tell if the new batteries last longer? (>) 
Interpreting the power of a test 


= 


- — 
al 
tuahlensa/Shutterstock.com 


PROBLEM: Let’s return one last time to the company that developed a new deluxe AAA 
battery that is supposed to last longer than its regular AAA battery. Based on years of 
experience, the company’s regular AAA batteries last for 30 hours of continuous use, on 
average. The company plans to select an SRS of 50 deluxe AAA batteries and use them 
continuously until they are completely drained. Then it will perform a test at the a=0.05 

a = 0.05 significance level of 


Hosp —30o > p= 30 

Ha: p>30Ha:H > 30 
where p= ~ the true mean lifetime (in hours) of the deluxe AAA batteries. The new 
batteries are more expensive to produce, so the company would like to be convinced that 


they really do last longer. The power of the test to detect that p=314 — 31 hours is 0.762. 
Interpret this value. 


SOLUTION: 


af the true mean lifetime of the company’s deluxe AAA batteries is p=314 = 31 hours, there 


1s a 0.762 probability that the company will find convincing evidence for Ha:p+30. 
H, : p+ 30. 


FOR PRACTICE, TRY EXERCISE 85 


The company in the example has a good chance (power=0.762) (Power = 0.762) of rejecting 


HO if the true mean lifetime of its deluxe AAA batteries is 31 hours. That is, P(reject 


HO| U=31)=0.762P (rejectHo| 4 = 31) = 0.762 what’s the probability that the company makes 
a Type II error in this case? It’s P(fail to reject HO| p=31)=1—-0.762=0.238 
Ho| = 31) = 1 — 0.762 = 0.238 We can generalize this relationship between the power of a 


significance test and the probability of a Type II error. 


RELATING POWER AND TYPE II ERROR 


The power of a test to detect a specific alternative parameter value is related to the 
probability of a Type II error for that alternative: 


Power=|—P(Type error) and  P(Type II error)=l—Power 
Power =1— P(Typellerror) and P(Typellerror) = 1 — Power 


The power of the test in the Alzheimer’s study to detect p=0.08? = 9-08 js only 0.29. In other 
words, researchers have a 1-0.29=0.711 — 9.29 = 0.71 probability of making a Type II error by 
failing to find convincing evidence for Ha: p<0.10/40 : Pp < 9-10 when p=0.08? = 9-98, What 
can researchers do to decrease the probability of making a Type II error and increase the power 
of the test? 


WHAT AFFECTS THE POWER OF A TEST? Here is an activity that will help you answer 
this question. 


ACTIVITY Powerful batteries 


In this activity, we will use an applet to investigate the factors that affect the power of a test. 
Let’s return to the battery study from the preceding example. Recall that the company wants 
to perform a test of 


HO: u=3040 > p= 30 
Ha: p>30Ha + pt > 30 


where p= ~ the true mean lifetime (in hours) of the deluxe AAA batteries. The company 
estimates that the standard deviation of the lifetimes for its new batteries is o=30 = 3 hours 
based on a pilot study. 


1. Go to highschool.bfwpub.com/tps6e and launch the Statistical Power applet. Enter the 
values HO: p=30H0 i a 30 Ha: u>30Ha ‘p> 30 o=30 = 3, n=50, a=0.05 
n = 50,a = 0.05 and alternate p=31/ = 31. Confirm that the power of the test is 0.762. 


Alpha (a): 0.05 


=== 28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


alt. u= 31 Power = 0.762 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


e The top curve shows the sampling distribution of the sample mean x # for random 
samples of size n=50n = 50 when HO: p=30H0 : # = 30 is true. We refer to this as the 
null distribution. A value of x @ that falls along the horizontal axis within the yellow 
region would lead us to reject HO: p=30/40 : # = 30, 


e The bottom curve shows the sampling distribution of the sample mean x Z for random 
samples of size n=50n = 50 when p=314 — 31. We refer to this as the alternative 
distribution. The red region represents the power of the test. A value of x that falls 


along the horizontal axis within the red region would lead to a correct rejection of HO: 
u=3040 > p= 30. 


2. Sample size: Change the sample size from n=50n = 50 to n=100n = 100, What happens 
in the top and bottom panels of the applet? Does the power increase or decrease? Explain 
why this makes sense. 


3. Significance level: Reset the sample size to n=50n = 50, 


a. Using the slider, change the significance level to a=0.010 = 0.01, What happens in the 
bottom panel of the applet? Does the power increase or decrease? How about the 
probability of a Type II error? 


b. Make a guess about what will happen if you change the significance level to a=0.10 
a = 0.10, Use the applet to test your conjecture. 


c. Explain what the results in parts (a) and (b) tell you about the relationship between 
Type I error probability, Type II error probability, and power. 


4. Difference between null and alternative parameter value: Reset the sample size to n=50 
n = 50 and the significance level to a=0.05e = 0.05. Will the company be more likely to 
detect that the true mean lifetime of its new batteries is greater than p=30/ = 39 if y=31 
# = 31 hours or p=32H = 32 hours? Use the applet to test your conjecture. 


As Step 2 of the activity confirms, we get better information about the true average lifetime 
of the company’s new AAA batteries from a random sample of 100 batteries than from a 
random sample of 50 batteries. The power of the test to detect that p=314 ~ 31 increases from 
0.762 to 0.954 when the sample size increases from n=50n = 50 to n=100n = 100, 


Will it be easier to reject HOMo if «=0.05a = 0.05 or a=0.102a = 0.10? When ae is larger, 
it is easier to reject HOM40 because the P-value doesn’t need to be as small. Step 3 of the 
activity shows that the power of the test to detect that p=314 — 31 increases from 0.762 to 
0.859 when the significance level increases from 0.05 to 0.10. 

Figure 9.11 reveals an important link between Type I and Type IJ error probabilities. 
Because P(Type I error)=a/” (Typelerror) = a increasing the significance level increases the 
chance of making a Type I error. As the applet shows, this change also increases the power of 
the test. Because P(Type II error)=1—-Power!” (Type Ierror) = 1 — Power higher power means 
a smaller chance of making a Type II error. So increasing the Type I error probability a@ 
decreases the Type II error probability. By the same logic, decreasing the chance of a Type I 


error results in a higher chance of a Type II error. 


P(Type I error) = 0.10 


Alpha (a): 0.1 


be] 
alt. u= 34 Power = 0.859 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


UPDATE 


RESET P(Type Il error) = 1 — 0.859 


=().141 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


Figure 9.11 
Starnes & Tabor, The Practice of Statistics, 6e, © 2018 Bedford, Freeman & Worth High School Publishers 


FIGURE 9.11 The connection between Type | error, Type II error, and power. Because P(Type | 
error)=a,” (Type error) = a, ag the significance level a@ increases: P (Type | error) 
increases, P (Type II error) decreases, and power increases. 


Step 4 of the activity shows that it is easier to detect large differences between the null and 
alternative parameter values than small differences. When n=50n = 50 and a=0.05a@ = 0.05, 
the power of the test to detect p=324 — 32 is 0.999, while the power of the test to detect p=31 
= 31 is only 0.762. 

The difference between the null parameter value and the specific alternative parameter 
value of interest is often referred to as the effect size. For the batteries’ setting with a null value 
of p=30H — 30 hours, it is much easier to detect if the true mean lifetime of the company’s new 
AAA batteries is p=324 — 32 hours (an effect size of 2 hours) than if the true mean lifetime is 
pw=31h = 31 hours (an effect size of 1 hour). 


INCREASING THE POWER OF A SIGNIFICANCE TEST 


The power of a significance test to detect an alternative value of the parameter when HO/0 
is false and Hae is true, based on a random sample of size n and significance level aa, 
will be larger when: 


e The sample size n is larger. 


e The significance level a@ is larger. 


e The null and alternative parameter values are farther apart. 


In addition to these three factors, we can also gain power by making wise choices when 
collecting data. For example, using blocking in an experiment or stratified random sampling 
can greatly increase the power of a test in some circumstances. In an experiment, power will 
increase if you have fewer sources of variability by keeping other variables constant. 


EXAMPLE |Can we tell if the new drug reduces nausea? > 
What affects the power of a test 


Tom Grill/Getty Images 


PROBLEM: The researchers in the Alzheimer’s experiment want to test the drug 
manufacturer’s claim that fewer than 10% of patients who take its new drug for treating 
Alzheimer’s disease will experience nausea. That is, they want to carry out a test of 


HO: p=0.10/40 : p = 0.10 
Ha: p<0.104a : p < 0.10 


where p=? ~ the true proportion of patients like the ones in the study who would experience 
nausea when taking the new Alzheimer’s drug. Earlier, we mentioned that the power of the 
test to detect p=0.08? = 9-98 using a random sample of 300 patients and a significance level 
of a=0.05a = 0.05 js 0.29. 


Determine whether each of the following changes would increase or decrease the power of 
the test. Explain your answers. 


a. Use a=0.010 = 0.01 instead of a=0.05a = 0.05, 
b. If the true proportion is p=0.06? = 9-06 instead of p=0.08P = 9.08 
c. Use n=200n = 200 instead of n=300n = 300, 


SOLUTION: 


a. Decrease; using a smaller significance level makes it harder to reject Hoo when Hata 


is true. 


b. Increase; it is easier to detect a bigger difference between the null and alternative 
dal 
parameter value. 


c. Decrease; a smaller sample size gives less information about the true proportion Pp. 


FOR PRACTICE, TRY EXERCISE 87 


In Chapter 8 , you learned how to calculate the sample size needed for a desired margin of 
error in a confidence interval. When researchers plan to use a significance test to analyze their 
results, they will often calculate the sample size needed for a desired power. For a specific 
study design, the sample size required depends on three factors: 


. Significance level. How much risk of a Type I error—rejecting the null hypothesis when HO 
Ao is actually true—are we willing to accept? If a Type I error has serious consequences, we 
might opt for a=0.01@ = 0.01, Otherwise, we should choose a=0.05e = 0.05 or a=0.10 
a = 0.10, Recall that using a higher significance level would decrease the Type II error 
probability and increase the power. 


’. Effect size. How large a difference between the null parameter value and the actual parameter 
value is important for us to detect? 


3. Power. What probability do we want our study to have to detect a difference of the size we 
think is important? Most researchers insist on a power of at least 0.80 for their significance 
tests. 


Calculating power by hand is possible but unpleasant. It’s better to let technology do the 
work for you. There are many applets available to help determine sample size and to calculate 
the power of a test. 


Sometimes budget constraints get in the way of achieving high power. It can be expensive to 


collect data from a large enough sample of individuals to give a significance test the power that 
researchers desire. 


CHECK YOUR UNDERSTANDING 


i. 


y Can a six-month exercise program increase the total body bone mineral content 
(TBBMC) of young women? A team of researchers is planning a study to examine this 
question. The researchers would like to perform a test of 


HO: u=0/40 7p=0 


Ha: u>0La :pe>d 


where J is the true mean percent change in TBBMC during the exercise program. 


1. The power of the test to detect a mean increase in TBBMC of 1% using a=0.05a = 0.05 
and n=25n = 25 subjects is 0.80. Interpret this value. 


2. Find the probability of a Type I error and the probability of a Type II error for the test in 
Question 1. 


3. Describe two ways that researchers could increase the power of the test in Question 1. 


Using Tests Wisely 


Significance tests are widely used in reporting the results of research in many fields. New drugs 
require significant evidence of effectiveness and safety. Courts ask about statistical significance 
in hearing discrimination cases. Marketers want to know whether a new ad campaign 
significantly outperforms the old one, and medical researchers want to know whether a new 
therapy performs significantly better. In all these uses, statistical significance is valued because 
it points to an effect that is unlikely to occur simply by chance. 

Carrying out a significance test is often quite simple, especially if you use technology. 
Using tests wisely is not so simple. Here are some points to keep in mind when using or 
interpreting significance tests. 


STATISTICAL SIGNIFICANCE AND PRACTICAL IMPORTANCE When a null 
hypothesis of no effect or no difference can be rejected at the usual significance levels (a=0.10 
(a = 0.10 oF g=0,05a = 0.05 or a=0.01)% = 0.01) | there is convincing evidence of a 
difference. But that difference may be very small. When large samples are used, even tiny 
deviations from the null hypothesis will be significant. 

Suppose we’re testing a new antibacterial cream, “Formulation NS,” on a small cut made 
on the inner forearm. We know from previous research that with no medication, the mean 
healing time (defined as the time for the scab to fall off) is 7.6 days. The claim we want to test 
here is that Formulation NS speeds healing. So our hypotheses are 


HO: p=7.64o : # = 7.6 
Ha: <7.6a > p< 7.6 


where p=! ~ the true mean healing time (in days) in the population of college students whose 
cuts are treated with Formulation NS. We will use a 5% significance level. 


A random sample of 250 college students give informed consent to participate in a study 
and apply Formulation NS to their wounds. The mean healing time for these subjects is x =7.5 
& = 7.5 days and the standard deviation is sx=0.98 = 9-9 day. Note that the conditions for 
performing a one-sample t test for p“ are met. We carry out the test and find that t=-1.76 
t = —1.76 and P-value =0.04= 0.04 with df=249df = 249. Because 0.04 is less than a=0.05 


a = 0.05, we reject HOM#0. We have convincing evidence that Formulation NS reduces the 
average healing time. However, this result is not practically important. Having your scab fall 
off one-tenth of a day sooner is no big deal! 


Some people say “not clinically significant” when they mean not practically important. 


Remember the wise saying: Statistical significance is not the same thing as practical 
ying s gasp 


importance. The remedy for attaching too much importance to statistical significance is to pay 
attention to the data as well as to the P-value. The foolish user of statistics who feeds the 
data to a calculator or computer without exploratory analysis will often be embarrassed. 
Plot your data, and examine them carefully. Is the difference you are seeking visible in your 
graphs? If not, ask yourself whether the difference is large enough to be practically important. 
Are there outliers or other departures from a consistent pattern? A few outlying observations 
can produce highly significant results if you blindly apply common significance tests. Outliers 
can also destroy the significance of otherwise convincing data. 


To help evaluate practical importance, give a confidence interval for the parameter in which 
you are interested. A confidence interval provides a set of plausible values for the parameter, 
rather than simply asking if the observed result is too surprising to occur by chance alone when 
HO is true. Confidence intervals are not used as often as they should be, whereas 
significance tests are perhaps overused. 


BEWARE OF MULTIPLE ANALYSES Statistical significance ought to mean that you have 
found a difference that you were looking for. The reasoning behind statistical significance 
works well if you decide what difference you are seeking, design a study to search for it, and 
use a significance test to weigh the evidence you gather. In other settings, significance may 
have little meaning. Here’s one such example. 


Might the radiation from cell phones be harmful to users? Many studies have found little or 
no connection between using cell phones and various illnesses. Here is part of a news account 
of one study: 


A hospital study that compared brain cancer patients and a similar group without 
brain cancer found no statistically significant difference in cell-phone use for the 
two groups. But when 20 distinct types of brain cancer were considered 
separately, a significant difference in cell-phone use was found for one rare 
type. Puzzlingly, however, this risk appeared to decrease rather than increase 
with greater mobile phone use.1® 
Think for a moment. Suppose that the 20 null hypotheses for these 20 significance tests are 
all true. Then each test has a 5% chance of being significant at the 5% level. That’s what 
a=0.05a = 0.05 means: results this extreme occur only 5% of the time just by chance when the 
null hypothesis is true. We expect about 1 of 20 tests to give a significant result just by chance. 
Running one test and reaching the a=0.05@ = 0.05 level is reasonably good evidence that you 


have found something; running 20 tests and reaching that level only once is not. 


Searching data for patterns is certainly legitimate. Performing every conceivable 
significance test on a data set with many variables until you obtain a statistically significant 
result is not. This unfortunate practice is known by many names, including data dredging and 
P-hacking. 


For more on the pitfalls of multiple analyses, do an Internet search for the XKCD comic about jelly 


beans causing acne. 


Section 9.3 Summary 


e The conditions for performing a significance test of HO: p=y0/40 : # = Ho are: 
= Random: The data come from a random sample from the population of interest. 
» 10%: When sampling without replacement, n<0.10Nn < 0.10N, 


« Normal/Large Sample: The population has a Normal distribution or the sample size is 


large (n>30)\" > 30) Tf the population distribution has unknown shape and n<30n < 30, 
use a graph of the sample data to assess the Normality of the population. Do not use t 
procedures if the graph shows strong skewness or outliers. 


e The standardized test statistic for a one-sample t test for a mean is 


t=x —pOsxn Vv 


e When the Normal/Large Sample condition is met, the distribution of this standardized test 
statistic can be modeled by a t distribution with n—-1” ~ 1 degrees of freedom. You can use 
Table B or technology to find the P-value. 


e Confidence intervals provide additional information that significance tests do not—namely, a 
set of plausible values for the parameter p“. A 95% confidence interval for p gives 
consistent results with a two-sided test of HO: p=p0/40 * # = Ho at the a=0.05a = 0.05 
significance level. 


e The power of a test is the probability that the test will find convincing evidence for Ha/t« 
when a specific alternative value of the parameter is true. In other words, the power of a test 
is the probability of avoiding a Type IJ error. For a specific alternative, 

Power=1—P(Type II error)! ower = 1—P(Typellerror) 


e We can increase the power of a significance test by increasing the sample size, increasing the 
significance level, or increasing the difference that is important to detect between the null 
and alternative parameter values (known as the effect size). Wise choices when collecting 
data, such as controlling for other variables and blocking in experiments or stratified random 


sampling, can also increase power. 


e Fora specific study design, there are three factors that influence the sample size required for 
a Statistical test: significance level, effect size, and the desired power of the test. 


e Very small differences can be highly significant (small P-value) when a test is based on a 
large sample. A statistically significant result may not be practically important. 


e Many tests run at once will probably produce some significant results by chance alone, even 
if all the null hypotheses are true. Beware of P-hacking. 


9.3 Technology Corners 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


21. Computing P-values from t distributions Page 590 
22. Performing a one-sample t test for a mean Page 594 


Section 9.3 Exercises 


65. pg 586 |) Attitudes The Survey of Study Habits and Attitudes (SSHA) is a 


psychological test with scores that range from 0 to 200. The mean score for U.S. college 
students is 115. A teacher suspects that older students have better attitudes toward school. 
She gives the SSHA to an SRS of 45 students from the more than 1000 students at her 
college who are at least 30 years of age. The teacher wants to perform a test at the a=0.05 
a = 0.05 significance level of 


HO: p=11540: # = 115 
Ha: p>115 Ma: p> 115 


where p= ~ the mean SSHA score in the population of students at her college who are at 
least 30 years old. Check if the conditions for performing the test are met. 


66. Candy! A machine is supposed to fill bags with an average of 19.2 ounces of candy. The 
manager of the candy factory wants to be sure that the machine does not consistently 
underfill or overfill the bags. So the manager plans to conduct a significance test at the 
a=0.10e = 0.10 significance level of 

HO: p=19.2Mo: # = 19.2 

Ha: #19.2a: ue x 19.2 
where p= ~ the true mean amount of candy (in ounces) that the machine put in all bags 
filled that day. The manager takes a random sample of 75 bags of candy produced that day 


67. 


68. 


and weighs each bag. Check if the conditions for performing the test are met. 


Battery life A tablet computer manufacturer claims that its batteries last an average of 
10.5 hours when playing videos. The quality-control department randomly selects 20 
tablets from each day’s production and tests the fully charged batteries by playing a video 
repeatedly until the battery dies. The quality-control department will discard the batteries 
from that day’s production run if they find convincing evidence that the mean battery life 
is less than 10.5 hours. Here are a dotplot and summary statistics of the data from one day: 


o ° ° se c 
T T | | T T 
LO 10.5 1] 11.5 12 12.5 13 13.5 I4 


Battery life (h) 


n Mean SD Min Q1 Med Q3 Max 
20 11.07 1.097 10 10.3 10.6 11.85 13.9 


a. State appropriate hypotheses for the quality-control department to test. Be sure to 
define your parameter. 


b. Check if the conditions for performing the test in part (a) are met. 


Paying high prices? A retailer entered into an exclusive agreement with a supplier who 
guaranteed to provide all products at competitive prices. To be sure the supplier honored 
the terms of the agreement, the retailer had an audit performed on a random sample of 25 
invoices. The percent of purchases on each invoice for which an alternative supplier 
offered a lower price than the original supplier was recorded./4 For example, a data value 
of 38 means that the price would be lower with a different supplier for 38% of the items 
on the invoice. A histogram and some numerical summaries of the data are shown here. 
The retailer would like to determine if there is convincing evidence that the mean percent 
of purchases for which an alternative supplier offered lower prices is greater than 50% in 
the population of this company’s invoices. 
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25 77.76 32.6768 0 68 100 100 100 


a. State appropriate hypotheses for the retailer’s test. Be sure to define your parameter. 
b. Check if the conditions for performing the test in part (a) are met. 


69. pg 589 Two-sided test Suppose you want to perform a test of HO: u=6440 : pb = 64 
Ne] 


versus Ha: x64 a? Ht # 64 at the a=0.05a = 0.05 significance level. A random sample 
of size n=25n = 25 from the population of interest yields x =62.8% = 62.8 and sx=5.36 
Sx = 5.36. Assume that the conditions for carrying out the test are met. 


a. Explain why the sample result gives some evidence for the alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


70. One-sided test Suppose you want to perform a test of HO: p=5/4o > = 95 versus Ha: <5 
Ha: & <5 at the a=0.05a = 0.05 significance level. A random sample of size n=20 
n = 20 from the population of interest yields x =4.7@ = 4.7 and sx=0.7482 = 0.74, 
Assume that the conditions for carrying out the test are met. 


a. Explain why the sample result gives some evidence for the alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


71. Attitudes In the study of older students’ attitudes from Exercise 65, the sample mean 
SSHA score was 125.7 and the sample standard deviation was 29.8. 


a. Calculate the standardized test statistic. 
b. Find and interpret the P-value. 


c. What conclusion would you make? 


72. Candy! In the study of the candy machine from Exercise 66, the sample mean weight for 
the bags of candy was 19.28 ounces and the sample standard deviation was 0.81 ounce. 


a. Calculate the standardized test statistic. 
b. Find and interpret the P-value. 


c. What conclusion would you make? 


73: 592 Construction zones Every road has one at some point—construction zones 
PQ DIL PR) y p 


that have much lower speed limits. To see if drivers obey these lower speed limits, a 
police officer uses a radar gun to measure the speed (in miles per hour, or mph) of a 
random sample of 10 drivers in a 25 mph construction zone. Here are the data: 


27 33 32 21 30 30 29 25 27 34 


a. Is there convincing evidence at the a=0.010 = 0.01 significance level that the average 
speed of drivers in this construction zone is greater than the posted speed limit? 


74. 


75. 


76. 


77. 


b. Given your conclusion in part (a), which kind of mistake—a Type I error or a Type II 
error—could you have made? Explain what this mistake would mean in context. 


Ending insomnia A study was carried out with a random sample of 10 patients who suffer 
from insomnia to investigate the effectiveness of a drug designed to increase sleep time. 
The following data show the number of additional hours of sleep per night gained by each 
subject after taking the drug.42 A negative value indicates that the subject got less sleep 
after taking the drug. 


ded 0.8 Lt 0.1 =O.4. 4.4 5.5 1.6 4.6 3.4 


a. Is there convincing evidence at the a=0.010 = 0.01 significance level that the average 
sleep increase is positive for insomnia patients when taking this drug? 


b. Given your conclusion in part (a), which kind of mistake—a Type I error or a Type II 
error—could you have made? Explain what this mistake would mean in context. 


Reading level A school librarian purchases a novel for her library. The publisher claims 
that the book is written at a fifth-grade reading level, but the librarian suspects that the 
reading level is lower than that. The librarian selects a random sample of 40 pages and 
uses a standard readability test to assess the reading level of each page. The mean reading 
level of these pages is 4.8 with a standard deviation of 0.8. Do these data give convincing 
evidence at the a=0.05a@ = 0.05 significance level that the average reading level of this 
novel is less than 5? 


How much juice? One company’s bottles of grapefruit juice are filled by a machine that 
is set to dispense an average of 180 milliliters (ml) of liquid. The company has been 
getting negative feedback from customers about underfilled bottles. To investigate, a 
quality-control inspector takes a random sample of 40 bottles and measures the volume of 
liquid in each bottle. The mean amount of liquid in the bottles is 179.6 ml and the standard 
deviation is 1.3 ml. Do these data provide convincing evidence at the a=0.05a = 0.05 
significance level that the machine is underfilling the bottles? 


Pressing pills A drug manufacturer forms tablets by compressing a granular material that 
contains the active ingredient and various fillers. The hardness of a sample from each 
batch of tablets produced is measured to control the compression process. The target value 
for the hardness is p=11.5/ = 11-5, The hardness data for a random sample of 20 tablets 
from one large batch are 


11.627 11.613 11.493 11.602 11.360 
11.374 11.592 11.458 11.552 11.463 
11.383 11.715 11.485 11.509 11.429 
11.477 11.570 11.623 11.472 11.531 


Is there convincing evidence at the 5% level that the mean hardness of the tablets in this 
batch differs from the target value? 


78. 


79. 


80. 


81. 


Jump around Student researchers Haley, Jeff, and Nathan saw an article on the Internet 
claiming that the average vertical jump for teens was 15 inches. They wondered if the 
average vertical jump of students at their school differed from 15 inches, so they obtained 
a list of student names and selected a random sample of 20 students. After contacting 
these students several times, they finally convinced them to allow their vertical jumps to 
be measured. Here are the data (in inches): 


11.0 11.5 12.5 26.5 15.0 12:5 22.0 15.0 13.5 12.0 
23.0 19.0 15.5 21.0 12.5 23.0 20.0 8.5 25.0 20.5 


Do these data provide convincing evidence at the a=0.100 = 0.10 level that the average 
vertical jump of students at this school differs from 15 inches? 


Pressing pills Refer to Exercise 77. 


a. Construct and interpret a 95% confidence interval for the true hardness pi of the tablets 
in this batch. Assume that the conditions for inference are met. 


b. Explain why the interval in part (a) is consistent with the result of the test in Exercise 
af, 


Jump around Refer to Exercise 78. 


a. Construct and interpret a 90% confidence interval for the true mean vertical jump yp 
(in inches) of the students at Haley, Jeff, and Nathan’s school. Assume that the 
conditions for inference are met. 


b. Explain why the interval in part (a) is consistent with the result of the test in Exercise 
78. 


pg 595 |») Fast connection? How long does it take for a chunk of information to travel 


from one server to another and back on the Internet? According to the site 
internettrafficreport.com, the average response time is 200 milliseconds (about one-fifth of 
a second). Researchers wonder if this claim is true, so they collect data on response times 
(in milliseconds) for a random sample of 14 servers in Europe. A graph of the data reveals 
no strong skewness or outliers. 


a. State an appropriate pair of hypotheses for a significance test in this setting. Be sure to 
define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. The 95% confidence interval for the mean response time is 158.22 to 189.64 
milliseconds. Based on this interval, what conclusion would you make for a test of the 
hypotheses in part (a) at the 5% significance level? 


d. Do we have convincing evidence that the mean response time of servers in the United 
States is different from 200 milliseconds? Justify your answer. 


82. 


83. 


84. 


85. 


86. 


Water! A blogger claims that U.S. adults drink an average of five 8-ounce glasses (that’s 
AO ounces) of water per day. Researchers wonder if this claim is true, so they ask a 
random sample of 24 U.S. adults about their daily water intake. A graph of the data shows 
a roughly symmetric shape with no outliers. 


a. State an appropriate pair of hypotheses for a significance test in this setting. Be sure to 
define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. The 90% confidence interval for the mean daily water intake is 30.35 to 36.92 ounces. 
Based on this interval, what conclusion would you make for a test of the hypotheses in 
part (a) at the 10% significance level? 


d. Do we have convincing evidence that the amount of water U.S. children drink per day 
differs from 40 ounces? Justify your answer. 


Tests and confidence intervals The P-value for a two-sided test of the null hypothesis 
HO: p=10 is 0.06/40 : # = 10 is 0.06 


a. Does the 95% confidence interval for p“ include 10? Why or why not? 


b. Does the 90% confidence interval for p# include 10? Why or why not? 


Tests and confidence intervals The P-value for a two-sided test of the null hypothesis 
HO: p=15 is 0.03440 + # = 15 is 0.03 


a. Does the 99% confidence interval for p“ include 15? Why or why not? 
b. Does the 95% confidence interval for p“ include 15? Why or why not? 


pg 598 |») Potato chips A company that makes potato chips requires each shipment of 


potatoes to meet certain quality standards. If the company finds convincing evidence that 
more than 8% of the potatoes in the shipment have “blemishes,” the truck will be sent 
back to the supplier to get another load of potatoes. Otherwise, the entire truckload will be 
used to make potato chips. To make the decision, a supervisor will inspect a random 
sample of potatoes from the shipment. He will then perform a test of HO: p=0.08 

Ho : p = 0.08 versus Ha: p>0.084e : Pp > 9.08 where p is the true proportion of potatoes 
with blemishes in a given truckload. The power of the test to detect that p=0.11P = 9-11, 
based on a random sample of 500 potatoes and significance level a=0.05a = 0.05, is 
0.764. Interpret this value. 


Upscale restaurant You are thinking about opening a restaurant and are searching for a 
good location. From the research you have done, you know that the mean income of those 
living near the restaurant must be over $85,000 to support the type of upscale restaurant 
you wish to open. You decide to take a simple random sample of 50 people living near 
one potential site. Based on the mean income of this sample, you will perform a test at the 
a=0.05a = 0.05 significance level of HO: u=$85,00040 > # =$85, 000 versus Ha: 


u>$85,000 a? H >$85,000 where u/ is the true mean income in the population of people 
who live near the restaurant. The power of the test to detect that u=$86,000/ =$86, 000 i, 
0.64. Interpret this value. 


87. pg 602 |») Powerful potatoes Refer to Exercise 85. Determine if each of the following 
changes would increase or decrease the power of the test. Explain your answers. 
a. Change the significance level to a=0.10a = 0.10. 
b. Take a random sample of 250 potatoes instead of 500 potatoes. 


c. The true proportion is p=0.10? = 9-10 instead of p=0.11P = 9-11, 


88. Restaurant power Refer to Exercise 86. Determine if each of the following changes 
would increase or decrease the power of the test. Explain your answers. 


a. Use arandom sample of 30 people instead of 50 people. 
b. Try to detect that 1=$85,500/ =85, 500 instead of p=$86,000/ =$86, 000 


c. Change the significance level to a=0.10a = 0.10, 


89. Potato power problems Refer to Exercises 85 and 87. 


a. Explain one disadvantage of using a=0.100 = 0.10 instead of a=0.05a = 0.05 when 
performing the test. 


b. Explain one disadvantage of taking a random sample of 500 potatoes instead of 250 
potatoes from the shipment. 


90. Restaurant power problems Refer to Exercises 86 and 88. 


a. Explain one disadvantage of using a=0.10a = 0.10 instead of a=0.05a = 0.05 when 
performing the test. 


b. Explain one disadvantage of taking a random sample of 50 people instead of 30 people. 


91. Strong chairs? A company that manufactures classroom chairs for high school students 
claims that the mean breaking strength of the chairs is 300 pounds. One of the chairs 
collapsed beneath a 220-pound student last week. You suspect that the manufacturer is 
exaggerating the breaking strength of the chairs, so you would like to perform a test of 


HO: p=30040: # = 300 
Ha: p<300/4a: # < 300 
where ul is the true mean breaking strength of this company’s classroom chairs. 


a. The power of the test to detect that p=2944 — 294 based on a random sample of 30 
chairs and a significance level of a=0.05e = 0.05 is 0.71. Interpret this value. 


b. Find the probability of a Type I error and the probability of a Type II error for the test 


92. 


93. 


94. 


95. 


96. 


in part (a). 
c. Describe two ways to increase the power of the test in part (a). 


Better parking A local high school makes a change that should improve student 
satisfaction with the parking situation. Before the change, 37% of the school’s students 
approved of the parking that was provided. After the change, the principal surveys an SRS 
of students at the school. She would like to perform a test of 


HO: p=0.370: p = 0.37 
Ha: p>0.37Ha: p> 0.37 


where p is the true proportion of students at school who are satisfied with the parking 
situation after the change. 


a. The power of the test to detect that p=0.45? = 9-45 based on a random sample of 200 
students and a significance level of a=0.05a = 0.05 is 0.75. Interpret this value. 


b. Find the probability of a Type I error and the probability of a Type II error for the test 
in part (a). 
c. Describe two ways to increase the power of the test in part (a). 


Error probabilities and power You read that a significance test at the a=0.010 = 0.01 
significance level has probability 0.14 of making a Type II error when a specific 
alternative is true. 


a. What is the power of the test against this alternative? 


b. What’s the probability of making a Type I error? 


Power and error A scientist calculates that a test at the a=0.050 = 0.05 significance level 
has probability 0.23 of making a Type II error when a specific alternative is true. 


a. What is the power of the test against this alternative? 


b. What’s the probability of making a Type I error? 


Do you have ESP? A researcher looking for evidence of extrasensory perception (ESP) 


tests 500 subjects. Four of these subjects do significantly better (P<0.01)\P < 0.01) than 
random guessing. 


a. Is it proper to conclude that these four people have ESP? Explain your answer. 


b. What should the researcher now do to test whether any of these four subjects has ESP? 


Preventing colds A medical experiment investigated whether taking the herb echinacea 
could help prevent colds. The study measured 50 different response variables usually 
associated with colds, such as low-grade fever, congestion, frequency of coughing, and so 
on. At the end of the study, those taking echinacea displayed significantly better responses 
at the a=0.05a@ = 0.05 level than those taking a placebo for 3 of the 50 response variables 


studied. Should we be convinced that echinacea helps prevent colds? Why or why not? 


97. Improving SAT scores A national chain of SAT-preparation schools wants to know if 
using a smartphone app in addition to its regular program will help increase student scores 
more than using just the regular program. On average, the students in the regular program 
increase their scores by 128 points during the 3-month class. To investigate using the 
smartphone app, the prep schools have 5000 students use the app along with the regular 
program and measure their improvement. Then the schools will test the following 
hypotheses: HO: p=12840 > = 128 versus Ha: p>128 Aa ‘> 128) where uu is the true 
mean improvement in the SAT score for students who attend these prep schools. After 3 
months, the average improvement was x =130@ = 130 with a standard deviation of sx=65 
8x = 65. The standardized test statistic is t=2.18t = 2.18 with a P-value of 0.0148. Explain 
why this result is statistically significant, but not practically important. 


98. Music and mazes A researcher wishes to determine if people are able to complete a 
certain pencil and paper maze more quickly while listening to classical music. Suppose 
previous research has established that the mean time needed for people to complete a 
certain maze (without music) is 40 seconds. The researcher decides to test the hypotheses 
HO: )=40 versus Ha: p<40/40 : # = 40 versus H, : ¢ < 40) where u = the time in seconds 
to complete the maze while listening to classical music. To do so, the researcher has 
10,000 people complete the maze with classical music playing. The mean time for these 
people is x =39.92% = 39.92 seconds, and the P-value of his significance test is 0.0002. 
Explain why this result is statistically significant, but not practically important. 


99. Sampling shoppers A marketing consultant observes 50 consecutive shoppers at a 
supermarket, recording how much each shopper spends in the store. Explain why it would 
not be wise to use these data to carry out a significance test about the mean amount spent 
by all shoppers at this supermarket. 


00. Ages of presidents Joe is writing a report on the backgrounds of American presidents. He 
looks up the ages of all the presidents when they entered office. Because Joe took a 
statistics course, he uses these numbers to perform a significance test about the mean age of 
all U.S. presidents. Explain why this makes no sense. 

01. Power calculation: potatoes Refer to Exercise 85. 


a. Suppose that HO: p=0.08/0 : p = 9.08 is true. Describe the shape, center, and variability 
of the sampling distribution of p4? in random samples of size 500. 


b. Use the sampling distribution from part (a) to find the value of pip with an area of 0.05 
to the right of it. If the supervisor obtains a random sample of 500 potatoes with a 
sample proportion of defective potatoes greater than this value of p/P, he will reject HO: 
p=0.08/40 : p = 9.08 at the a=0.05a = 0.05 significance level. 


c. Now suppose that p=0.11? = 9-11. Describe the shape, center, and variability of the 
sampling distribution of p4? in random samples of size 500. 


d. Use the sampling distribution from part (c) to find the probability of getting a sample 


proportion greater than the value you found in part (b). This result is the power of the 
test to detect p=0.11P = 9-11, 


Multiple Choice Select the best answer for Exercises 102-108. 


02. The reason we use t procedures instead of z procedures when carrying out a test about a 
population mean is that 


a. z requires that the sample size be large. 

b. z requires that you know the population standard deviation o. 
c. z requires that the data come from a random sample. 

d. z requires that the population distribution be Normal. 


e. z can only be used for proportions. 
03. You are testing HO: p=1040 : # = 10 against Ha: p<104e ‘ # < 10 based on an SRS of 20 
observations from a Normal population. The t statistic is t=—-2.25 =—2-25, The P-value 


a. falls between 0.01 and 0.02. 
b. falls between 0.02 and 0.04. 
c. falls between 0.04 and 0.05. 
d. falls between 0.05 and 0.25. 


e. is greater than 0.25. 

04. You are testing HO: p= 1040 : # = 10 against Ha: #10 Ha : # #10 based on an SRS of 15 
observations from a Normal population. What values of the t statistic are statistically 
significant at the a=0.005a@ = 0.005 level? 


a. 

t>3.326t > 3.326 
b. 

t>3.286t > 3.286 
Cc. 


t>2.977t > 2.977 


d. t<-3.326¢ <—3.326 or t>3.326t > 3.326 


e. t<—3.286¢ <—3-286 or t>3.286t > 3.286 

05. After checking that conditions are met, you perform a significance test of HO: p=1 
Hy : 4 = 1 versus Ha: p#1 40 * # ~ 1, You obtain a P-value of 0.022. Which of the 
following must be true? 


a. A 95% confidence interval for p“ will include the value 1. 


C. 
d. 


e. 


A 95% confidence interval for np“ will include the value 0.022. 
A 99% confidence interval for p“ will include the value 1. 
A 99% confidence interval for p“ will include the value 0.022. 


None of these is necessarily true. 


06. The most important condition for making an inference about a population mean from a 
significance test is that 


the data come from a random sample. 


a. 
b. the population distribution is exactly Normal. 


C 
d. 


e. 


the data contain no outliers. 
the sample size is less than 10% of the population size. 


the sample size is at least 30. 


07. Vigorous exercise helps people live several years longer (on average). Whether mild 


a 


ctivities like slow walking extend life is not clear. Suppose that the added life expectancy 


from regular slow walking is just 2 months. A significance test is more likely to find a 
significant increase in mean life expectancy with regular slow walking if 


a. 
b. 
c 
d. 


e. 


it is based on a very large random sample and a 5% significance level is used. 
it is based on a very large random sample and a 1% significance level is used. 
it is based on a very small random sample and a 5% significance level is used. 
it is based on a very small random sample and a 1% significance level is used. 


the size of the sample doesn’t have any effect on the significance of the test. 


08. A researcher plans to conduct a significance test at the a=0.010 = 0.01 significance level. 
She designs her study to have a power of 0.90 at a particular alternative value of the 


Pp 
Pp 


a. 
b. 


arameter of interest. The probability that the researcher will commit a Type IJ error for the 
articular alternative value of the parameter she used is 


0.01. 
0.10. 


c. 0.89. 


d. 


e. 


0.90. 
0.99. 


Recycle and Review 


109. 


Is your food safe? (8.1) “Do you feel confident or not confident that the food available at 
most grocery stores is safe to eat?” When a Gallup poll asked this question, 87% of the 
sample said they were confident.42 Gallup announced the poll’s margin of error for 95% 
confidence as +3+3 percentage points. Which of the following sources of error are 
included in this margin of error? Explain your answer. 


Gallup dialed landline telephone numbers at random and so missed all people without 
landline phones, including people whose only phone is a cell phone. 


b. Some people whose numbers were chosen never answered the phone in several calls or 
answered but refused to participate in the poll. 


c. There is chance variation in the random selection of telephone numbers. 


110. Spinning for apples (5.3 or 6.3) In the “Ask Marilyn” column of Parade magazine, a 
reader posed this question: “Say that a slot machine has five wheels, and each wheel has 
five symbols: an apple, a grape, a peach, a pear, and a plum. I pull the lever five times. 
What are the chances that I’ll get at least one apple?” Suppose that the wheels spin 
independently and that the five symbols are equally likely to appear on each wheel in a 
given spin. 


a. Find the probability that the slot player gets at least one apple in one pull of the lever. 


b. Now answer the reader’s question. 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Anne reads that the average price of regular gas in her state is $4.06 per gallon. To see if 
the average price of gas is different in her city, she selects 10 gas stations at random and 
records the price per gallon for regular gas at each station. The data, along with the 
sample mean and standard deviation, are listed in the table. 


Station uf 2 3 4 5 6 7 8 9 10 Mean SD 
Price 413 4.01 4.09 4.05 3.97 3.99 405 3.98 4.09 4.02 4.038 0.0533 


Do the data provide convincing evidence that the average price of gas in Anne’s city 
is different from $4.06 per gallon? 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 9 Review 


Section 9.1: Significance Tests: The Basics 


In this section, you learned the basic ideas of significance testing. Start by stating the 


hypotheses that you want to test. The null hypothesis (Ho)\4o) 


is typically a statement of “no 
difference” and the alternative hypothesis (Ha)(4<) describes what we suspect is true. 


Remember that hypotheses are always about population parameters, not sample statistics. 


When sample data provide evidence for the alternative hypothesis, there are two possible 
explanations: (1) The null hypothesis is true, and data supporting the alternative hypothesis 
occurred just by chance, or (2) the alternative hypothesis is true, and the data are consistent 
with an alternative value of the parameter. In a significance test, always start with the belief 
that the null hypothesis is true. If you can rule out chance as a plausible explanation for the 
observed data, there is convincing evidence that the alternative hypothesis is true. 


The P-value in a significance test measures how likely it is to get evidence for Ha/t« as 
strong or stronger than the observed evidence, assuming the null hypothesis is true. To 
determine if the P-value is small enough to reject HO/0, compare it to a predetermined 
significance level such as a=0.050 = 0.05. If P-value<avalue < a, reject HO/40—there is 
convincing evidence that the alternative hypothesis is true. However, if P-value>avalue > a, 
fail to reject HO/40—there is not convincing evidence that the alternative hypothesis is true. 


Because conclusions are based on sample data, there is a possibility that the conclusion will 
be incorrect. You can make two types of errors in a significance test: a Type I error occurs if 
you find convincing evidence for the alternative hypothesis when, in reality, the null hypothesis 
is true. A Type II error occurs when you don’t find convincing evidence that the alternative 
hypothesis is true when, in reality, the alternative hypothesis is true. The probability of making 
a Type I error is equal to the significance level (a)\@) of the test. 


Section 9.2: Tests About a Population Proportion 

In this section, you learned the details of performing a significance test about a population 
proportion p. Whenever you are asked if there is convincing evidence for a claim about a 
population proportion, you are expected to respond using the familiar four-step process. 


STATE: Give the hypotheses you are testing in terms of p, define the parameter p, and state 
the significance level. 


PLAN: Name the procedure you plan to use (one-sample z test for a population proportion) and 
check the appropriate conditions (Random, 10%, Large Counts) to see if the procedure is 
appropriate. 


e Random: The data come from a random sample from the population of interest. 


© 10%: The sample size is less than 10% of the population when sampling without 
replacement. 


e Large Counts: Both np0”Po and n(1-p0)” (1 — po) are at least 10, where pOPo is the value of 
p in the null hypothesis. 

DO: Calculate the standardized test statistic and P-value. The standardized test statistic z 

measures how far away the sample statistic is from the hypothesized parameter value in 

standardized units: 


P—Po 


| Po (1—P9) 
z=p\—pOpo(i-po)n Yo” 


To calculate the P-value, use Table A or technology. 


CONCLUDE: Use the P-value to make an appropriate conclusion about the hypotheses, in 
context. 


Perform a two-sided test when looking for convincing evidence that the true value of the 
parameter is different from the hypothesized value, in either direction. The P-value for a two- 
sided test is calculated by finding the probability of getting a sample statistic at least as extreme 
as the observed statistic, in either direction, assuming the null hypothesis is true. 


You can also use a confidence interval to make a conclusion for a two-sided test. If the null 
parameter value is one of the plausible values in the interval, there isn’t convincing evidence 
that the alternative hypothesis is true. However, if the null parameter value is not one of the 
plausible values in the interval, there is convincing evidence that the alternative hypothesis is 
true. Besides helping you draw a conclusion, the interval tells you which alternative parameter 
values are plausible. 


Section 9.3: Tests About a Population Mean 


In this section, you learned the details of performing a significance test about a population 
mean. Although some of the details are different, the reasoning and structure of the tests in this 
section are the same as in Section 9.2. In fact, the “State” and “Conclude” steps are exactly the 
same, other than the switch from proportions to means. 


PLAN: Name the procedure you are using (one-sample t test for a population mean), and check 
the conditions (Random, 10%, and Normal/Large Sample). The Random and 10% conditions 
are the same as in Section 9.2. The Normal/Large Sample condition states that the population 
distribution is Normal or the sample size is large (n>30)(" > 30) Tf the sample size is small 
and the population shape is unknown, graph the sample data to make sure there is no strong 


skewness or outliers that would suggest a non-Normal population. 


DO: Calculate the standardized test statistic and P-value. The standardized test statistic t 
measures how far away the sample statistic is from the hypothesized parameter value in 


standardized units: 


t=x —pOsxn vn 


To calculate the P-value using a t distribution, determine the degrees of freedom (df=n-1) 


(df = — 1) and use Table B or technology. 


The probability that you avoid making a Type IJ error when an alternative value of the 
parameter is true is called the power of the test. Power is good—if the alternative hypothesis is 
true, we want to maximize the probability of finding convincing evidence that it is true. We can 
increase the power of a significance test by increasing the sample size or by increasing the 
significance level. The power of a test will also be greater when the alternative value of the 
parameter is farther away from the null hypothesis value. 

Remember to use significance tests wisely. When planning a study, use a large enough 
sample size so the test will have adequate power. Also, remember that statistically significant 
results aren’t always practically important. Finally, be aware that the probability of making at 
least one Type I error goes up dramatically when conducting multiple tests. 


Significance test for p Significance test for p“ 
Name (TI- One-sample z test for p (1- One-sample t test for u“ (TTest) 
83/84) PropZTest) 
Formula b—po oe E—p9 
po (1-p = alee 
| ee t=x —yOsxn vn 
—p)\— = 
al a a a P-value from t distribution with df=n-1 
P-value from standard Normal df=n-—1 
distribution 
Conditions , Random: The data come froma e Random: The data come from a 
random sample from the random sample from the population of 
population of interest. interest. 
o 10%: When sampling without o 10%: When sampling without 
replacement, n<0.10N replacement, n<0.10N7 < 0.10N, 
n<0O0.10N. 


e Normal/Large Sample: The 

e Large Counts: Both npO’?0 and population has a Normal distribution or 
n(1-po)” (1 — Po) are atleast10. the sample size is large (n230). 
That is, the expected number of (n > 30). if the population distribution 


successes and the expected has unknown shape and n<30,” < 39, 
number of failures in the sample use a graph of the sample data to 
are both at least 10. assess the Normality of the 


population. Do not use t procedures if 
the graph shows strong skewness or 


outliers. 


What Did You Learn? 


Learning Target 


State appropriate hypotheses for a significance test 
about a population parameter. 


Interpret a P-value in context. 


Make an appropriate conclusion for a significance 
test. 


Interpret a Type | and a Type II error in context. Give 
a consequence of each error in a given setting. 


State and check the Random, 10%, and Large Counts 
conditions for performing a significance test about a 
population proportion. 


Calculate the standardized test statistic and P-value 
for a test about a population proportion. 


Perform a significance test about a population 
proportion. 


State and check the Random, 10%, and Normal/Large 
Sample conditions for performing a significance test 
about a population mean. 


Calculate the standardized test statistic and P-value 
for a test about a population mean. 


Perform a significance test about a population mean. 


Use a confidence interval to make a conclusion for a 
two-sided test about a population parameter. 


Interpret the power of a significance test and describe 
what factors affect the power of a test. 
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Chapter 9 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R9.1 |») Stating hypotheses State the appropriate null and alternative hypotheses in each of 


the following settings. Explain why the sample data give some evidence for Hao in 
each case. 


a. The average height of 18-year-old American women is 64.2 inches. You wonder 
whether the mean height of this year’s female graduates from a large local high 
school differs from the national average. You measure an SRS of 48 female graduates 
and find that x =63.5 inches® = 63.5 inches and sx=3.7 inches’ = 3.7 inches | 


b. Rob once read that one-quarter of all people have played/danced in the rain at some 
point in their lives. His friend Justin thinks that the proportion is higher than 0.25 for 
their high school. To settle their dispute, they ask a random sample of 80 students in 
their school and find out that 28 have played/danced in the rain. 


R9.2 _») Checking conditions Refer to Exercise R9.1. Identify the appropriate test to 
perform in each setting and show that the conditions for carrying out the test are met. 

R9.3 |») Calculations and conclusions Refer to Exercise R9.1. Find the standardized test 
statistic and P-value in each setting, and make an appropriate conclusion. 

R9.4 |») Fonts and reading ease Does the use of fancy type fonts slow down the reading of 


text on a computer screen? Adults can read four paragraphs of a certain text in the 
common Times New Roman font in an average time of 22 seconds. Researchers asked a 
random sample of 24 adults to read this text in the ornate font named Gigi. Here are their 
times (in seconds): 


23.2 21.2 289 27.7 291 27.3 #161 226 256 342 239 268 
20.5 343 214 326 262 341 315 246 230 286 244 28.1 


Do these data provide convincing evidence that it takes adults longer than 22 seconds, on 
average, to read these four paragraphs in Gigi font? 


R9.5 |) Flu vaccine A drug company has developed a new vaccine for preventing the flu. 


The company claims that fewer than 5% of adults who use its vaccine will get the flu. To 
test the claim, researchers give the vaccine to a random sample of 1000 adults. 


a. State appropriate hypotheses for testing the company’s claim. Be sure to define your 
parameter. 


b. Describe a Type I error and a Type II error in this setting, and give the consequences 


of each. 


c. Would you recommend a significance level of 0.01, 0.05, or 0.10 for this test? Justify 
your choice. 


d. The power of the test to detect the fact that only 3% of adults who use this vaccine 
would develop flu using a=0.05a = 0.05 is 0.9437. Interpret this value. 


e. Explain two ways that you could increase the power of the test from part (d). 

R9.6 \») Flu vaccine Refer to Exercise R9.5. Of the 1000 adults who were given the vaccine, 
43 got the flu. Do these data provide convincing evidence to support the company’s 
claim? 

R9.7 _») Roulette An American roulette wheel has 18 red slots among its 38 slots. To test if a 


particular roulette wheel is fair, you spin the wheel 50 times and the ball lands in a red 
slot 31 times. The resulting P-value is 0.0384. 


a. Interpret the P-value. 
b. What conclusion would you make at the a=0.05a = 0.05 level? 


c. The casino manager uses your data to produce a 99% confidence interval for p and 
gets (0.44, 0.80). He says that this interval provides convincing evidence that the 
wheel is fair. How do you respond? 


Chapter 9 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T9.1 An opinion poll asks a random sample of adults whether they favor banning ownership 
of handguns by private citizens. A commentator believes that more than half of all 
adults favor such a ban. The null and alternative hypotheses you would use to test this 


claim are 

a. HO: p\=0.5; Ha: p\>0.5/40 : Pp = 0.5; Ha : p > 0.5, 
b. HO: p=0.5; Ha: p>0.540 : p = 0.5; Ha : p > 0.5, 

c. HO: p=0.5; Ha: p<0.5/40 > p= 0.5; H, : p< 0.5. 

d. HO: p=0.5; Ha: pz0.540 p= 0.5; H, :p#0.5 

e. HO: p>0.5; Ha: p=0.5440 :p > 0.5; H, : p= 0.5. 


T9.2 You are thinking of conducting a one-sample t test about a population mean pi using a 
0.05 significance level. Which of the following statements is correct? 


a. 
b. 


d. 


e. 


You should not carry out the test if the sample does not have a Normal distribution. 


You can safely carry out the test if there are no outliers, regardless of the sample 
size. 


You can carry out the test if a graph of the data shows no strong skewness, 
regardless of the sample size. 


You can carry out the test only if the population standard deviation is known. 


You can safely carry out the test if your sample size is at least 30. 


T9.3 To determine the reliability of experts who interpret lie detector tests in criminal 
investigations, a random sample of 280 such cases was studied. The results were 


Suspect’s True Status 


Examiner’s Decision Innocent Guilty 
“Innocent” 131 15 
“Guilty” 9 125 


If the hypotheses are HO/“0 : Suspect is innocent versus Ha/4a: Suspect is guilty, which 
of the following is the best estimate of the probability that an expert commits a Type II 
error? 


a. 
b. 


C. 


15/280 
9/280 
15/140 


d. 9/140 
e. 15/146 


T9.4 A significance test allows you to reject a null hypothesis HO//0 in favor of an 
alternative hypothesis Ha/e at the 5% significance level. What can you say about 
significance at the 1% level? 


a. HOMo can be rejected at the 1% significance level. 

b. There is insufficient evidence to reject HO/0 at the 1% significance level. 
c. There is sufficient evidence to accept HO/40 at the 1% significance level. 
d. Hae can be rejected at the 1% significance level. 

e. The answer can’t be determined from the information given. 


T9.5 A random sample of 100 likely voters in a small city produced 59 voters in favor of 
Candidate A. The observed value of the standardized test statistic for performing a test 
of HO: p=0.540 : p = 9-5 versus Ha: p>0.5/4e * P > 9-5 is which of the following? 

0.59-0.5 


/ 0.59(0.41) 


a. 2=0.59-0.50.59(0.41)100 «Vm 


&N 


0.59—0.5 
/ 0.5(0.5) 


b. z=0.59-0.50.5(0.5)100  \ i 


»¥ 
“~ 


0.5—0.59 
} 0.59(0.41) 


c. z=0.5-0.590.59(0.41)100  -\ 


N 


0.5—0,59 
/ 0.5(0.5) 


d. z=0.5-0.590.5(0.5)100  V ™ 


~ 
“~ 


_ _ 0.59-0.5 
we 


e. z=0.59-0.5100 V100 


T9.6 A researcher claims to have found a drug that causes people to grow taller. The coach 
of the basketball team at Brandon University has expressed interest but demands 
evidence. Over 1000 Brandon students volunteer to participate in an experiment to test 
this new drug. Fifty of the volunteers are randomly selected, their heights are measured, 
and they are given the drug. Two weeks later, their heights are measured again. The 
power of the test to detect an average increase in height of 1 inch could be increased by 


a. using only volunteers from the basketball team in the experiment. 
b. using a=0.01@ = 0.01 instead of a=0.05a = 0.05, 

c. using a=0.05a = 0.05 instead of a=0.010 = 0.01, 

d. giving the drug to 25 randomly selected students instead of 50. 

e. using a two-sided test instead of a one-sided test. 


T9.7 A 95% confidence interval for the proportion of viewers of a certain reality television 


show who are over 30 years old is (0.26, 0.35). Suppose the show’s producers want to 
test the hypothesis HO: p=0.2540 : p = 9-25 against Ha: p~0.25Ha :p # 0.25. Which 
of the following is an appropriate conclusion for them to draw at the a=0.05a = 0.05 
significance level? 


a. Fail to reject HO;/40; there is convincing evidence that the true proportion of 
viewers of this reality TV show who are over 30 years old equals 0.25. 


b. Fail to reject HO;0; there is not convincing evidence that the true proportion of 
viewers of this reality TV show who are over 30 years old differs from 0.25. 


c. Reject H0;/0; there is not convincing evidence that the true proportion of viewers 
of this reality TV show who are over 30 years old differs from 0.25. 


d. Reject H0;/40; there is convincing evidence that the true proportion of viewers of 
this reality TV show who are over 30 years old is greater than 0.25. 


e. Reject H0;/40; there is convincing evidence that the true proportion of viewers of 
this reality TV show who are over 30 years old differs from 0.25. 


T9.8 In a test of HO : p=0.440 : p = 9.4 against Ha: p#0.44a :p #04 a random sample of 
size 100 yields a standardized test statistic of z=1.282 = 1.28. Which of the following 
is closest to the P-value for this test? 


a. 0.90 
b. 0.40 
c. 0.05 
d. 0.20 
e. 0.10 


T9.9 Bags of a certain brand of tortilla chips claim to have a net weight of 14 ounces. Net 
weights vary slightly from bag to bag and are Normally distributed with mean pnp. A 
representative of a consumer advocacy group wishes to see if there is convincing 
evidence that the mean net weight is less than advertised and so intends to test the 
hypotheses 


HO: p=14440: # = 14 

Ha: p<14He: # < 14 
A Type I error in this situation would mean concluding that the bags 
a. are being underfilled when they aren’t. 
b. are being underfilled when they are. 


c. are not being underfilled when they are. 


aan 


are not being underfilled when they aren’t. 


e. are being overfilled when they are underfilled. 


T9.10 Which of the following has the greatest probability? 
a. P(t>2)P (¢> 2) ifthass degrees of freedom. 
b. P(t>2)P (t > 2) ifthas 2 degrees of freedom. 
C P(z>2)P (2 > 2) ifz is a standard Normal random variable. 
d. P(t<2)P (t <2) ifthas5 degrees of freedom. 


e, P(z<2)P (2 < 2) if zis a standard Normal random variable. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T9.11 A software company is trying to decide whether to produce an upgrade of one of its 
programs. Customers would have to pay $100 for the upgrade. For the upgrade to be 
profitable, the company must sell it to more than 20% of their customers. You contact a 
random sample of 60 customers and find that 16 would be willing to pay $100 for the 
upgrade. 


a. Do the sample data give convincing evidence that more than 20% of the company’s 
customers are willing to purchase the upgrade? Carry out an appropriate test at the 
a=0.05a = 0.05 significance level. 


b. Which would be a more serious mistake in this setting—a Type I error or a Type II 
error? Justify your answer. 


c. Suppose that 30% of the company’s customers would be willing to pay $100 for the 
upgrade. The power of the test to detect this fact is 0.60. Interpret this value. 


T9.12 According to the Bureau of Labor Statistics, the average age of American workers is 
41.9 years. The manager of a large technology company believes that the company’s 
employees tend to be younger, on average. So she takes a random sample of 12 
employees and records their ages. Here are the data: 


27 38 32 24 30 47 42 38 27 43 37 33 


a. State appropriate hypotheses for testing the manager’s belief. Be sure to define the 
parameter of interest. 


b. State the conditions for performing a test of the hypotheses in (a), and determine 
whether each condition is met. 


c. The P-value of the test is 0.003. Interpret this value. What conclusion would you 
make? 


T9.13 A government report says that the average amount of money spent per U.S. household 
per week on food is about $158. A random sample of 50 households in a small city is 
selected, and their weekly spending on food is recorded. The sample data have a mean 


of $165 and a standard deviation of $20. Is there convincing evidence that the mean 
weekly spending on food in this city differs from the national figure of $158? 


Comparing Two 


Populations or Treatments 
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Introduction 


Section 10.1 Comparing Two Proportions 


Cumulative AP® Practice Test 3 


INTRODUCTION 


Which of two often-prescribed drugs—Lipitor or Pravachol—helps lower “bad cholesterol” 
more? Researchers designed an experiment, called the PROVE-IT Study, to find out. They 
used about 4000 people with heart disease as subjects. These individuals were randomly 
assigned to one of two treatment groups: Lipitor or Pravachol. At the end of the study, 
researchers compared the proportion of subjects in each group who died, had a heart attack, or 
suffered other serious consequences within two years. For those using Pravachol, the 
proportion was 0.263; for those using Lipitor, it was 0.224. Could such a difference have 
occurred purely by the chance involved in the random assignment? This is a question about 
comparing two proportions. 


Who studies more in college—men or women? Researchers asked separate random samples 
of 30 males and 30 females at a large university how many minutes they studied on a typical 
weeknight. The females reported studying an average of x F=165.172F = 165.17 minutes; the 
male average was xX M=117.17%™ = 117.17 minutes. How large is the difference in the 
corresponding population means? This is a question about comparing two means. 


What if we want to compare means in a setting that involves measuring the same 
quantitative variable twice for each individual or once for each of two individuals that are much 
alike? For instance, a researcher studied a random sample of identical twins who had been 
separated and adopted at birth. In each case, one twin was adopted by a high-income family 
and the other by a low-income family. Both twins were given an IQ test as adults. Did the twins 
raised in high-income families have significantly higher average IQ scores than the twins raised 
in low-income families? This is a question about comparing two means for paired data. 


Comparing two proportions or means based on random sampling or a randomized 
experiment is one of the most common situations encountered in statistical practice. In the 
PROVE-IT experiment, the goal of inference is to determine whether the treatments (Lipitor 
and Pravachol) caused the observed difference in the proportion of subjects who experienced 
serious consequences in the two groups. For the college study-time survey, the goal of 
inference is to draw a conclusion about the difference in actual mean study times for all women 
and all men at the university. In the observational study of identical twins raised separately, the 
goal is to make an inference about whether twins raised in high-income families have higher 
mean adult IQ scores than twins raised in low-income families. 


The following activity gives you a taste of what lies ahead in this chapter. 


ACTIVITY Who likes tattoos? 


For their response bias project (page 292), Sarah and Miranda investigated whether the 
characteristics of the interviewer can affect the response to a survey question. At the Tucson 
Mall, they asked 60 shoppers the question “Do you like tattoos?” When interviewing 30 of 
the shoppers, Sarah and Miranda wore long-sleeved tops that covered their tattoos. For the 


remaining 30 shoppers, the interviewers wore tank tops that revealed their tattoos. The 
choice of long-sleeve or tank top was determined at random. Sarah and Miranda suspected 
that more people would answer “Yes” when their tattoos were visible. 


Rawpixel.com/Shutterstock 
What happened in the experiment? The two-way table summarizes the results:2 


Clothing worn 


Tank top Long sleeves Total 
: Yes 32 
Like tattoos? 
No 28 
Total 30 30 60 


The difference (Tank top — Long sleeves) in the proportions of people who said they like 


tattoos is 18/30-14/30=0.600-0.467=0.13318/30 — 14/30 = 0.600 — 0.467 = 0.133 Does 
this difference provide convincing evidence that the appearance of the interviewer has an 
effect on the response, or could the difference be due to chance variation in the random 
assignment? 


In this activity, your class will investigate whether the results of the experiment are 


Statistically significant. Let’s see what would happen just by chance if we randomly reassign 
the 60 people in this experiment to the two treatments (tank top and long sleeves) many 
times, assuming the treatment received doesn’t affect whether or not a person says they like 
tattoos. 


1. 


Using 60 index cards or equally sized pieces of paper, write “Yes” on 32 and “No” on 28. 
Shuffle the cards and divide them at random into two piles of 30—one for the tank top 
treatment group and one for the long sleeves treatment group. Be sure to determine which 
pile will represent each group before you deal. 


. Calculate the difference (Tank top — Long sleeves) in the proportions of “Yes” responses 


for the two groups. 


. Your teacher will draw and label axes for a class dotplot. Plot your result from Step 2 on 


the graph. 


. Repeat Steps 2 and 3 if needed to get a total of at least 40 trials of the simulation for your 


class. 


5. How often did a difference in proportions of 0.133 or greater occur due only to the chance 
involved in the random assignment? What conclusion would you make about the effect of 
the interviewer’s appearance? 


We used software to perform 100 trials of the simulation described in the activity. In each 
trial, the computer randomly reassigned the 60 subjects in the experiment to the two treatments, 
assuming that the treatment received would not affect whether each person says that he or she 
likes tattoos. Each dot in the graph represents the difference (Tank top — Long sleeves) in the 
proportion of respondents in the two groups who say they like tattoos for a particular trial. In 
this simulation, 19 of the 100 trials (in red) produced a difference in proportions of at least 
0.133, so the approximate P-value is 0.19. A difference this large could easily occur just due to 
the chance variation in random assignment! Sarah and Miranda’s data do not provide 
convincing evidence that the appearance of the interviewer caused a higher proportion of 
people to say they like tattoos when the interviewer’s tattoos were visible. 
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SECTION 10.1 Comparing Two Proportions 


LEARNING TARGETS By the end of the section, you should be able to: 
e Describe the shape, center, and variability of the sampling distribution of 
p*1-p’2?P 1 Po. 
Determine whether the conditions are met for doing inference about a difference 
between two proportions. 


Construct and interpret a confidence interval for a difference between two 
proportions. 


Calculate the standardized test statistic and P-value for a test about a difference 
between two proportions. 


Perform a significance test about a difference between two proportions. 


Suppose we want to compare the proportions of individuals with a certain characteristic in 
Population 1 and Population 2. Let’s call these parameters of interest p1?1 and p2P2. The 
preferred strategy is to take a separate random sample from each population and to compare the 
sample proportions pA1P 1 and pA2P2 2 with that characteristic. 

What if we want to compare the effectiveness of Treatment 1 and Treatment 2 ina 
completely randomized experiment? This time, the parameters p1?! and p2?P2 that we want to 
compare are the true proportions of successful outcomes for each treatment. We use the 
proportions of successes in the two treatment groups, pA1P 1 and pA2P 2, to make the 
comparison. Here’s a table that summarizes these two situations: 


Population or treatment Parameter Statistic Sample size 
1 pi? pn1P 1 nw 
2 p2P2 pr2P2 n2te2 


We compare the populations or treatments by doing inference about the difference p1—p2 
P1 ~ P2 between the parameters. The statistic that estimates this difference is the difference 
between the two sample proportions, p\1—pA2P1 P2. To use pA1—pA2P1 P2 for inference, 
we must know its sampling distribution. 


The Sampling Distribution of a Difference Between Two 
Proportions 


To explore the sampling distribution of p\1-pA2P i ~ P2, let’s start with two populations 
having a known proportion of successes. Suppose there are two large high schools—each with 
more than 2000 students—in a certain town. At School 1, 70% of students did their homework 


last night (p1=0.70)(P1 = 0.70), Only 50% of the students at School 2 did their homework last 
night (p2=0.50)(P2 = 0.50) The counselor at School 1 selects an SRS of 100 students and 
records the proportion p11 that did the homework. School 2’s counselor selects an SRS of 
200 students and records the proportion pr2P2 that did the homework. What can we say about 
the difference p\1—p2P1 ~ P2 in the sample proportions? 


MachineHeadz/Getty Images 


In Chapter 7, we saw that the sampling distribution of a sample proportion pAP has the 
following properties: 


Shape: Approximately Normal if np>10"P = 10 and n(1—p)=10" (1—p) > 10 


cr p(l—p) . 
Variability : t= = ifn < 0.10.N 


Variability:op\=p(1—p)n if n<0.10N 


For the sampling distributions of pA1P1 and pr2P2 in this case: 


Sampling distribution of pr1P1 
Shape Approximately Normal; 
mpi = 100 (0.70) = 70 > 10 and 
n1p1=100(0.70)=70210 andn1(1-p1)=100(0.30)=30 210% (1 — pi) = 100 (0.30) = 30 > 10 


Center =p, = 0.70 


up1=p1=0.70'» | 


Variability — PAill-py, —— /0.7(0.3) 


Pl ny 100 


op*1=p1(1-p1)n1=0.7(0.3)100=0.0458 = 9.0458 
because 100 < 10% of all students at School 1. 


What about the sampling distribution of pA1—p2P1 ~ P2? We used software to randomly 
select 100 students from School 1 and 200 students from School 2. Our first set of samples 
gave p\1=0.72P 1 = 0.72 and p\2=0.47P2 = 0.47 resulting in a difference of 
p\1-p\2=0.72-0.47=0.25P 1 ~ Pz = 0.72 — 0.47 = 0.25. A red dot for this value appears in 
Figure 10.1. The dotplot shows the results of repeating this process 500 times. 
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FIGURE 10.1 Simulated sampling distribution of the difference in sample proportions p*1-p%2 
Pi ~ P2 in 500 SRSs of size n1=100%1 = 100 froma population with p1=0.70P1 = 0.70 and 500 
SRSs of size n2=200"2 = 200 from a population with p2=0.50P2 = 9.50. 


The figure suggests that the sampling distribution of p\1-pA2P1 ~ P2 has an approximately 
Normal shape. This makes sense from what you learned in Section 6.2 because we are 
subtracting two independent random variables, pA1P 1 and pr2P 2, that have approximately 
Normal distributions. 

The mean of the sampling distribution is 0.20. The true proportion of students who did last 
night’s homework at School 1 is p1=0.70P1 = 9-70 and at School 2 is p2=0.50P2 = 9-50, We 
expect the difference pA1-p2P1 ~ P2 to center on the actual difference in the population 
proportions, p1—p2=0.70—-0.50=0.20P1 ~ p2 = 9.70 — 0.50 = 0.20. The standard deviation of 
the sampling distribution is 0.058. It can be found using the formula 


p1(1- p 1)nl1+p2(1- p 2)n2=0.7(0.3)100+0.5(0.5)200=0.058 


py (1-; 1- 0.7(0.3) 0.5(0.5 
ewe [0-7(0.3) , 0.5(0.5) 


= 0.0 
ny n5 V~ 100 | iia 


That is, the difference (School 1-School 2)(Schooll — School2) in the sample proportions of 
students at the two schools who did their homework last night typically varies by about 0.058 
from the true difference in proportions of 0.20. 


THE SAMPLING DISTRIBUTION OF p41-p‘2?P: — P2 


Choose an SRS of size n1™1 from Population 1 with proportion of successes p1P1 and an 
independent SRS of size n2®2 from Population 2 with proportion of successes p2P2. Then: 


e The sampling distribution of p\1-pA2P1 P2 is approximately Normal if the Large 
Counts condition is met for both samples: np11>10, n1(I-p1)=10, n2p2>10, 
mpi 2 10, m1 (1— pr) 2 10, map2 2 10, and n2(1-p2)>10 (1 — Pa) 2 10, 


e The mean of the sampling distribution of p\1-p2P 1 — P2 is up’1-p\1=p1-p2 
Hp .—p, — Pi — Pa 


e The standard deviation of the sampling distribution of p\1—pA2P 1 — P2 is 


_ /py(i-p1) _ P2(1-P2) 
Op = = 4{ ———— + ——__— 
ha eee _ _ P\—P2 \ ny ng 
op1—-p2=p1(1-p1)n1+p2(1-p2)n2 


as long as the 10% condition is met for both samples: n1<0.10N1™ < 9-101 and 
n2<0.10N2%2 < 0.10N2 


Note that the formula for the standard deviation of the sampling distribution is only correct 
for independent random samples from the corresponding populations. We will adjust the 
Random condition for inference about a difference between two proportions to reflect this 
added requirement. The standard deviation of the sampling distribution tells us how much the 
difference in sample proportions will typically vary from the difference in the population 
proportions if we repeat the random sampling process many times. 


Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND STANDARD DEVIATION 
OF THE SAMPLING DISTRIBUTION OF p41 - pr2P P2 COME FROM? Both 
pA1P 1 and pr2P2 2 are random variables. That is, their values would vary in repeated 
independent SRSs of size n1™1 and n2”2. Independent random samples yield independent 
random variables p\1P1 and pr2P2, 2. The statistic p41- pr2P1 P2 is the difference of these 
two independent random variables. 

In Chapter 6, we learned that for any two random variables X and Y, 


uxX-Y=pX-pYex-y = hx — By 
For the random variables pAlPr and pr2P2 , we have 
HpA1—pA2=pp\1—pp2=p1—p2"Pr-P2 ~ May ~ Mpp ~ PE Pe 
We also learned in Chapter 6 that for independent random variables X and Y, 
OX-Y2=0X2+0¥27X-¥ = % ty 
For the random variables pAlP and pA2P 2, we have 


op\1—p22=o0p12+0p22=(p1(1-p1)n1)2+(p2(1-p2)n2)2 


— 2 ——-"» 2 
9 2 2 ( aes | ( a) 

O- - =O. +O: = QeovX__"_"_— + —_—_— 
P1-P2 P| P2 \ ny \ ny» 


pi(1—p1)  p2 (1-2) 
SS —————— EE 
=p1(1—-p1)n1+p2(1—-p2)n2 "1 "2 


So op\1-p/2=p1(1-p1)n1+p2(1-p2)n2°P1-P2 — Vm 7 


When the conditions are met, we can use the Normal density curve shown in Figure 10.2 to 
model the sampling distribution of p\1—p2?1 ~ P2. Note that this would allow us to calculate 
probabilities involving p41—p42?1 ~ P2 with a Normal distribution. 


Standard deviation: 


p,(- Py), pz(1 -p2) 
ny ny 


| Mean: p; ~ p2| | Mean: p; ~ p2| P2 


Values of B, ~P> 
FIGURE 10.2 Select independent SRSs from two populations having proportions of successes 
p1P1 and p2P2. The proportions of successes in the two samples are p%1?1 and p’2P?2. When 


the conditions are met, the sampling distribution of p1P1 —p2P2 is approximately Normal with 
pie an 4 P2(1—py) 


mean p1-p2?1 — P2 and standard deviation p1(1-p1)n1+p2(1- p2)n2V ra 


EXAMPLE | Yummy goldfish! (>) 
Describing the sampling distribution of p41-p42?1 — P2 


PROBLEM: Your teacher brings two bags of colored goldfish crackers to class. Bag 1 has 
25% red crackers and Bag 2 has 35% red crackers. Each bag contains more than 1000 
crackers. Using a paper cup, your teacher takes an SRS of 50 crackers from Bag 1 anda 
separate SRS of 40 crackers from Bag 2. Let p\1—p2?1 ~ P2 be the difference in the sample 
proportions of red crackers. 

a. What is the shape of the sampling distribution of p\1—pA2P 1 — P2? Why? 

b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling distribution. 


Envision/Getty Images 


SOLUTION: 


mpi = 50(0.25) = 12.5, iq — ps) 


a. Approximately Normal, because nip1 = 50(0.25) = 12.5 
= 50(0.75) = 37.5% (1-1) = 50(0.75) = 37.5, n2p2 = 40(0.35) = 14 
n2p2 = 40 (0.35) = 14 and n2(1 ~ p2) oe 40(0.65) = 2672 (1 _ p2) = 40 (0.65) == 26 are all 


>10.= 10. 


Note that these values are the expected numbers of successes and failures in the 


two samples. 


b MpA1—pr2 = 0.25 — 0.35 = -0.10P1-P2 ~ 0.25 — 0.35 = — 0.10 


= Pi — Ps 


pp1—p\2=p1—p2!r Po 


c. Because 50 < 10% of all crackers in Bag 1 and 40 < 10% of all crackers in Bag 2, 
op\1—ph2 = 0.25(0.75)50+0.35(0.65)40 = 0.0971 
/0.25(0.75) 0.35(0.65) 
O- = = 4 ———_+———— 
ri-p2 ~V~ 50 40 
The difference (Bag 1— Bag 2) in the sample proportions of red goldfish crackers 
typically varies by about 0.097 from the true difference in proportions of -0.10.~9.10. 


= 0.0971 


fe, G=Pi) , aQ—P9) 
<a 


oD P2 =a ; 
op\1—pA2=p1(1-p1)ni+p2(1—p2)n2 yoo 


FOR PRACTICE, TRY EXERCISE 1 


Confidence Intervals for p1 -— p2?”: — P2 


When data come from two independent random samples or two groups in a randomized 
experiment (the Random condition), the statistic p\1-p2P i ~ P2 is our best guess for the 
value of p1—p2?1 ~ P2. The method we use to calculate a confidence interval for p1—p2?! ~ P2 
requires that the sampling distribution of pA1-pA2P1 ~ P2 be approximately Normal. Earlier, 
we noted that this will be true whenever n1p11P1, n1(1-p1)™! (1—pi), n2p2"2P2, and 
n2(1—-p2)"2 (1 — p2) are all at least 10. Because we don’t know the value of pl?! or p2P2 when 
we are estimating their difference, we use p\1P1 and pA2P 2 when checking the Large Counts 
condition. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A DIFFERENCE IN PROPORTIONS 


e Random: The data come from two independent random samples or from two groups ina 
randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1% < 9.101 and n2<0.10N2 
no < 0.10.N> : 


e Large Counts: The counts of “successes” and “failures” in each sample or group—n1p1 
MP1, n1(1-pA1)™ (1— Pi), n2pA2"2P2, n2(1—pA2)" (1 — P2)_are all at least 10. 


Recall from Chapter 4 that the Random condition is important for determining the scope of 
inference. Random sampling allows us to generalize our results to the populations of interest; 
random assignment in an experiment permits us to draw cause-and-effect conclusions. 


EXAMPLE |Do you prefer brand names? (>) 
Checking conditions 


Ariel Skelley/Getty Images 


PROBLEM: A Harris Interactive survey asked independent random samples of adults from 
the United States and Germany about the importance of brand names when buying clothes. 
Of the 2309 U.S. adults surveyed, 26% said brand names were important, compared with 
22% of the 1058 German adults surveyed. Let pU=?U ~ the true proportion of all U.S. 
adults who think brand names are important when buying clothes and pG=P¢ ~ the true 
proportion of all German adults who think brand names are important when buying clothes. 
Check if the conditions for calculating a confidence interval for pU-pGPU ~ PG¢ are met. 


SOLUTION: 


e Random: Independent random samples of 2309 UES: adults and 1058 German adults. / 


© 10%: 2309<10%2309 < 10% of all U.S. adults and 1058 << 10% of all German adults 
/ 


Be sure to mention independent random samples from the populations of interest 


when checking the Random condition. 


e Large Counts?nUp\U = 2309(0.26) = 600.34->600 
nu Py = 2309 (0.26) = 600.34 + 600 yqy(1 — PAU) = 2309(0.74) = 1708.661709 
ny (1 — By) = 2309 (0.74) = 1708.66 — 1709 ng yrg = 1058(0.22) = 232.76 233 
NG Pq = 1058 (0.22) = 232.76 — 233, nG(1 - prG) = 1058(0.78) = 825.24—>825 
ng (1— pq) = 1058 (0.78) = 825.24 825 0, aff s10> 10. ¥ 


We round these values to the nearest whole number because they represent the 


actual numbers of successes and failures in the two samples. 


FOR PRACTICE, TRY EXERCISE 5 


If the conditions are met, we can use our familiar formula to calculate a confidence interval 
for p1—p2:P1 ~ P2 : 


Statistic+(critical value)-(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


=(p/1—p/2)+z*-(standard deviation of statistic) 
= (p, — py) + z* - (standard deviation of statistic) 
As mentioned earlier, the standard deviation of the sampling distribution of p\1-p2P1 P2 is 
[Pi (t—P1) , P2 (1-2) 
a a ee 
op\1—p\2=p1(1-p1)n1+p2(1-p2)n2 | Voom 2 


Because we don’t know the values of the parameters p1?! and p2P2, we replace them in the 
standard deviation formula with the sample proportions. The result is the standard error of 
pA1—pA2P1 = P2 : 
| 
SEp41—-p42=p/1(1-p41)n1+p42(1-p2)n2 


This value estimates how much the difference in sample proportions will typically vary from 
the difference in the true proportions if we repeat the random sampling or random assignment 
many times. 

When the Large Counts condition is met, we find the critical value z*2° for the given 
confidence level using Table A or technology. Our confidence interval for p1—p2?1 ~ P2 is 
therefore 


statistic+(critical value)-(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


= (p p )+z* /P1Q—P1) | b2(1-P9) 
a 1 Rata 2 i 5 ee 
=(p\1—p2)+2z*pA1(1—p41)n1+p2(1-pA2)n2 es m2 


This is often called a two-sample z interval for a difference between two proportions. 


TWO-SAMPLE z INTERVAL FOR A DIFFERENCE BETWEEN TWO 
PROPORTIONS 


When the conditions are met, a C% confidence interval for p1—p2?!1 ~ P2 is 


. 2), .. /P1(l-P1) , b2(1-#2) 
(By — Bg) £2°4{ —————— + 
(p\1-pA2)+z*p\1(1-p1)n1+p42(1—-p2)n2 1 - 


where z*2* is the critical value for the standard Normal curve with C%C % of the area 
between -z*~2" and z*2*. 


Let’s return to the brand names example. Recall that Harris Interactive took independent 
random samples of 2309 U.S. adults and 1058 German adults, and found that p\U=0.26 
Py = 0.26 and p\G=0.22Pe = 0.22 We already confirmed that the conditions are met. A 95% 
confidence interval for pU-pGPU ~ PG@ is 


(0.26-0.22)+1.960.26(1-0.26)2309+0.22(1-0.22)1058 


(0.26(1—0.26) 0.22(1—0.22) 


2309 ; 1058 
=0.04+0.03 = 0.04 + 0.03 


(0.26 — 0.22) + 1.96 


=(0.01, 0.07) = (0-01, 0.07) 


Interpretation: We are 95% confident that the interval from 0.01 to 0.07 captures pU-pG 
PU ~ PG= the difference in the true proportions of all U.S. adults and all German adults who 
think brand names are important when buying clothes. 


Note that the confidence interval does not include 0 as a plausible value for pU-pGPU ~ Pe 
, SO we have convincing evidence of a difference between the population proportions. In fact, it 
is believable that pU-pG?U ~ Pe has any value between 0.01 and 0.07. We can restate this in 
context as follows: The interval suggests that the importance of brand names when buying 
clothes for U.S. adults is between 1 and 7 percentage points higher than for German adults. 


It would not be correct to say that the importance of brand names when buying clothes for 
U.S. adults is between 1 and 7 percent higher than for German adults. To see why, suppose that 
pU=0.25Pu = 9.25 and pG=0.20Pe = 9.20. The difference pU-pG=0.25-0.20=0.05 
Pu ~ Pg = 0.25 ~ 0.20 = 0.05 | or 5 percentage points. But the proportion of U.S. adults who 
think brand names are important when buying clothes is 0.05/0.20=0.259-05/0.20 = 0.25 


25% higher than the corresponding proportion of German adults. 


or 


The researchers in the preceding example selected independent random samples from the 
two populations they wanted to compare. In practice, it’s common to take one random sample 
that includes individuals from both populations of interest and then to separate the chosen 
individuals into two groups. For instance, a polling company may randomly select 1000 U.S. 
adults, then separate the Republicans from the Democrats to estimate the difference in the 
proportion of all people in each party who favor the death penalty. The two-sample z 
procedures for comparing proportions are still valid in such situations, provided that the two 
groups can be viewed as independent samples from their respective populations of interest. 


INFERENCE FOR EXPERIMENTS So far, we have focused on doing inference using data 
that were produced by random sampling. However, many important statistical results come 
from randomized comparative experiments. Defining the parameters in experimental settings is 
more challenging. 


The “Who likes tattoos?” activity on page 619 describes an experiment that used 60 mall 
shoppers as subjects. Sarah and Miranda randomly assigned half the subjects to be asked, “Do 
you like tattoos?” by an interviewer wearing a tank top with tattoos showing, and the other half 
to be asked the same question by the same interviewer wearing long sleeves that hid the tattoos. 
Then Sarah and Miranda compared the proportions of people in the two groups who said 
“Yes.” The parameters in this setting are 


p1=P1 ~ the true proportion of people like these who would say they like tattoos 
when asked by the interviewer with tattoos showing 


p2=P2 ~ the true proportion of people like these who would say they like tattoos 
when asked by the interviewer with tattoos hidden 


Most experiments on people use recruited volunteers as subjects. When subjects are not 
randomly selected, researchers cannot generalize the results of an experiment to some larger 


populations of interest. But researchers can draw cause-and-effect conclusions that apply to 
people like those who took part in the experiment. This same logic applies to experiments on 
animals or things. Also, unless the experimental units are randomly selected without 
replacement, we don’t have to check the 10% condition when performing inference about an 
experiment. 

The following example shows how to construct and interpret a confidence interval for a 
difference in proportions. As usual with inference problems, we follow the four-step process. 


EXAMPLE | Treating lower back pain (>) 
Confidence interval for p; — p2 


Syda Productions/Shutterstock 


PROBLEM: Patients with lower back pain are often given nonsteroidal anti-inflammatory 
drugs (NSAIDs) like naproxen to help ease their pain. Researchers wondered if taking 
Valium along with the naproxen would affect pain relief. To find out, they recruited 112 
patients with severe lower back pain and randomly assigned them to one of two treatments: 
naproxen and Valium or naproxen and placebo. After 1 week, 39 of the 57 subjects who took 
naproxen and Valium reported reduced lower back pain, compared with 43 of the 55 subjects 
in the naproxen and placebo group.2 


a. Construct and interpret a 99% confidence interval for the difference in the proportion of 
patients like these who would report reduced lower back pain after taking naproxen and 
Valium versus after taking naproxen and placebo for a week. 


b. Based on the confidence interval in part (a), what conclusion would you make about 
whether taking Valium along with naproxen affects pain relief? Justify your answer. 


SOLUTION: 


a. STATE: 99% CI for pia p2Pi — P2, where pi =Pl ~ true proportion of ‘patients like 
these who would report reduced lower back pain after taking naproxen and Valium for 


a week and p2 aes — true proportion of patients [ike these who would report reduced 
lower back pain after taking naproxen and placebo for a week. 


Be sure to indicate the order of subtraction when defining the parameter. Then you 


can just mimic the wording in your conclusion. 


PLAN: Tr wo-sample z interval for (ee pri — p2 


e Random: Randomly assigned patients to take naproxen and Valium or naproxen and 


placebo. / 


¢ Large Counts: 39, 57 — 39 =1897 — 39 = 18, 43, and 55 - 43 = 1295 — 43 = 12 are 
allz1oall > 10, ¥ 


We don’t have to check the 10% condition because researchers did not sample 


patients without replacement from a larger population. 


= = = 0.684 Po = a = 0.782 


DO: p\1 = Oe = 0.684"! De a= 355 0e2 
(0.684 — 0.782) +2.5760.684(0.316)57+0.782(0.218)55 


eZ , /0.684(0.316) — 0.782(0.218) 
(0.684 — 0.782) + 2.5764) —S oe 


= -0.098+0.214 = 0.098 + 0.214 
= (a , 0.116) ( — 0.312, 0.116) 


Pp {= Po Z* —————— os 
(p\1—p42)+z*p\1(1-p1)n1+p2(1—-p2)n2 yo ny n9 


Refer to the Technology Corner that follows the example. The calculator’s 2-PropZInt 
gives (-0.3114, 0.1162).(—9.3114, 0.1162) . 


CONCLUDE: We are 99% confident that the interval from —0.312 to 0.116 captures p1 — 
p2 =P1 — P2 = the difference in the true proportions of ‘patients like these who would report 
reduced pain after taking naproxen and Valium versus after taking naproxen and a 


placebo for a week. 


b. Because the interval includes 0 as a plausible value for pi p2Pi — P2, we don’t have 
convincing evidence that taking Valium along with naproxen affects pain relief ‘for 
patients like these. 


The interval suggests that the true proportion of patients like these who would report 


reduced pain after taking naproxen and Valium is between 31.2 percentage points 
lower and 11.6 percentage points higher than after taking naproxen and placebo. 


FOR PRACTICE, TRY EXERCISE 9 


We could have subtracted the proportions in the opposite order in part (a) of the example. 
The resulting 99% confidence interval for p2—p1?2 ~ P1 is 
(0.782—0.684)+2.5760.782(0.218)55+0.684(0.316)57 
/0.782(0.218) 0.684(0.316) 
55 57 
=0.09840.214 = 0.098 + 0.214 


(0.782 — 0.684) + 2.5764/ 


=(-0.116, 0.312)= (0-116, 0.312) 


Notice that the endpoints of the interval have the same values but opposite signs to the ones in 
the example. This interval suggests that the true proportion of patients like these who would 
report reduced pain after taking naproxen and placebo is between 11.6 percentage points lower 
and 31.2 percentage points higher than after taking naproxen and Valium. That’s equivalent to 
our interpretation of the confidence interval for p1—p2?! ~ P2 in part (a) of the example. 

The fact that 0 is included in a confidence interval for p1—p2?! ~ ?2 means that we don’t 
have convincing evidence of a difference between the true proportions. Keep in mind that 0 is 
just one of many plausible values for p1—p2?! ~ P2 based on the sample data. og Never 
suggest that you believe the difference between the true proportions is 0 just because 0 is 
in the interval! 

You can use technology to perform the calculations in the “Do” step. Remember that this 
comes with potential benefits and risks on the AP® Statistics exam. 


23. Technology Corner | CONSTRUCTING A CONFIDENCE 
INTERVAL FOR A DIFFERENCE IN PROPORTIONS |») 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to construct a confidence interval for p1—p2?! ~ P2. We’|l 
demonstrate using the preceding example. Of n1=57" = 57 subjects who took naproxen 
and Valium, X1=39%1 = 39 reported reduced lower back pain after a week. Of n2=55 

m2 = 55 subjects who took naproxen and placebo, X2=432 = 43 reported reduced lower 
back pain after a week. To construct a confidence interval: 


e Press |STAT|, then choose TESTS and 2-PropZInt. 


e When the 2-PropZInt screen appears, enterr the values shown. Note that the values of x7, 


ny, Xz, and ny must be integers. 


e Highlight “Calculate” and press |ENTER|. 


x1:39 

ni:57 

x2:43 

n2:55 
C-Level:0.99 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-PropZint 
( -@.3114,.0.11623) 
P1=@. 6842105263 
®2=0. 7818181818 
ni=57 
n2=55 


AP® EXAM TIP 


The formula for the two-sample z interval for p1-p2?1 ~ P2 often leads to calculation 
errors by students. As a result, your teacher may recommend using the calculator’s 
2-PropZInt feature to compute the confidence interval on the AP® Statistics exam. 
Be sure to name the procedure (two-sample z interval for p1—-p2?1 ~ P2) in the 
“Plan” step and give the interval (-0.311, 0.116)(—9-311, 9.116) jn the “Do” step. 


CHECK YOUR UNDERSTANDING 


\ 


v A Pew Research Center poll asked independent random samples of working women and 
men how much they value job security. Of the 806 women, 709 said job security was very or 
extremely important, compared with 802 of the 944 men surveyed. Construct and interpret a 


95% confidence interval for the difference in the proportion of all working women and men 
who consider job security very or extremely important. 


Significance Tests for p1 — p2?: — P2 


An observed difference between two sample proportions can reflect an actual difference in the 
parameters, or it may just be due to chance variation in random sampling or random 
assignment. Significance tests help us decide which explanation makes more sense. 


STATING HYPOTHESES AND CHECKING CONDITIONS In a test for comparing two 
proportions, the null hypothesis has the general form 


HO: p1—p2=hypothesized value/1o: pi — p2 = hypothesized value 


We’re often interested in situations in which the hypothesized difference is 0. Then the null 
hypothesis says that there is no difference between the two parameters: 


HO: p1-p2=040: pi — p2 = 0 


(You will sometimes see the null hypothesis written in the equivalent form HO: p1=p2 
Ho: pi = P2.) The alternative hypothesis says what kind of difference we expect. 

The conditions for performing a significance test about p1—p2?1 ~ P2 are the same as for 
constructing a confidence interval. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE TEST ABOUT A 
DIFFERENCE BETWEEN TWO PROPORTIONS 


e Random: The data come from two independent random samples or from two groups in a 
randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1% < 9.10 and n2<0.10N2 
no < 0.10.N> ; 


e Large Counts: The counts of “successes” and “failures” in each sample or group—n1p/1 
MP1, n1(1-pA1)™ (1— Pi), n2pA2"2P2, n2(1—pA2)" (1 — P2)__are all at least 10. 


Here’s an example that illustrates how to state hypotheses and check conditions. 


EXAMPLE |Hungry children (>) 
Stating hypotheses and checking conditions 


Alamy 


PROBLEM: Researchers designed a study to compare the proportion of children who come 
to school without eating breakfast in two low- income elementary schools. An SRS of 80 
students from School 1 found that 19 missed breakfast today. At School 2, an SRS of 150 
students included 26 who missed breakfast today. More than 1500 students attend each 
school. Do these data give convincing evidence of a difference in the population proportions 
at the a=0.05a = 0.05 significance level? 
a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 
SOLUTION: 


a. ‘Ho: p1 — p2 = oHo: mi — m2 = 0 
‘Ha: p1 — pa : Pi — P2 zoF 9 
where pl =P1 = the true proportion of all students at School 1 who missed breakfast 
today and p2 =P2 = the true proportion of all students at School 2 who missed 
breakfast today. 


You can also state the hypotheses as 


Ho: pi=p2 Ho: pi = po 


Ha: pixp2 Ha: pi # Pr 


e Random? Independent random samples of 80 students from School 1 and 150 
students from School 2.4 


© 10%: 80<1080 < 10% of students at School 1; 150<10150 < 10% of students at 
School 2. / 


¢ Large Counts? 19, 80 - 19 = 6180 — 19 = 61, 36, and 150 - 26 = 124150 — 26 = 124 
are all 102 10. ¥ 


The numbers of successes (missed breakfast!) and failures (ate breakfast) in both 


samples are at least 10. 
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CALCULATIONS: STANDARDIZED TEST STATISTIC AND P-VALUE If the 
conditions are met, we can proceed with calculations. To do a test of HO: p1—p2=0 
Ho: pi — p2 = 9, standardize p\1-p2P1 ~ P2 to get az statistic: 

standardized test statistic=statistic-parameterstandard deviation of statistic 
statistic— parameter 


standardized test statistic ={—-—-——_____ 
standard deviation of statistic 


[Pi (1—P1)  P2(1—P9) 
| ——————— + ———————- 
z=(p\1—p’2)-Op1(1-p1)n1+p2(1—p2)n2 Voom n9 


y— 
“— 


With a little factoring, we can rewrite the denominator of the standardized test statistic as 
om mrs AS Se PI 
/p(l- P),/ et me 


formula sheet under the heading “Standard Deviation of Statistic” for a difference of sample 


p(1-p)1n1+1n2 . You'll find this expression on the AP® Statistics exam 


proportions in the special case when p1=p2P1 = P2. 


You may be tempted to replace p1P! and p2P2 in the denominator with the corresponding 
sample proportions pA1P1 and pr2P2, Don’t do it! Here’s why. 

A significance test begins by assuming that HO: p1-p2=040: pi — P2 = 9 is true. In that 
case, pl=p2P1 = P2, We call the common value of these two parameters p?. Substituting this 
value in place of both p1?! and p2 in the denominator of the standardized test 
Statistic yields 


Z=(p41—p42)-Op(1—p)n1+p(1—-p)n2 


Unfortunately, we don’t know the common value of p. To estimate p, we combine (or 
“pool”) the data from the two samples as if they came from one larger sample. This combined 
sample proportion is 


p/=number of successes in both samples combinednumber of individuals in both samples combin 


In other words, p/ gives the overall proportion of successes in the combined samples. Use 


p/ in place of both p1 and p2 in the denominator of the standardized 
test statistic: 


Some people prefer to use p* to check the Large Counts condition because this is 


consistent with assuming that HO: p1-p2=0 
is true. If the expected counts 


nip’ ,n1(1-p) 

, n2p% ’ 
and n2(1-p‘) are all at least 
10, the sampling distribution of p%1-p%2 is approximately 
Normal. 


Z=(p41—p’2)-OpA(1—p*)n1+p(1-p4)n2 


When the Large Counts condition is met, this z statistic will have approximately the 
standard Normal distribution. We can find the appropriate P-value using Table A or 
technology. 


EXAMPLE | Calculating the standardized test statistic and P- 
value > 
Who eats breakfast? 


PROBLEM: Refer to the preceding example. The two-way table summarizes the data from 
the independent random samples of children at School 1 and School 2. We already 
confirmed that the conditions for performing a significance test are met. 


School 


1 2 Total 
No 45 
Breakfast? 
Yes 185 
Total 80 150 230 


a. Explain why the sample results give some evidence for the alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make using a= 0.05 ? 
SOLUTION: 


a. The observed difference in the sample proportions 1s et mp 2 =1980-26150=0.2375 
— 0.1733 =0.0642 


, which gives some evidence in favor of Ha: p1—p2*0 


because 0.06420. 


The rightmost column in the table above makes it easy to see that the overall 
(combined) proportion of successes in the two samples is p“=45/230 


b. p=19+2680+150=45230=0.196 


Z=(0.2375 -0.1733) -00.196(0.804)80+0.196(0.804)150=0.06420.055 =1.17 


Using Table A: P(z<-1.17 0rz=1.17) 
=2(0.1210) 


=0.2420 


Z= (pA1 -ph2) — op” (1 —pr)n1+pA(1 -pr)n2 


Using technology: normalcdf(lower: 1.17, upper: 1000, mean: o, SD: 1) X 2=0.2420 


c. Because the P-value of O24 20-6 —0.05 
, we fail to reject Ho 


. There is not convincing evidence of a difference in the true proportions 


of students at School 1 and School 2 who skipped breakfast today. 


A standard normal curve is shown. 
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The two-sample z test and two-sample z interval for the difference between two proportions don’t 


always give consistent results. That’s because the “standard deviation of the statistic” used in 
calculating the standardized test statistic is 


pA(1—-p*)n1+p(1—-p*)n2 


but for the confidence interval, it’s 


pA1(1-p‘1)n1+p42(1-p2)n2 


What does the P-value in the example tell us? If there is no difference in the population 
proportions of students who missed breakfast today at the two schools and we repeated the 
random sampling process many times, we’d get a difference in sample proportions as large as 
or larger than 0.0642 in either direction about 24% of the time. With such a high probability of 
getting a result like this just by chance when the null hypothesis is true, we don’t have enough 
evidence to reject HO ; 

We can get additional information about the difference between the population proportions 
who missed breakfast today at School 1 and School 2 with a confidence interval. Technology 
gives the 95% confidence interval for p1—p2 as —0.047 


to 0.175. That is, we are 95% confident that the difference in the true 
proportions of students who missed breakfast at the two schools is between 4.7 percentage 
points lower at School 1 and 17.5 percentage points higher at School 1. This is consistent with 


our “fail to reject HO ” conclusion because 0 is included in the interval of 


plausible values for p1—p2 


24. Technology Corner | PERFORMING A SIGNIFICANCE TEST 
FOR A DIFFERENCE IN PROPORTIONS |») 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to perform significance tests for comparing two proportions. Here, 


we use the data from the hungry children example. To perform a test of HO: p1—p2=0 
versus Ha: p1—p20: 


e Press [STAT] then choose TESTS and 2-PropZTest. 

e When the 2-PropZTest screen appears, enter the values shown. Specify the alternative 
hypothesis p1#p2 

e If you select “Calculate” and press [ENTER|, you will see that the standardized test 
Statistic is z=1.168 and the P-value is 0.2427. Do you see the combined 
proportion of students who didn’t eat breakfast? It’s the value labeled p/ , 0.1957. 


e If you select the “Draw” option, you will see the screen shown here. 


A screenshot of a computer output is shown. 


A standard normal curve is shown. 


AP® EXAM TIP 


The formula for the two-sample Z statistic for a test about p1-p2 


often leads to calculation errors by students. As a 
result, your teacher may recommend using the calculator’s 2-PropZTest feature to 


perform calculations on the AP® Statistics exam. Be sure to name the procedure 


(two-sample z test for p1-p2 ) in the “Plan” step and 
report the standardized test statistic (z=1.17) 


and P-value (0.2427) in the “Do” 
step. 


Putting It All Together: Two-Sample z Test for p1—- p2 


Here is a summary of the details for the two-sample z test for the difference between two 
proportions. 


TWO-SAMPLE z TEST FOR THE DIFFERENCE BETWEEN TWO 
PROPORTIONS 


Suppose the conditions are met. To test the hypothesis HO: p1—p2=0 
, first find the pooled proportion 


p of successes in both samples combined. Then compute the standardized test statistic 


Z=(p41—p42)-Op\(1-p*)n1+p(1—-p*)n2 


Find the P-value by calculating the probability of getting a z statistic this large or larger in 


the direction specified by the alternative hypothesis Ha in a standard Normal 
distribution. 


As with any test, be sure to follow the four-step process. 


EXAMPLE | Cholesterol and heart attacks (>) 
Performing a significance test about p, — p2 


PROBLEM: High levels of cholesterol in the blood are associated with a higher risk of heart 
attacks. Will using a drug to lower blood cholesterol reduce heart attacks? The Helsinki 
Heart Study recruited middle-aged men with high cholesterol but no history of other serious 
medical problems to investigate this question. The volunteer subjects were assigned at 
random to one of two treatments: 2051 men took the drug gemfibrozil to reduce their 
cholesterol levels, and a control group of 2030 men took a placebo. During the next five 
years, 56 men in the gemfibrozil group and 84 men in the placebo group had heart attacks. 


a. Do the results of this study give convincing evidence at the a=0.01 
significance level that gemfibrozil is effective in preventing heart attacks? 


b. Interpret the P-value you got in part (a) in the context of this experiment. 
SOLUTION: 


a. STATE: We want to test 


‘Ho: pG -~pC=o0 
Ha: pG-pC<o 
where pG= the true heart attack rate for middle-aged men like 
these who take gemfibrozil and pc= the true heart attack rate for 


middle-aged men like these who take a placebo using C=O 


PLAN: T3 wo-sample z test ‘for pG-pe 
e Random: Volunteer subjects were randomly assigned to gemfibrozil or placebo. / 
e Large Counts: 56, 2051 — 56 = 1995, 84, and 2030 - 84 = 1946 are all =10.¢ 


Note that we do not have to check the 10% condition because the subjects in the 
experiment were not sampled without replacement from some larger population. 


The numbers of successes (heart attacks!) and failures (no heart attacks) in both 
samples are at least 10. 


DO: 
e ip G 502051 = (ONO 


pArC=842030=0.0414 


The sample result gives some evidence in favor of Ha: pi — p2 < 0 because 0.0273 


— 0.0414 = -0.0141 <0. 


Ppr=56484205142030=1404081 = 0.0343 


© z=(0.0273-0.0414) -00.0343(0.9657)2051+0.0343(0.9657)2030 


=—0.01410.00§7=—2.47 


e P-value 


Using Table A: P(z<-2.47)=0.0068 


On the TI-83/84, the 2-PropZTest gives z = -2.47 and P-value = 0.0068. 


Using technology: normalcdf(lower: -1000 , Upper: -2.47 , mean:o, 


SD:1)=0.0068 
CONCLUDE: Because our P-value of 0.0068<a=0.01 


, we reject Ho . There 
1S convincing evidence of a lower heart attack rate for middle-aged men [ike these who take 
gemfibrozil than for those who take only a placebo. 


(b) Assuming ‘Ho: piap2—0 


is true, there is a 
0.0068 probability of getting a difference (Gemfibrozil ~ Placebo) in heart attack rate for 


the two groups of —0.0141 or less just by the chance involved in the 
random assignment. 


A standard normal curve is shown. 
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We chose a=0.01 in the example to reduce the chance of making a Type I 
error—finding convincing evidence that gemfibrozil reduces heart attack risk when it really 
doesn’t. This error could have serious consequences if an ineffective drug was given to lots of 
middle-aged men with high cholesterol! 

The random assignment in the Helsinki Heart Study allowed researchers to draw a cause- 
and-effect conclusion. They could say that gemfibrozil reduces the rate of heart attacks for 
middle-aged men like those who took part in the experiment. Because the subjects were not 
randomly selected from a larger population, researchers could not generalize the findings of 
this study any further. No conclusions could be drawn about the effectiveness of gemfibrozil at 
preventing heart attacks for all middle-aged men, for men of other ages, or for women. 


Think About It 


WHY DO THE INFERENCE METHODS FOR RANDOM SAMPLING WORK FOR 
RANDOMIZED EXPERIMENTS? Confidence intervals and tests for p1—p2 


are based on the sampling distribution of p41—p/2 


. But in most experiments, researchers don’t select 
subjects at random from any larger populations. They do randomly assign subjects to 
treatments. We can think about what would happen if the random assignment were repeated 
many times under the assumption that HO: p1—p2=0 


is true. That is, we assume that 
the specific treatment received doesn’t affect an individual subject’s response. 

Let’s see what would happen just by chance if we randomly reassign the 4081 subjects in 
the Helsinki Heart Study to the two groups many times, assuming the drug received doesn’t 
affect whether or not each individual has a heart attack. We used software to redo the 
random assignment 500 times. Figure 10.3 shows the value of p\G-—p4C 

in the 500 simulated trials. This distribution 
(sometimes referred to as the randomization distribution of p\G-—pC 


) has an approximately Normal shape with mean 0 and 
standard deviation 0.0058. This matches well with the distribution we used to perform 
calculations in the example. Because the Large Counts condition was met and we assumed 
that HO: pG-pC=0 is true, 
we used for the standard deviation of the statistic. 


FIGURE 10.3 Dotplot of the values of p*G-—p*C from 
each of 500 simulated random reassignments of subjects to treatment groups in the Helsinki 
Heart Study, assuming no treatment effect. 


p\(1-p)n1+p\(1-p)n2=0.0343(1-0.0343)2051+0.0343(1-0.0343)2030 


=0.0057 


In the Helsinki Heart Study, the difference in the proportions of subjects who had a heart 
attack in the gemfibrozil and placebo groups was 0.0273-0.0414= 


-0.0141 . How likely is it that a difference this 
large or larger would happen just by chance when HO is true? Figure 10.3 


provides a rough answer: 5 of the 500 random reassignments yielded a difference in 


proportions less than or equal to —0.0141 . That is, our estimate of the P- 
value is 0.01. This is quite close to the 0.0068 P-value that we calculated in the preceding 
example. Neither of these values is the exact P-value. To get the exact P-value in an 
experiment, you would have to consider the difference in sample proportions for all possible 
random assignments. This approach is called a permutation test. 


CHECK YOUR UNDERSTANDING 


. 


y To study the long-term effects of preschool programs for poor children, researchers 
designed an experiment. They recruited 123 children who had never attended preschool from 
low-income families in Michigan. Researchers randomly assigned 62 of the children to 
attend preschool (paid for by the study budget) and the other 61 to serve as a control group 
who would not go to preschool. One response variable of interest was the need for social 
services as adults. Over a 10-year period, 38 children in the preschool group and 49 in the 


control group have needed social services.4 


1. Do these data provide convincing evidence that preschool reduces the later need for social 
services for children like the ones in this study? Justify your answer. 

2. Based on your conclusion to Question 1, could you have made a Type I error or a Type II 
error? Explain your reasoning. 


3. Should you generalize the result in Question 1 to all children from low-income families 
who have never attended preschool? Why or why not? 


Section 10.1 Summary 


e Choose independent SRSs of size n1 from Population 1 with proportion of 
successes p1 and of size n2 from Population 2 with proportion of 
successes p2 . The sampling distribution of p\1—p/2 


has the following properties: 


« Shape: Approximately Normal if the Large Counts condition is met: n1p1, n1(1—-p1), n2p2 


, and n2(1-p2) are 
all at least 10. 


« Center: The mean is pp41—p2=p1-p2 


» Variability: The standard deviation is op\1-p42=p1(1-p1)n1+p2(1-p2)n2 


as long as the 10% condition is met when sampling without replacement: n1<0.10N1 
and n2<0.10N2 


e Confidence intervals and significance tests to compare the proportions p1 and p2 
of successes for two populations or treatments are based on the difference 
p\1-pA2 between the sample proportions. 
e Before estimating or testing a claim about p1—p2 , check that 
these conditions are met: 


« Random: The data come from two independent random samples or from two groups in a 
randomized experiment. 
10%: When sampling without replacement, n1<0.10N1 


and n2<0.10N2 


« Large Counts: The counts of “successes” and “failures” in each sample or group—n1p1 


, n1(1-p1) 
, n2p/A2 
, n2(1—p/2) 
— are all at least 
10. 
e When conditions are met, a C% confidence interval for p1—p2 is 
(p41—p/2)+z*p41(1-p41)n1+p42(1-p/2)n2 
where z* is the standard Normal critical value with C% of its area 
between —z* and z* . This is 


called a two-sample z interval for p1—p2 
e Significance tests of HO: p1—p2=0 
use the combined (pooled) 
sample proportionp’ in the formula for the standard deviation of the statistic: 


p/=count of successes in both samples combinedcount of individuals in both samples combine: 


e When conditions are met, the two-sample z test for p1 — p2 
uses the standardized test statistic 


Z=(p1—p/2)-Op(1—p*)n1+p(1-p*)n2 


with P-values calculated from the standard Normal distribution. 


e Be sure to follow the four-step process whenever you construct a confidence interval or 
perform a significance test for comparing two proportions. 


10.1 Technology Corners 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


23. Constructing a confidence interval for a difference in proportions 


24. Performing a significance test for a difference in Page 629 
proportions Page 633 


Section 10.1 Exercises 


1. pg 624 ») I want red! A candy maker offers Child and Adult bags of jelly beans with 


different color mixes. The company claims that the Child mix has 30% red jelly beans, 
while the Adult mix contains 15% red jelly beans. Assume that the candy maker’s claim is 
true. Suppose we take a random sample of 50 jelly beans from the Child mix and a 


separate random sample of 100 jelly beans from the Adult mix. Let pAC 
and pAA be the sample proportions of red jelly beans from the Child and 
Adult mixes, respectively. 
a. What is the shape of the sampling distribution of p\C-—pAA 
2? Why? 


b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling distribution. 


2. Literacy A researcher reports that 80% of high school graduates, but only 40% of high 
school dropouts, would pass a basic literacy test.2 Assume that the researcher’s claim is 
true. Suppose we give a basic literacy test to a random sample of 60 high school graduates 


and a separate random sample of 75 high school dropouts. Let p\G and 
pAD be the sample proportions of graduates and dropouts, respectively, 
who pass the test. 
a. What is the shape of the sampling distribution of p\G-p/D 

2? Why? 


Be 


b. Find the mean of the sampling distribution. 
c. Calculate and interpret the standard deviation of the sampling distribution. 
I want red! Refer to Exercise 1. 


a. Find the probability that the proportion of red jelly beans in the Child sample is less 
than or equal to the proportion of red jelly beans in the Adult sample, assuming that the 
company’s claim is true. 


b. Suppose that the Child and Adult samples contain an equal proportion of red jelly 


beans. Based on your result in part (a), would this give you reason to doubt the 
company’s claim? Explain your reasoning. 


Literacy Refer to Exercise 2. 
a. Find the probability that the proportion of graduates who pass the test is at most 0.20 


higher than the proportion of dropouts who pass, assuming that the researcher’s report 
is correct. 


b. Suppose that the difference (Graduate — Dropout) in the sample proportions who pass 


the test is exactly 0.20. Based on your result in part (a), would this give you reason to 
doubt the researcher’s claim? Explain your reasoning. 


pg 625 |») Don’t drink the water! The movie A Civil Action (1998) tells the story of a 


major legal battle that took place in the small town of Woburn, Massachusetts. A town 
well that supplied water to east Woburn residents was contaminated by industrial 
chemicals. During the period that residents drank water from this well, 16 of 414 babies 
born had birth defects. On the west side of Woburn, 3 of 228 babies born during the same 


time period had birth defects. Let p1= the true proportion of all babies 


born with birth defects in west Woburn and p2= the true proportion of all 
babies born with birth defects in east Woburn. Check if the conditions for calculating a 


confidence interval for p1—p2 are met. 


Broken crackers We don’t like to find broken crackers when we open the package. How 
can makers reduce breaking? One idea is to microwave the crackers for 30 seconds right 
after baking them. Randomly assign 65 newly baked crackers to the microwave and 
another 65 to a control group that is not microwaved. After 1 day, none of the microwave 


group were broken and 16 of the control group were broken.® Let p1= 
the true proportions of crackers like these that would break if baked in the microwave and 


p2= the true proportions of crackers like these that would break if not 
microwaved. Check if the conditions for calculating a confidence interval for p1—p2 


are met. 


Cockroaches The pesticide diazinon is commonly used to treat infestations of the German 
cockroach, Blattella germanica. A study investigated the persistence of this pesticide on 


10. 


11. 


various types of surfaces. Researchers applied a 0.5% emulsion of diazinon to glass and 
plasterboard. After 14 days, they randomly assigned 72 cockroaches to two groups of 36, 
placed one group on each surface, and recorded the number that died within 48 hours. On 


glass, 18 cockroaches died, while on plasterboard, 25 died. If p1 and p2 
are the true proportions of cockroaches like these that would die within 48 


hours on glass treated with diazinon and on plasterboard treated with diazinon, 
respectively, check if the conditions for calculating a confidence interval for p1—p2 


are met. 
Digital video disks A company that records and sells rewritable DVDs wants to compare 
the reliability of DVD fabricating machines produced by two different manufacturers. 


They randomly select 500 DVDs produced by each fabricator and find that 484 of the 
disks produced by the first machine are acceptable and 480 of the disks produced by the 


second machine are acceptable. If p1 and p2 are the proportions of 
acceptable DVDs produced by the first and second machines, respectively, check if the 
conditions for calculating a confidence interval for p1—p2 are 
met. 


pg 628 |») Young adults living at home A surprising number of young adults (ages 19 


to 25) still live in their parents’ homes. The National Institutes of Health surveyed 
independent random samples of 2253 men and 2629 women in this age group.2 The 
survey found that 986 of the men and 923 of the women lived with their parents. 


a. Construct and interpret a 99% confidence interval for the difference in the true 


proportions of men and women aged 19 to 25 who live in their parents’ homes. 


b. Does your interval from part (a) give convincing evidence of a difference between the 


population proportions? Justify your answer. 


Where’s Egypt? In a Pew Research poll, 287 out of 522 randomly selected U.S. men 
were able to identify Egypt when it was highlighted on a map of the Middle East. When 
520 randomly selected U.S. women were asked, 233 were able to do so. 


a. Construct and interpret a 95% confidence interval for the difference in the true 


proportion of U.S. men and U.S. women who can identify Egypt on a map. 


b. Based on your interval, is there convincing evidence of a difference in the true 


proportions of U.S. men and women who can identify Egypt on a map? Justify your 
answer. 


Response bias Does the appearance of the interviewer influence how people respond to a 
survey question? Ken (white, with blond hair) and Hassan (darker, with Middle Eastern 
features) conducted an experiment to address this question. They took turns (in a random 
order) walking up to people on the main street of a small town, identifying themselves as 
students from a local high school, and asking them, “Do you support President Obama’s 


12. 


13. 


14. 


15. 


decision to launch airstrikes in Iraq?” Of the 50 people Hassan spoke to, 11 said “Yes,” 
while 21 of the 44 people Ken spoke to said “Yes.” Construct and interpret a 90% 
confidence interval for the difference in the proportion of people like these who would say 
they support President Obama’s decision when asked by Hassan versus when asked by 
Ken. 


Quit smoking Nicotine patches are often used to help smokers quit. Does giving medicine 
to fight depression help? A randomized double-blind experiment assigned 244 smokers to 
receive nicotine patches and another 245 to receive both a patch and the antidepressant 
drug bupropion. After a year, 40 subjects in the nicotine patch group had abstained from 
smoking, as had 87 in the patch-plus-drug group. Construct and interpret a 99% 
confidence interval for the difference in the true proportion of smokers like these who 
would abstain when using bupropion and a nicotine patch and the proportion who would 
abstain when using only a patch. 


Ban junk food! A CBS News poll asked 606 randomly selected women and 442 
randomly selected men, “Do you think putting a special tax on junk food would encourage 
more people to lose weight?” 170 of the women and 102 of the men said “Yes.”8 A 99% 
confidence interval for the difference (Women — Men) in the true proportion of people in 


each population who would say “Yes” is —0.020 to 0.120. Does the 
confidence interval provide convincing evidence that the two population proportions are 
equal? Explain your answer. 


Artificial trees? An association of Christmas tree growers in Indiana wants to know if 
there is a difference in preference for natural trees between urban and rural households. So 
the association sponsored a survey of Indiana households that had a Christmas tree last 
year to find out. In a random sample of 160 rural households, 64 had a natural tree. In a 
separate random sample of 261 urban households, 89 had a natural tree. A 95% 
confidence interval for the difference (Rural — Urban) in the true proportion of households 


in each population that had a natural tree is —0.036 to 0.154. Does the 
confidence interval provide convincing evidence that the two population proportions are 
equal? Explain your answer. 


pg 631 |») Children make choices Many new products introduced into the market are 


targeted toward children. The choice behavior of children with regard to new products is 
of particular interest to companies that design marketing strategies for these products. As 
part of one study, randomly selected children in different age groups were compared on 
their ability to sort new products into the correct product category (milk or juice).2 Here 
are some of the data: 


Age group N Number who sorted correctly 
4-to 5-year-olds 50 10 
6-to 7-year-olds 53 28 


16. 


WP 


18. 


19. 


Researchers want to know if a greater proportion of 6- to 7-year-olds can sort correctly 
than 4- to 5-year-olds. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


Steroids in high school A study by the National Athletic Trainers Association surveyed 
random samples of 1679 high school freshmen and 1366 high school seniors in Illinois. 
Results showed that 34 of the freshmen and 24 of the seniors had used anabolic steroids. 
Steroids, which are dangerous, are sometimes used in an attempt to improve athletic 
performance.22 Researchers want to know if there is a difference in the proportion of all 
Illinois high school freshmen and seniors who have used anabolic steroids. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


Shrubs and fire Fire is a serious threat to shrubs in dry climates. Some shrubs can 
resprout from their roots after their tops are destroyed. Researchers wondered if fire would 
help with resprouting. One study of resprouting took place in a dry area of Mexico.t! The 
researchers randomly assigned shrubs to treatment and control groups. They clipped the 
tops of all the shrubs. They then applied a propane torch to the stumps of the treatment 
group to simulate a fire. All 12 of the shrubs in the treatment group resprouted. Only 8 of 
the 12 shrubs in the control group resprouted. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


Ticks Lyme disease is spread in the northeastern United States by infected ticks. The ticks 
are infected mainly by feeding on mice, so more mice result in more infected ticks. The 
mouse population, in turn, rises and falls with the abundance of acorns, their favored food. 
Experimenters studied two similar forest areas in a year when the acorn crop failed. To see 
if mice are more likely to breed when there are more acorns, the researchers added 
hundreds of thousands of acorns to one area to imitate an abundant acorn crop, while 
leaving the other area untouched. The next spring, 54 of the 72 mice trapped in the first 


area were in breeding condition, versus 10 of the 17 mice trapped in the second area.42 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


pg 632 |») Children make choices Refer to Exercise 15. 


a. Explain why the sample results give some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


20. Steroids in high school Refer to Exercise 16. 


a. Explain why the sample results give some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


21. pg 635 Bag lunch? Phoebe has a hunch that older students at her very large high 
Ne] 


22. 


age 


school are more likely to bring a bag lunch than younger students because they have 
grown tired of cafeteria food. She takes a simple random sample of 80 sophomores and 
finds that 52 of them bring a bag lunch. A simple random sample of 104 seniors reveals 
that 78 of them bring a bag lunch. 


a. Do these data give convincing evidence to support Phoebe’s hunch at the a=0.05 


significance level? 


b. Interpret the P-value from part (a) in the context of this study. 


Are teenagers going deaf? In a study of 3000 randomly selected teenagers in 1990, 450 
showed some hearing loss. In a similar study of 1800 teenagers reported in 2010, 351 
showed some hearing loss.12 


a. Do these data give convincing evidence that the proportion of all teens with hearing 


loss has increased at the a=0.01 significance level? 


b. Interpret the P-value from part (a) in the context of this study. 


Preventing peanut allergies A recent study of peanut allergies—the LEAP trial— 
explored the relationship between early exposure to peanuts and the subsequent 
development of an allergy to peanuts. Infants (4 to 11 months old) who had shown 
evidence of other kinds of allergies were randomly assigned to one of two groups. Group 
1 consumed a baby-food form of peanut butter. Group 2 avoided peanut butter. At 5 years 
old, 10 of 307 children in the peanut-consumption group were allergic to peanuts, and 55 
of 321 children in the peanut-avoidance group were allergic to peanuts.14 


a. Does this study provide convincing evidence of a difference at the a= 0.05 


significance level in the development of peanut allergies in infants 
like the ones in this study who consume or avoid peanut butter? 


b. Based on your conclusion in part (a), which mistake— a Type I error or a Type II error 


—could you have made? Explain your answer. 


c. Should you generalize the result in part (a) to all infants? Why or why not? 


24. Lowering bad cholesterol Which of two widely prescribed drugs—Lipitor or Pravachol 
—helps lower “bad cholesterol” more? In an experiment, called the PROVE-IT Study, 
researchers recruited about 4000 people with heart disease as subjects. These volunteers 
were randomly assigned to one of two treatment groups: Lipitor or Pravachol. At the end 
of the study, researchers compared the proportion of subjects in each group who died, had 
a heart attack, or suffered other serious consequences within two years. For the 2063 
subjects using Pravachol, the proportion was 0.263. For the 2099 subjects using Lipitor, 
the proportion was 0.224.149 


a. Does this study provide convincing evidence at the a=0.05 
significance level of a difference in the effectiveness of Lipitor and Pravachol for 
people like the ones in this study? 


b. Based on your conclusion in part (a), which mistake—a Type I error or a Type II error 
—could you have made? Explain your answer. 


c. Should you generalize the result in part (a) to all people with heart disease? Why or 
why not? 


25. Preventing peanut allergies Refer to Exercise 23. 


a. Construct and interpret a 95% confidence interval for the difference between the true 
proportions. Assume that the conditions for inference are met. 


b. Explain how the confidence interval provides more information than the test in 
Exercise 23. 


26. Lowering bad cholesterol Refer to Exercise 24. 


a. Construct and interpret a 95% confidence interval for the difference between the true 
proportions. Assume that the conditions for inference are met. 


b. Explain how the confidence interval provides more information than the test in 
Exercise 24. 


Exercises 27 and 28 involve the following setting. Some women would like to have children but 
cannot do so for medical reasons. One option for these women is a procedure called in vitro 
fertilization (IVF), which involves fertilizing an egg outside the woman’s body and implanting 
it in her uterus. 


27. Prayer and pregnancy Two hundred women who were about to undergo IVF served as 
subjects in an experiment. Each subject was randomly assigned to either a treatment group 
or a control group. Several people (called intercessors) prayed intentionally for the women 
in the treatment group, although they did not know the women, a process known as 
intercessory prayer. Prayers continued for 3 weeks following IVF. The intercessors did not 


pray for the women in the control group. Here are the results: 44 of the 88 women in the 
treatment group got pregnant, compared to 21 out of 81 in the control group.1® 


Is the pregnancy rate significantly higher for women who received intercessory prayer? To 
find out, researchers perform a test of HO: p1=p2 


versus Ha: p1>p2 


, where p1 and p2 
are the actual pregnancy rates for women like those in the study who do and don’t receive 
intercessory prayer, respectively. 


a. Name the appropriate test and check that the conditions for carrying out this test are 
met. 


b. The appropriate test from part (a) yields a P-value of 0.0007. Interpret this P-value in 
context. 


c. What conclusion should researchers draw at the a=0.05 significance 
level? 


d. The women in the study did not know whether they were being prayed for. Explain 
why this is important. 


28. Acupuncture and pregnancy A study reported in the medical journal Fertility and 
Sterility sought to determine whether the ancient Chinese art of acupuncture could help 
infertile women become pregnant. A total of 160 healthy women who planned to have 
IVF were recruited for the study. Half of the subjects (80) were randomly assigned to 
receive acupuncture 25 minutes before implanting the embryo and again 25 minutes after 
the implant. The remaining 80 women were assigned to a control group and instructed to 
lie still for 25 minutes after the embryo transfer. Results are shown in the table. 


Acupuncture group Control group 
Pregnant 34 21 
Not pregnant 46 59 
Total 80 80 


Is the pregnancy rate significantly higher for women who received acupuncture? To find 
out, researchers perform a test of HO: p1=p2 


versus Ha: pl>p2 


, where p1 and p2 
are the actual pregnancy rates for women like those in the study who do and don’t receive 
acupuncture, respectively. 


a. Name the appropriate test and check that the conditions for carrying out this test are 
met. 


b. The appropriate test from part (a) yields a P-value of 0.0152. Interpret this P-value in 


context. 


c. What conclusion should researchers draw at the a=0.05 significance 
level? 


d. The women in the study knew whether or not they received acupuncture. Explain why 
this is important. 


29. Texting and driving Does providing additional information affect responses to a survey 
question? Two statistics students decided to investigate this issue by asking different 
versions of a question about texting and driving. Fifty mall shoppers were divided into two 
groups of 25 at random. The first group was asked version A and the other half were asked 
version B. Here are the actual questions: 


e Version A: A lot of people text and drive. Are you one of them? 


e Version B: About 6000 deaths occur per year due to texting and driving. Knowing the 
potential consequences, do you text and drive? 

Of the 25 shoppers assigned to version A, 16 admitted to texting and driving. Of the 25 

shoppers assigned to version B, only 12 admitted to texting and driving. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Explain why you should not use the methods of this section to calculate the P-value. 


c. We performed 100 trials of a simulation to see what differences (Version A—Version B) 
in proportions would occur due only to chance variation in the random assignment, 
assuming that the question asked doesn’t matter. A dotplot of the results is shown here. 
What is the estimated P-value? 


d. What conclusion would you draw? 


30. Botox benefits? You may have heard that Botox (botulinum toxin type A) is used for 
cosmetic surgery, but could it have other beneficial uses? A total of 31 patients who 
suffered chronic low-back pain were randomly assigned to receive 200 units of either 
Botox or saline solution through 5 injections at 5 different locations in their backs. The 
saline injection was not expected to reduce pain but was given as a placebo treatment. 
Pain relief was defined as a patient’s pain level being reduced to less than half of the 
original pain level after 8 weeks. Here are the results: 


Botox Saline Total 
Pain relief 10 3 
No pain relief 5 13 
Total 15 16 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Explain why you should not use the methods of this section to calculate the P-value. 


c. We performed 100 trials of a simulation to see what differences in proportions would 
occur due only to chance variation in the random assignment, assuming that the type of 
injection (Botox or saline) doesn’t matter. A dotplot of the results is shown here. What 
is the estimated P-value? 


d. What conclusion would you draw? 


A dot plot representing the stimulated difference (Botox minus Saline) is shown. 


Multiple Choice Select the best answer for Exercises 31-34. 


Exercises 31—33 refer to the following setting. A sample survey interviews SRSs of 500 female 
college students and 550 male college students. Researchers want to determine whether there is 
a difference in the proportion of male and female college students who worked for pay last 
summer. In all, 410 of the females and 484 of the males say they worked for pay last summer. 


31. Let pM and pF be the proportions of all college males and 
females who worked last summer. The hypotheses to be tested are 


o2. 


a. 


HO: pM-pF=0 versus 
Ha: pM-pF40 


HO: pM-pF=0 versus 
Ha: pM-pF>0 


HO: pM-pF=0 versus 
Ha: pM-pF<0 


. HO: pM-pF>0 


versus Ha: pM—-pF=0 


. HO: pM-pF40 


versus Ha: pM—-pF=0 


The researchers report that the results were statistically significant at the 1% level. Which 
of the following is the most appropriate conclusion? 


a. 


Because the P-value is less than 1%, fail to reject HO . There is not 
convincing evidence that the proportion of male college students in the study who 
worked for pay last summer is different from the proportion of female college students 
in the study who worked for pay last summer. 


. Because the P-value is less than 1%, fail to reject HO . There is not 


convincing evidence that the proportion of all male college students who worked for 
pay last summer is different from the proportion of all female college students who 
worked for pay last summer. 


. Because the P-value is less than 1%, reject HO . There is convincing 


evidence that the proportion of all male college students who worked for pay last 
summer is the same as the proportion of all female college students who worked for pay 
last summer. 


. Because the P-value is less than 1%, reject HO . There is convincing 


evidence that the proportion of all male college students in the study who worked for 
pay last summer is different from the proportion of all female college students in the 
study who worked for pay last summer. 


. Because the P-value is less than 1%, reject HO . There is convincing 


evidence that the proportion of all male college students who worked for pay last 
summer is different from the proportion of all female college students who worked for 


pay last summer. 


33. Which of the following is the correct margin of error for a 99% confidence interval for the 
difference in the proportion of male and female college students who worked for pay last 
summer? 


a. 2.5760.851(0.149)550+0.851(0.149)500 


b. 2.5760.851(0.149)1050 


c. 2.5760.880(0.120)550+0.820(0.180)500 


d. 1.9600.851(0.149)550+0.851(0.149)500 


e. 1.9600.880(0.120)550+0.820(0.180)500 


34. In an experiment to learn whether substance M can help restore memory, the brains of 20 
rats were treated to damage their memories. First, the rats were trained to run a maze. 
After a day, 10 rats (determined at random) were given substance M and 7 of them 
succeeded in the maze. Only 2 of the 10 control rats were successful. The two-sample z 
test for the difference in the true proportions 


a. gives Z=2.25, P<0.02 


b. gives z=2.60, P<0.005 


c. gives z=2.25, P<0.04 but not<0.02 


d. should not be used because the Random condition is violated. 
e. should not be used because the Large Counts condition is violated. 
Recycle and Review 


Exercises 35 and 36 refer to the following setting. Thirty randomly selected seniors at Council 
High School were asked to report the age (in years) and mileage of their main vehicles. Here is 
a scatterplot of the data: 


We used Minitab to perform a least-squares regression analysis for these data. Part of the 
computer output from this regression is shown here. 


Predictor Coef Stdev t-ratio P 
Constant —13832 8773 -1.58 0.126 
Age 14954 1546 9.67 0.000 
S = 22723 R-sq = 77.0% R-sq(adj) = 76.1% 


35. Drive my car (3.2) 


a. What is the equation of the least-squares regression line? Be sure to define any symbols 
you use. 


b. Interpret the slope of the least-squares line. 


C. 


One student reported that her 10-year-old car had 110,000 miles on it. Find and 
interpret the residual for this data point. 


36. Drive my car (3.2, 4.3) 


a. 


Explain what the value of r2 tells you about how well the least-squares line 
fits the data. 


. The mean age of the students’ cars in the sample was x =5 years. 


Find the mean mileage of the cars in the sample. 


. Interpret the value of s 


. Would it be reasonable to use the least-squares line to predict a car’s mileage from its 


age for a Council High School teacher? Justify your answer. 


SECTION 10.2 Comparing Two Means 


LEARNING TARGETS By the end of the section, you should be able to: 


e Describe the shape, center, and variability of the sampling distribution of 
305 


= —p——= 0.200 = 20.0% 
xX 1-x 21526 °L1 


5? 


Determine whether the conditions are met for doing inference about a difference 
between two means. 


Construct and interpret a confidence interval for a difference between two 
means. 


Calculate the standardized test statistic and P-value for a test about a difference 
between two means. 


Perform a significance test about a difference between two means. 


In the preceding section, we developed methods for comparing two proportions. What if we 


want to compare the mean of some quantitative variable for the individuals in Population 1 and 
305 


. == 0.200 = 20.0% 
Population 2? Our parameters of interest are the population means 111526 1 and 


305 

——= 0.200 = 20.0%). 

[121526 #2 . Once again, the best approach is to take independent random samples 
305 

—,——= 0.200 = 20.0%% — 

from each population and to compare the sample means x 11526 “1 and x 2 


305 _ 0.200 = 20.0%= 
1526 = U.2 = £uU. "Eo ; 


Suppose we want to compare the average effectiveness of two treatments in a randomized 
305 


305 
experiment. In this case, the parameters 111526 “1 and 1121526 
the true mean responses for Treatment 1 and Treatment 2, respectively. We use the mean 
305 305 
—, ——= 0.200 = 20.0% % —5——= 0.200 = 20.0% %. 
response in the two groups, x 11526 “1 and x 21526 “2 , to make the 
comparison. Here’s a table that summarizes these two situations: 


0.200 = 20.0%, 0.200 = 20.0% 45 


are 


Population or treatment Parameter Statistic Sample size 
1 ae = 0.200 = 20.0% = eS: = ().200 = 20.0 %z 305, = (),.200 = 20.0 %. 
We xX eee Ne 
2 Kel = (0.200 = 20.0% _— 305 = ().200 = 20.0 %= 305 = (),.200 = 20.0 %. 
Uae ee xX 2 ee NO ee 


_ We compare the populations or treatments by doing inference about the difference 1-2 


305 
= 0.200 = 20.0% 4, — yy: mer ee 
1526 #1 ~ #2 between the parameters. The statistic that estimates this difference is 
305 
4 y= 0.200 = 20.0% RF, — ZF. 
the difference between the two sample means, x 1-x 21526 “17—#2 .To use 
305 


0.200 = 


_ = 20.0%7, — *®. . ‘ F ; , F 
x 1-x 21526 “1 ~ #2 for inference, we must know its sampling distribution. 


The Sampling Distribution of a Difference Between Two Means 


oe 
To explore the sampling distribution of x 1-x 21526) °?"°~ °°°"%1 — €2 | Jet’s start with two 


Normally distributed populations having known means and standard deviations. Based on 
information from the U.S. National Health and Nutrition Examination Survey (NHANES), the 
heights of 10-year-old girls can be modeled by a Normal distribution with mean pG=56.4 


305 305 
= 0.200 = 20.0%), . — ; go ——_= 0.200 = 20.0%, — 
1526 ‘tg = 56-4 inches and standard deviation oG=2.71526 ‘ag = 2.7 


inches. The heights of 10-year-old boys can be modeled by a Normal distribution with mean 


305 305 
——= 0.200 = 20.0%): p = od ——_ = 0.200 = 20.0% 
UB=55.71526 tp = 59.7 inches and standard deviation oB=3.81526 x 


7p = 3.8 inches.48 The table summarizes this information. 


Monkey Business Images/Shutterstock 


Population Shape Mean Standard deviation 
10-year-old Approximatel 305 _ 0.200 = 20.0%,,,. — 305 _ 9.200 = 20.0% 
‘irs aera : uG=56.4 1526 7° = 29-94ug = 56.4 ogo 71526 ~ 7 = 29940 = 2.7 
in in 
10-year-old Approximatel 305 _ 0.200 = 20.0%,,., — 305 _ 9.200 = 20.0% 
ioe prteeiet y UB=55.71526 0.200 = 20.0% 4 = 55.7 ps aisag = 0-200 = 20.0% rn = 3.8 


in in 


Suppose we take independent SRSs of 12 girls and 8 boys of this age and measure their 


heights. What can we say about the difference x G-x B1526— 0.200 = 20.0%G — Zp in the 
average heights of the sample of girls and the sample of boys? 


305, a 
In Chapter 7, we saw that the sampling distribution of asample mean x 1526 — See eae 


has the following properties: 


Shape: (1) If the population distribution is Normal, then so is the sampling distribution of x — 
305 


iia = Ng (2) If the population distribution isn’t Normal, the sampling distribution of 
ee 
x 1526) 2200 = 20.0% Will be approximately Normal if the sample size is large enough (say, 


oa 
n>301526”"° = 7°-°"°n > 30 ) by the central limit theorem (CLT). 


305 
—_ ——= 0.200 = 20.0% 7) 
Center: Lx =[11526 ‘Center : uz = 


a 0.200 = 20.0% 


: Variability : co; =-—ifn < 0.10N 
Variability: ox =on if n<0.10Ni526 vn 


_ ~ 20.0% __ a ~ 20.0% 
For the sampling distributions of x G1526 — 9200 = MO8E6 and x Bie Oe ED in 
this case: 
305 _ 0.200 = 20.0% 
Sampling distribution of x Gi526 ~~ G Sampling distribt 
Shape Approximately Normal, because the population distribution is Approximately No! 
approximately Normal approximately Nor 
Center 308 , one ae inches = 
ux G=pG=56.41526 7 0-200 = 20-0% uz, = He = 56.4 ux” B=yB=55.7.... 
Variability 305 iin ee es ES eh 8 
ox G=oGnG=2.712=0.781526 ~ °200= 20.0%" 2G ig ~ Vi2 ox B=oBnB=3.88 
inch because 305 % of all 10-year-old inches because 
12<101686 Oe POS = 10 y gi 
girls in the United States. boys in the United 


— 2 = 0.200 = 20.0%% , — = 
What about the sampling distribution of x G-x Bi5s26 ~~ ““G ~ ©B 2? We used 


software to take an SRS of 12 ten-year-old girls and 8 ten-year-old boys. Our first set of 


305 catia _ 305, . : 
samples gave x G=56.091526 — 0.200 = 20.0%F ¢ = 56.09 inches and x B=54.681526 "700 = 20.0% 


- ee 305_ ~ 90.09 
€p = 54.68 inches, resulting in a difference of x G-x B=56.09-54.68=1.41i526 °°"? ~ mae 


Tg — Fp = 56.09 — 54.68 = 1.41 inches. A red dot for this value appears in Figure 10.4. The 
dotplot shows the results of repeating this process 1000 times. 


Shape: Approximately Normal 
Center: Mean = 0.7 


Variability: SD = 1.55 


oad —2 0 2 4 6 
¥,-X, 


FIGURE 10.4 Simulated sampling distribution of the difference in sample means x G-x B 


305 305 
= 0.200 = = 0.200 = 20.0 tng —_ 


1526 20.0% “ZG — £B in 1000 SRSs of size nG=121526 12 from an 


305, aaa 
approximately Normally distributed population with uG=56.41526 — 0.200 = 20.011G = 56.4 inches 


305 _ 99 200 = 20.0% 305 


and oG=2.71526 ‘oq = 2.7 inches and 1000 SRSs of size nB=81526 "70? = 20.0” 


305 coeinae 
np = 8 from an approximately Normally distributed population with uB=55.71526— Se a 


05 
9 » 4 
— 0.200 —_ 20.0%¢ p 


3 
4p = 55.7 inches and oB=3.81526 = 3.8 inches. 


305 a 
The figure suggests that the sampling distribution of x G-x B1526— T= HANG — Ep 


has an approximately Normal shape. This makes sense from what you learned in Section 6.2 
305 
—.——= 0.200 = 20.0 %% 
because we are subtracting two independent random variables, x G1526 "£G and 


i = 0.200 = 


x B1526 °"F eB | that have approximately Normal distributions. 


The mean of the sampling distribution is 0.7. The true mean height of all 10-year-old girls 
is tG=56.41526 aa ~ 0.200 = 20.0°41; = 56.4 inches and the true mean height of all 10-year-old boys 
is pB=55. 8.7 0.200 = 20.0118 = 55.7 inches. We expect the difference x G-x B 
1526 SOsEG —E wd 1 center on the actual difference in the population means, 


uG-pB=56.4-55.7=0.715207 0.200 = 20.0% 1 — up = 56.4 — 55.7 = 0.7 inch. The standard 
deviation of the sampling distribution is 1.55 inches. It can be found using the formula 


305 . iene 
oG2nG+oB2nB=2.7212+3.828=1.551526 | 7) = 9.0” 


That is, the difference (Girls — Boys) in the sample mean heights typically varies by about 1.55 
inches from the true difference in means of 0.7 inch. 


ee _ 
THE SAMPLING DISTRIBUTION OF x 1-x 21526 9700 = 70-°%@ — 2 


0.200 = 20.0%) 6 nn Population 1 with mean p11 


305 
ae 20.0% = 0.200 = 20.0%, ; 
1526 “1 and standard deviation 011526 "71 and an independent SRS of 


305 305 
’ —— = 0.200 = 20.0% 0.200 = 20.0% 
size n21526 “n2 from Population 2 with mean 121526 — “pa 


305 
ae —— — 0.200 = 20.0%, 
deviation 021526 “02 . Then: 


305 
Choose an SRS of size n11526 — 
and standard 


a oe ee 
e The sampling distribution of x 1-x 21526 ~ 0.200 = 20-0°F — £2 is Normal if both 


population distributions are Normal. It is approximately Normal if both sample sizes are 


305, cae ; 
large (n1>30 and n2>30)is26~ °°? = 79°” (n1 > 30andnz > 30) oF if one population is 
Normally distributed and the other sample size is large. 


305 - 
e The mean of the sampling distribution of x 1-x 21526 — Seen Es Es is 


= 0.200 = 20.0% 5,5, = M1 — He 


3 
ux 1-x 2=1-121526 


3 ‘ e ov — 
e The standard deviation of the sampling distribution of x 1-x 21526 — Same Ne — Ee 


is 


2 2 

305 5 =| ae: 

ox“ 1-x-2=01 21 +0222 1598 ~ 0-200 = 20.08 #12 \ ny ng 
a 
as long as the 10% condition is met for both samples: n1<0.10N1i526 °°"? ~ °°" 


c 


305 - 
nm <0.10M) and n2<0.10N2i538" 0.200 = 20.0%n» < 0.10No | 


Note that the formula for the standard deviation of the sampling distribution is only correct 
for independent random samples from the corresponding populations. As in Section 10.1, we 
will adjust the Random condition for inference about a difference between two means to reflect 
this added requirement. The standard deviation of the sampling distribution tells us how much 
the difference in sample means will typically vary from the difference in the population means 
if we repeat the random sampling process many times. 


Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND STANDARD DEVIATION 
mae 9 20.0% = 
OF THE SAMPLING DISTRIBUTION OF x -1-x 21526 °?00 = 2°.°%%1-Z2 COME 
305 non 9 05 Om 
FROM? Both x lis” 200 = 20.05% and x 21526 70> 20.045 are random variables. 


305, since 
That is, eso values would vary in repeated independent SRSs of size n11526— aa | 


= 0.200 = 20.0% 
and n21526 “n2 . Independent a samples yield independent random 
305 
—,——= 0.200 = 20.0% - 0.200 = 20.0%. eile 
ee x 11526 “©1 and x 21526 2 . The statistic x 1-x 2 
——= 0.200 = 20.0%F, — Fy 
i536 1 is the difference of these two independent random variables. 


In Chapter 6, we learned that for any two random variables X and Y, 


305 ; ; = 


305 


- 0.200 = 20.0% 0.200 = 20.0% 5. 
1 and x 21526 ©2 we have 


305 
For the random variables x 11526 


305 
ux 1-x 2="x 1-px 2=p1-p21526 ~ 


0.200 = 20.02 obs - 23 — Me, — Beg = #1 — B2 
We also learned in Chapter 6 that for independent random variables X and Y, 


305 . 
oX-Y2=0X2+o0Y2i52%0 200 = 20.0% y = 0% + OF 


305 305 
’ —,——= 0.200 = 20.0% = 0.200 = 20.0%, 
For the random variables x 11526 “£1 and x 21526 “£2 we have 


ox 1-x 22=o0x 12+ox 22=(01n1)2+(o62n2)2=012n1+022n2 


2 i 2 2 
o a — oe + o2 — a — 72 ama: eat J 
%1—22 ad | z2 J /ny ny ng 


[a4 
— —_ 305 € . w0 = en = + o_o 
So ox 1-x 2=012n1+022n2 i596 = 2200 = 20.0% 21-22 ~ Y mT ty 


When the conditions are met, we can use the Normal density curve shown in Figure 10.5 to 


305, Sopnigien, - css 
model the sampling distribution of x 1-x 21526 ~ 0.200 = 20-0F) — £2 _ Note that this would 
305, catia ai 
allow us to calculate probabilities involving x 1-x 21526 — 0.200 = 20.04% — Z2 with a Normal 
distribution. 
Standard deviation: 
o o 
1, Se 
Values of X, —X, 
305 _ 0.200 = 20.0% 
FIGURE 10.5 Select independent SRSs from two populations having means 11526 
305 305 


= 0.200 = 20.0% 


eu ~~ 0.200 
‘#2 and standard deviations 011526 


Ul —— = 20.0% 
1 and y21526 1 and 02 
305 305 305 


—— = 0.200 = 20.0% ——= 0.200 = 20.0% 0.200 = 20.0% 
1526 “82, The two sample means are X11526 . 


°X and X21526~ 
X2_ When the conditions are met, the sampling distribution of the difference x 1-x 2 


ee 0.200 = sl = 0.200 = 20.0% 77, — fio 


ORs. . . ; ; ~ ——— 
1526 "£1 — £2 is approximately Normal with mean p 1-p 21526 


305 ft 4 3 
= 0.200 = 20.0%! mT Te 


and standard deviation 0612n1+022n2 1526 


Noh 


EXAMPLE | Medium or large drink? (>) 
305 


Describing the sampling distribution of x 1-x 21526 
ty — Zo 


= 0.200 = 20.0% 


RichLegg/Getty Images 


PROBLEM: A fast-food restaurant uses an automated filling machine to pour its soft drinks. 
The machine has different settings for small, medium, and large drink cups. According to the 
machine’s manufacturer, when the large setting is chosen, the amount of liquid L dispensed 
by the machine follows a Normal distribution with mean 27 ounces and standard deviation 
0.8 ounce. When the medium setting is chosen, the amount of liquid M dispensed follows a 
Normal distribution with mean 17 ounces and standard deviation 0.5 ounce. To test the 
manufacturer’s claim, the restaurant manager measures the amount of liquid in each of 20 
cups filled using the large setting and 25 cups filled using the medium setting. Let x L-x M 


1526 *0.0Z, ~ Zu pe the difference in the sample mean amount of liquid under the 
two settings. 


a 
a. What is the shape of the sampling distribution of x L-x M1526 — 00 = OE T — OMe 


Why? 
b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling distribution. 
SOLUTION: 


a. Normal, because both population distributions are Normal. 


305 


6. uxL — x"M = 27 - 17 = 101626 = °O = 0.0% iz, zy = 27 — 17 = 10 


™L ~ 7M ounces 


0.200 = 20. O%pz - z, — 1 — fe 


305 
Ux 1-x 2=y1-U21526 © 


305 

c ox L-x™M = 0.80220+0.50225 = 0.2051526 
a te y OOO, OF 

Orr -tM aa V 20 20 25 = 0. 05 ounce 


= 0.200 = 20.0% 


— = ——_= 0.200 = 20.0%%2,-z2 — 
Ox 1-x 2=012n1+022n21526 i 


305 mr nerre . a3: 
abi difference (Large cup ae iveR OTe cup) 1526 = 0.200 = 20.0% (Large cup — Medium cup) in 
the sample mean amounts of liquid typically varies by about 0.2 ounce from the true 
difference in means of 10 ounces. 


Note that we do not have to check the 10% condition because we are not sampling 
without replacement from a finite population. 


FOR PRACTICE, TRY EXERCISE 37 


305 
Confidence Intervals for p1 — p2i526 


= 0.200 = 20.0% 41) — ple 


When data come from two independent random samples or two groups in a randomized 


305 
_ us 4 nee = 0.200 = 20.0% 5, _ FH, : 
experiment (the Random condition), the statistic x 1-x 21526 “©1 — ©2 is our best 


305 
——= 0.200 = 20.0% 4, — ’ 
guess for the value of 11—p121526 ‘#1 ~ #2 , Before constructing a confidence 


interval for a difference in means, we must check that the conditions for performing inference 
are met. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A DIFFERENCE IN MEANS 


e Random: The data come from two independent random samples or from two groups ina 
randomized experiment. 


eee mange oi 
© 10%: When sampling without replacement, n1<0.10N11526 — 0.200 = 20.07%, < 0.10Nj 


305 
——= 0,200 = 20.0%,. — 
and n2<0.10N21526 700 = 7%9""ng < 0.10.N2 | 


Normal/Large Sample: For each sample, the corresponding population distribution (or 


the true distribution of response to the treatment) is Normal or the sample size is large 
305 


—— = 0.200 = 20.0%. 
n>301526 - 


n = 30 . For each sample, if the population (treatment) distribution 


305 
——= 0,200 = 20.0% 
has unknown shape and n<301526 : 


no strong skewness or outliers. 


n < 30, a graph of the sample data shows 


Recall from Chapter 4 that the Random condition is important for determining the scope of 
inference. Random sampling allows us to generalize our results to the populations of interest; 
random assignment in an experiment permits us to draw cause-and-effect conclusions. 


EXAMPLE |Do bigger apartments cost more money? » 
Checking conditions 


NOW RENTING 


Justin Sullivan/Getty Images 


PROBLEM: A college student wants to compare the cost of one- and two-bedroom 
apartments near campus. She collects the following data on monthly rents (in dollars) for a 
random sample of 10 apartments of each type. 


1 bedroom 500 650 600 505 450 550 915 495 650 eco 
2 bedroom 595 500 580 650 675 675 750 500 495 670 
35 _ 0.200 = 20.0% 4, = 
Let p1=1526 ~ “Fl = the true mean monthly rent of all one-bedroom apartments 


305 J 
near campus and p2=1526 “#2 = the true mean monthly rent of all two-bedroom 
apartments near campus. Check if the conditions for calculating a confidence interval for 
305 
1—p121526 


0.200 = 20.0° 


0.200 = 20.0% 1, — pp 
#41 —~ #2 are met. 


SOLUTION: 


e Random: Independent random samples of 10 one-bedroom apartments and 10 two- 
bedroom apartments near campus. / 


305 
2 — 0.200 = 20.0% 1) 
© 10%: We can assume that 10<10%1526 10 < 10% % of all one-bedroom 
305 
S — 0.200 = 20.0% 1) 
apartments near campus and that 10<10%1526 10 < 10% of all two- 


bedroom apartments near campus. / 


Be sure to mention independent random samples from the populations of interest 


when checking the Random condition. 


e Normal/Large Sample? The sample sizes are small, but the dotplots don’t show any 


outliers OT strong skewness. / 


Because there is no strong skewness or outliers in either sample, it is plausible that 


the population distributions of monthly rent for one-bedroom and two-bedroom 
apartments near the campus are Normal. 


1 bedroom 


ee ee e eS 
SR eee en Se, SE Ee Se Sea ME: 
350 400 450 S500 sS0 600 650 #00 #50 


Monthly rent ($) 


FOR PRACTICE, TRY EXERCISE 41 


If the conditions are met, we can use our familiar formula to calculate a confidence interval 
305, saad 
305 
statistic+(critical value)-(standard deviation of statistic)1526 — 


statistic + (criticalvalue) - (standarddeviationofstatistic) 


0.200 = 20.0% 


As mentioned earlier, the standard deviation of the sampling distribution of x 1-x 2 
305 


0.200 = 20.0% — Fo is 


1526 
—_ oe oz 
Be Oz = =! 1 2 
— = 0.200 = 20.0% 71~*2 +— 
ox 1-x 2=012n1+022n? 1526 a oe 


305 305 
——= 0.200 = 20.0%, ——= 0.200 = 20.0%, 
Because we usually don’t know the values of 011526 "A. and 021526 "02 , 
305 
; oe ——= 0.200 = 20.0% g 
we replace them with the sample standard deviations s11526 "S81 and s2 
305 305 


——-= 0.200 = 20.0%, 


1526 = 0.200 = 20.0% F 5 — Bo 


. The result is the standard error of x 1-x 21526 


a 7 ae 0.200 = 20 grea 
SEx 1-x 2=s12n1+s2202a 


This value estimates how much the difference in sample means will typically vary from the 
difference in the true means if we repeat the random sampling or random assignment many 
times. 


305 
i F aa —— = 0.200 = 20.0%, 
In the unlikely event that both population standard deviations 011526 "71 and o2 
305 


= 0.200 = 20.0%. . 
1526 92 are known, the interval becomes 


- 


305 


(x 1-x 2)+z*012n1+022n21526 


(£1 — Z2) + 2*4| —+— 
= 0.200 = 20.0% ny ng 
This interval is known as a two-sample z interval for a difference in means. It is rarely used in 
practice. 


When the Normal/Large Sample condition is met, we find the critical value t* 


cel = 0.200 = 
1526 


20.0%" for a given confidence level using Table B or technology. Our confidence 


= 0.200 = 20-0% 4) — He ic therefore 


interval for 11—y121526 


305 


- a ae . os y= 0,200 = 20.0% 
Sstatistic+(critical value)-(standard deviation of statistic) 1526 Vee ae 
statistic + (criticalvalue) - (standard deviation of statistic) 

oe 305 _ ong — on ne (£1 — E2) +t fac UT sD 
=(x1-x”2)4t¥s12n1+s22n2i526~ 0200 = 20.0% as 


This is often called a two-sample t interval for a difference between two means. 


There is just one issue left to resolve: What df should we use to find the t*1526 
t* critical value? It turns out that there are two practical options. 


= 0.200 = 20.0% 


Option 1 (Technology): Use the t distribution with degrees of freedom calculated from the 
data by the following formula. Note that the df given by this formula is usually not a whole 
number. This option results in confidence intervals with a margin of error that is approximately 
correct for the stated confidence level. A significance test using this option gives a P-value that 
is approximately correct. 


Statisticians B. L. Welch and F. E. Satterthwaite discovered this fairly remarkable formula in the 
1940s. 


305 


df=(s12n1+s22n2)21n1-1(s12n1)2+1n2-1(s22n2)21526 70 = 20.0% 
s2 62 \* 
ny ng 
df = 
2\2 2\ 2 
1 $4 1 85 


Option 2 (Conservative): Use the ¢ distribution with degrees of freedom equal to the smaller 
305 


305 
——= 0.200 = 20.0%p, - == 0.200 = 20.0%. 
of n1-11526 ‘my ~ 1 and n2-11526 ‘n2 —1 | With this option, the 


resulting confidence interval has a margin of error as large as or larger than is needed for the 
desired confidence level. A significance test using this option gives a P-value greater than or 


equal to the true P-value. 


Simulation studies reveal that the two-sample t procedures are most accurate when the sizes of 
the two samples are equal and the population (treatment) distributions have similar shapes. In 
planning a two-sample study, choose equal sample sizes whenever possible. 


As you can imagine, Option 2 was much more popular in the days when Table B and a 
four-function calculator were the main calculation tools. 


TWO-SAMPLE t INTERVAL FOR A DIFFERENCE BETWEEN TWO 
MEANS 


a 
When the conditions are met, a C%1526 °°"? = *°°"°C'% confidence interval for 1-2 
305 


— 0.200 = 20.0% : 
596 = 0-200 = 20.0% 11 — pho jo 


8 
oe i weet —@Z2)tt+ 1,2 
(x 1-x 2)+t*s12n1+s22n2ae my MO 


305 
——-= 0.200 = 

where t*1526 
305 


between —t*1526 — ~ °0-0¢x for the t distribution with degrees 


of freedom using either Option 1 (technology) or Option 2 (the smaller of n1—-1 
305 is Bl cece ene 
ae ey Ad ond n2-1ie ea ). 


305 
20.0%45 ; me ’ —_— 0.200 = 20.0%, ; 
"t* is the critical value with C%1526 °C% of its area 


0.200 0.200 


305 
= 20.0% SA 
"—t* and t*1526 


Let’s return to the apartment-rental example. Recall that the college student took 


305 
; —— = 0.200 = 20.0%), — 
independent random samples of n1=101526 ny = 


5 
——= 0.200 = 20.0%), — ane 
n2=101526 ‘ng = 10 two-bedroom apartments. Here are summary statistics on the 


monthly rents (in dollars) of these apartments: 


10 one-bedroom apartments and 


Group name n Mean SD Min Q1 Med Q3 Max 
1: 1-bedroom 10 531 82.792 395 495 510.0 600 650 
2: 2-bedroom 10 609 89.312 495 500 622.5 675 750 


We already confirmed that the conditions are met. 


Using Option 1, df=17.898i526 °°"? ~ *°-°”“df = 17.898 and the 90% confidence interval is ( 


305 ; i | ae 00 0% 
~144.8021526~ "70° = 7°-9—144.802 | -11.198i526~ °°? = 7°-°*—11.198 ) | See the Technology 
Corner after the next example for details on how to obtain this interval on the TI-83/84. Using 


Option 2, df=the smaller of ni-19e oa = M48) — 2. and 121-958 Same = ae 


nz —~1=9 | The critical value for a 90% confidence level in a t distribution with 9 degrees of 


= 0.200 = 20.0 %4 5 


305 
freedom is t*=1.8331526 = 1.833 . So the resulting interval is 


305 
(531—-609)+1.83382.792210+89.312210=—78+70.591=(—148.591, —7.409) 1526 


g9.7922 89,3122 
1- + 1.8334 /— 
(531 — 609) 8334) a 


= —78 + 70.591 
= (—148.591, — 7.409) 


= 0.200 = 20.0% 


Notice that this interval is wider than the one obtained from technology. That is, Option 1 
305, eens 
produces a more precise estimate of 1—p121526 — Same = DON 2 than Option 2. 
305, ee 
We are 90% confident that the interval from -144.8021526 "7? ~ *°-°"— 144,802 to 


~11.198156 ~ 0.200 = 20.0% 11.198 dollars captures u1-p2=8 = 0200 = 00% 1. — ‘a= the 
difference in the true mean monthly rents of one-bedroom and two-bedroom apartments close 
to the college campus. The interval suggests that the mean monthly rent of one-bedroom 
apartments is between $11.20 and $144.80 less than the mean monthly rent for two-bedroom 
apartments near campus. 


305 
Because Option 1 produces more precise estimates of 11-y21526 — 


0.200 = 20.0% 41) — pg than 


Option 2, we will always interpret the confidence interval obtained from technology. 


The following example shows how to construct and interpret a confidence interval for a 
difference in means. As usual with inference problems, we follow the four-step process. 


EXAMPLE | Big trees, small trees, short trees, tall trees >) 
Confidence interval for 4 — He 


Danita Delimont/Getty Images 


PROBLEM: The Wade Tract Preserve in Georgia is an old-growth forest of longleaf pines 
that has survived in a relatively undisturbed state for hundreds of years. One question of 
interest to foresters who study the area is “How do the sizes of longleaf pine trees in the 
northern and southern halves of the forest compare?” To find out, researchers took random 
samples of 30 trees from each half and measured the diameter at breast height (DBH) in 
centimeters./2 Here are summary statistics and comparative boxplots of the data: 


Descriptive Statistics: North, South 


Group n Mean StDev 
North 30 23.70 17.50 
South 30 34.53 14.26 


a. Based on the graph and numerical summaries, write a few sentences comparing the sizes 
of longleaf pine trees in the two halves of the forest. 


b. Construct and interpret a 90% confidence interval for the difference in the mean DBH of 
longleaf pines in the northern and southern halves of the Wade Tract Preserve. 


SOLUTION: 


a. §S hape: The distribution of DBH in the northern sample appears skewed to the right, 
while the distribution of DBH in the southern sample appears skewed to the left. 


It would not be correct to say that the northern half of the forest is skewed to the 


right! Only distributions of quantitative variables (like DBH) can be skewed. 


Outliers: No outliers are present in either sample. 

Center: It appears that trees in the southern half of the forest have larger diameters. 
The mean and median DBH for the southern sample are much larger than the 
corresponding values for the northern sample. 


Furthermore, the boxplots show that more than 75% of the southern trees have 


diameters that are above the northern sample’s median. 


Variability: There is more variability in the DBH of the northern longleaf ‘pines. The 
range, IQR, and standard deviation are all larger for the northern sample. 


Don’t forget to include context (the variable of interest) when comparing 


distributions of quantitative data. In this case, that’s DBH. 


305 ‘ ‘ A 305 ‘ ‘ ow 
6. STATE: 90% C4 for ee 508 0.200 = 20.0% 1, — jay here ui =1596 0.200 = 20.0% 4, = 
the true mean DBH of all trees in the southern half of the forest and Uy 


305 ; , or 
inn 0 = 20-99 = the true mean DBH of all trees in the northern half of the forest. 


Be sure to indicate the order of subtraction when defining the parameter. Then you 


can mimic the wording in your conclusion. 


0.200 = 20.0% 41) — pg 


305 
PLAN: Tr wo-sample t interval for U1 — 21526 


e Random: Independent random samples of 30 trees each from the northern and southern 
halves of the forest. / 


305 a : 
© 10%: Assume 30<10%1526 9? = 79-930 < 10% of all trees in the northern half of the 
305 aaaee E 
forest and 30<10%1526 7"? = 79-°%30 < 10% of all trees in the southern half of the 


forest. / 


305 . mecca : 
—— = (}.200 = 20.0 in, = 30 


305 an 
e Normal/Large Sample: N1 = 302301526 > 30 and n2 = 30230 


a 
y= 0.200 = 20.0%, — 30> 30 y 
305 305 
: > — 0.200 = 20.0%, — > — 0,200 = 20.0%. — 
DO: x71 = 34.531526 °F) = 34.53 oy = 14.261526 ‘81 = 14.26 ny = 30 
305 
1526 


= 0.200 = 20.0%, — 30 


305 an aver 305 of 

x 2 = 23.701526 700 = 20-.0%F. = 23.70 | 9 = 17501526 200 = 29-9%s2 = 17.50 | 2 = 30 
305 
1526 


= 0.200 = 20.0%, — 30 


Option 1: 2- SampT Int gives (3.9362, 17.724) using df = 55. >38ieh = 0.200 = 20.0% f = 55.728 


Refer to Technology Corner 25 for details on how to do calculations for a 2-sample t 


interval on the TI-83/84. 


; 305 rs 
Option 2: df =29, a = 0.200 = 20.0% df = 29, m — 1.6991526 7 = 79-9%4 « = 1.699 


305 » 
df=the smaller of n1-1 and n2-1 1526 = 0.200 = 20.0%qf — the smaller of ry land Tho 1 


= = 0.200 = 20.0% 
= (34.53 — 23.70)+1.69914.26230+17.502301526 sa 


‘ ‘ * / 14.267 , 17.50? 
= (34.53 — 23.70) + 1.699,/ 420 +. 125 


z 
= 0.200 = 20.0% 


305 
(x 1-x 2)+t*s12n1+s22n21526 


305 " iio 
= 10.83+7.00i1526 700 = 29-9 — 10.83 + 7.00 


se 0.200 = 20.0% 
= (3.83, 17.83) 1526 ~ 200 = 20.0%= (3.83, 17.83) 


CONCLUDE: We are 90% confident that the interval from 3.9362 to 17.724 centimeters 
305 


captures w1 — w2 =1526 0.200 = 20-001). — a Ear difference in the true mean DBH of all 
the southern trees and the true mean DBH 7: all the northern trees. 


FOR PRACTICE, TRY EXERCISE 45 


The 90% confidence interval in the example does not include 0. This gives convincing 
evidence that the difference in the mean diameter of northern and southern trees in the Wade 


Tract Preserve isn’t 0. However, wuts confidence interval provides more information than a 


= 0.200 = 20.0% 


simple reject or fail to reject HOI 26 “Ho conclusion. It gives a set of plausible 


soe 
values for p1- 121526 26 . The interval suggests that the mean diameter of the 
southern trees is between 3.94 and 17.72 cm larger than the mean diameter of the northern 


trees. 


0.200 = 20.0% oy — ple 


We chose the parameters in the DBH example so that x 1-x ~ OTR T= MIE 23 


0.200 = 


would be positive. wae if we had defined 1111526 *0-0"14) as the true mean DBH of the 


= 0.200 = 20.0 % 1 


305 


confidence interval for p1—-p21526 
305 


sa5q = 0-200 = 20.0%(—17.72, — 3.94) 


as the true mean DBH of the southern trees? The 90% 


0.200 = 20.0% 4) — po tom technology is (—17.72, —3.94) 


northern trees and [121526 


. This interval suggests that the mean diameter of the 
northern trees is between 3.94 and 17.72 cm smaller than the mean diameter of the southern 
trees. Changing the order of subtraction doesn’t change the result. 

Notice that the interval produced by technology is narrower than the one calculated using 
the conservative method. That’s because technology uses the formula on page 651 to obtain a 
larger (and more accurate) value of df. 

As with other inference procedures, you can use technology to perform the calculations in 
the “Do” step. Remember that technology comes with potential benefits and risks on the AP® 
Statistics exam. 


25. Technology Corner | CONSTRUCTING A CONFIDENCE 
INTERVAL FOR A DIFFERENCE IN MEANS |) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


You can use the two-sample t interval option on the TI-83/84 to construct a confidence 
interval for the difference between two means. Let’s confirm the results of the two previous 
examples with this feature. 


1. Do bigger apartments cost more money? (page 649) 
e Enter the one-bedroom monthly rents in L1 and the two-bedroom monthly rents in L2. 
e Press [STAT], then choose TESTS and 2-SampTInt.... 
e Choose Data as the input method and enter the inputs as shown. 


e Enter the confidence level: C-level: 0.90. For Pooled: choose “No.” We’ll discuss 
pooling later. 


e Highlight Calculate and press [ENTER], 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampTiInt 
Inet :PEe) Stats 
List1i:Li 
List2:L2 
Freqi:1 
Freqa2:1 
C-Level:@.9 
Pooled:ff Yes 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampT Int 

(-144.8, -11.2) 
df=17. 89752842 
*x1=531 

x2=609 

Sx1=82. 7915723 
Sx2=89.31218655 
ni=19 

n2=10 


2. Big Trees, Small Trees, Short Trees, Tall Trees (page 652) 
e Press [STAT], then choose TESTS and 2-SampTInt.... 
e Choose Stats as the input method and enter the summary statistics as shown. 


e Enter the confidence level: C-level: 0.90. For Pooled: choose “No.” We’ll discuss 
pooling later. 


Highlight Calculate and press [ENTER], 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampT int 
(3.9362,17. 724) 
df=55. 7276914 
x1=34.53 
x2=23.7 
Sx1=14.26 
Sx2=17.5 
ni=30 
n2=39 


2-SampT Int 

Inet:Data 
x1:34.53 
Sx1:14.26 

ni:3@ 

Xx2:23.7 

Sx2i17.5 

n2:38 
C-Level:@.9 
Pooled: Yes 


AP® EXAM TIP 

305 os 
The formula for the two-sample t interval for p1—-p2is2e "°° = 7°11 — #2 often 
leads to calculation errors by students. Also, the interval produced by technology is 
narrower than the one calculated using the conservative method. As a result, your 
teacher may recommend using the calculator’s 2-SampTInt feature to compute the 


confidence interval. Be sure to name the procedure (two-sample t interval for u1—-y2 
305 2 2 % ‘i “ x 
i526 700 = °9-°"4. — 2) in the “Plan” step and give the interval (3.9362, 17.724) and 


df (55.728) in the “Do” step. 


CHECK YOUR UNDERSTANDING 


. 


”, Mr. Wilcox’s class performed an experiment to investigate whether drinking a 
caffeinated beverage would increase pulse rates. Twenty students in the class volunteered to 
take part in the experiment. All of the students measured their initial pulse rates (in beats per 
minute). Then Mr. Wilcox randomly assigned the students into two groups of 10. Each 
student in the first group drank 12 ounces of cola with caffeine. Each student in the second 
group drank 12 ounces of caffeine-free cola. All students then measured their pulse rates 
again. The table displays the change in pulse rate for the students in both groups. 


Change in pulse rate (Final pulse rate — Initial pulse rate) Mean change 
Caffeine 8 3 5 1 4 0 6 ad) 4 0 3.2 
No caffeine 3 =2 4 =1 5 5 1 2 =] 4 2.0 


1. Construct and interpret a 95% confidence interval for the difference in true mean change 
in pulse rate for subjects like these who drink caffeine versus who drink no caffeine. 


2. What does the interval in Question 1 suggest about whether caffeine increases the average 
pulse rate of subjects like these? Justify your answer. 


0.200 = 20.0% 4, — pay 


Significance Tests for ul —p2is 


An observed difference between two sample means can reflect an actual difference in the 
305 305 
——= 0.200 = 20.0%),, — ——= 0.200 = 20.0%), ; 
parameters 11—p121526 "Ha — B2 and 121526 #2 , or it may just be due to 


chance variation in random sampling or random assignment. Significance tests help us decide 
which explanation makes more sense. 


Stating Hypotheses and Checking Conditions The null hypothesis has the general form 


305 . ’ 
HO: ,:1-11:2=hypothesized valuciae 0.200 = 20.0% Hy: j4; — 2 = hypothesized value 
We’re often interested in situations in which the hypothesized difference is 0. Then the null 
hypothesis says that there is no difference between the two parameters: 
yp y Pp 


305 ; : - 
HO: 1.1-112=01526 = 0.200 = 20.0% Hy: [ty fg = 0 


(You will sometimes see the null hypothesis written in the equivalent form HO: p1=p2 


a 
i526) = *0-0°H: 1 = 2.) The alternative hypothesis says what kind of difference we 


expect. 


= 0.200 = 20.0% 44, 


The conditions for performing a significance test about 11- sous 26 ~ #2 are 


the same as for constructing a confidence interval. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE TEST ABOUT A 
DIFFERENCE IN MEANS 


Random: The data come from two independent random samples or from two groups in a 
randomized experiment. 


ae 
© 10%: When sampling without replacement, n1<0.10N11526 — 0.200 = 20.0%, < 0.10Nj 


305 
——= 0,200 = 20.0%,. — N. 
and n2<0.10N2is20 7” ng <0.10N 


Normal/Large Sample: For each sample, the corresponding population distribution (or 

the true distribution of response to the treatment) is Normal or the sample size is large 
re 

(n>30)1526 © 0.200 = 20.0%(n > 30) For each sample, if the population (treatment) 


305 
beet ——= 0.200 = 20.0% 
distribution has unknown shape and n<301526 . 


data shows no strong skewness or outliers. 


n < 30 , a graph of the sample 


Here’s an example that illustrates how to state hypotheses and check conditions. 


EXAMPLE |A longer work week? (>) 
Stating hypotheses and checking conditions 


Bill Varie/Getty Images 


PROBLEM: Has the mean number of hours Americans work in a week changed? One of the 
questions on the General Social Survey (GSS) asked respondents how many hours they work 
each week. Responses from random samples of employed Americans were recorded for 1975 
and 2014—they are summarized here: 


Year Sample size Mean SD 
1975 764 38.97 hours 13.13 hours 
2014 1501 41.91 hours 14.35 hours 


305 
Do these data give convincing evidence at the a=0.051526 
level of a difference in the true mean number of work hours per week for Americans in 1975 


and 2014? 
a. State appropriate hypotheses for performing a significance test. Be sure to define the 


parameters of interest. 


0.200 = 20.0% ert 
a = 0.05 significance 


b. Check if the conditions for performing the test are met. 


SOLUTION: 


305 
a. Ho: 42014 -—p1975 =01526 


0.200 = 20.0% Hy : pooia — Hig7s = 0 


305 
Ha: 2014 -1975#01526 


0.200 = 20.0%H, : poo14 — pio7s + 0 


305 


0.200 = 20.0% rave = 
where 1975 =1526 1975 = the true mean hours worked per week by 
305 
: : 0.200 = 20.0% 1. = 
Americans in 1975 and »2014 =1526 H2014 = the true mean hours worked 


per week by Americans in 2014. 


You could also state the hypotheses as 


305 
Ho: 42014="119751526 


0.200 = 20.0% Hy : pong = Mig7s 


305 


9 —» or ; aif - 
Ha: 2014419751526 0.200 = 20.0077, : poo1a # H1975 


© Random? Independent random samples of 764 working Americans in 1 975 and 1501 
ae Ue J 


working Americans in 2014.¢ 
305 — 0.200 = 20.0% 

© 10%: 764<10%1520. S764 < 10% of all working Americans in 1975; 

305 
0.200 = 20.0% ‘ ; ; ; 

1501<10%1526 1501 < 10% of all working Americans in 2014.¢ 

305 

e Normal/Large Sample? N1975 =764 2301526 

30. = 


9 — 9 % === ‘ 
0.200 = 20.0%n1975 = 764 > 30 and 


Be sure to mention independent random samples from the populations of interest 


when checking the Random condition. 


FOR PRACTICE, TRY EXERCISE 51 


Calculations: Standardized Test Statistic and P-Value If the conditions are met, we 
305 


can proceed with calculations. To do a test of HO: p1—-p2=01526 ~ 9200 = DONA y: jy — jig = 0 ; 


305 
oe = 0,200 = 20.0%, — FF, 
start by standardizing x 1—-x 21526 ‘1-22: 


305 


standardized test statistic=statistic-parameterstandard deviation of statistic 1526 
statistic—parameter 


= 0.200 = 20.0% 


standardized test statistic = 


standard deviation of statistic 


305 _ 0.200 = 20.0% 
In the unlikely event that both population standard deviations 011526 ~~ = ~ © %1 and o2 
305 
—— = 0.200 = 20.0% : a 
1526 "02 are known, the standardized test statistic becomes 
_ (¥1—-F2)-0 
oe o2 
305 =k. ~~ as 
= 0.200 = 20.0% ny ng 


z=(x 1-x 2)-0012n1+022n21526 


We could then use the standard Normal distribution to find the P-value. This is known as a two- 
sample z test for a difference in means. It is rarely used in practice. 


305 
; i ——= 0.200 = 20.0% 
Because we seldom know the population standard deviations 011526 "O71 and o2 
305 305 
——= 0.200 = 20.0% =, 4 = 0.200 = 20.0% %, — Fo : 
1526 “02 , we use the standard error of x 1—x 21526 “1 — ©2 in the 


denominator of the standardized test statistic: 


305 


t=(x1-x”2)-0s12n1+s22n21526~ 0200 = 20.0% 


When the Normal/Large Sample condition is met, we can find the P-value using the t 


listribution with degrees of freedom given by Option 1 (technology) or Option 2 (df = smaller 
IT ea eee 5 ae 
of n1-11526~ °° = 79.9%) — 1 and n2-11626— 7 = 99%ng — 1), 


EXAMPLE |A longer work week? (>) 
Calculating the standardized test statistic and P-value 


David Papazian/Getty Images 


PROBLEM: Refer to the previous example. The table summarizes data on hours worked per 
week from the independent random samples of working Americans in 1975 and 2014. 


Year Sample size Mean SD 
1975 764 38.97 hours 13.13 hours 
2014 1501 41.91 hours 14.35 hours 


We already confirmed that the conditions for performing a significance test are met. 
a. Explain why the sample results give some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


SOLUTION: 
a. The observed difference in the sample means is x°2014 —x"1975=41.91-38.97=2.94 
305, on nel i 
T526 > 0200 = 20.0%Fon14 — E1975 = 41.91 — 38.97 = 2.94 , which gives some evidence in 
305, a ___. 
favor of Ha: »2014-p1975#01526 0.200 = 20.0% Hq : tooia — Hi975 FO because 204. 
305 scaatead 
{529 = 0-200 = 20.0% A 
305 _ 9.200 = 20.0% 
b. t=(41.91-38.97) —014.3521501413.132764 =2.940.602 = 4.881526 
__ (AL.91—38.97)-O0 = =294 _ 
¢ = = fe 8 


SYST Sere 
/ 14.352 13.132 
\ “1501 764 


305 
t=(x"1-x 0512 N1+S22N21526 


= 0.200 = 20.0% 


P-value: 


= 0.200 = 20.0% 


305 
OIE iReol -SampT' Test gives t=4. 881526 ‘t = 4.88 and P-value = 0.00000115 
305 


T526 200 = 20.0% 90000115 using oe one = 0.200 = 20.0%q ¢ — 1660.58. 
305 305 


Option 2: df=smaller1526— 0.200 = 20.0%q f — smaller = 159g = 0-200 = 20.0% 7G4 — } 
305 nan or ne 
and 1501-1=7631520 °°? = 70-9%1501 — 1 = 763 


= 0.200 = 20.0% 


305 
Using Table B: df = 100 gives P-value <2(0.0005) =0.0011526 
< 2(0.0005) = 0.001 


Using technology: tedf (lower: 4.88, upper:1000, df:763) X2 =0.000001292. 
305 


1526 > 2-200 = 20.0%,.9 — 9000001292. 


Refer to Technology Corner 26 on page 660 for details on how to do calculations 


for a 2-sample t test on the TI-83/84. 


OO cra ave 
. Because the P-value of 0.00000115<a=0.051526 9" = *°"*9.00000115 < a = 0.05 , we 


305 ; aad 
reject Hois26 Ta = Oo . There is convincing evidence of a difference in the true 


mean hours worked by Americans in 1975 and 2014. 


t distribution 
with df = 763 


a1] | T = j 
-5 f 4 -3 -2 -f 0 1 Z 3 4 ] 5 
t=-4.88 t=4.88 


FOR PRACTICE, TRY EXERCISE 55 


What does the P-value in the example tell us? If there is no difference in the true mean 


hours worked by Americans in 1975 and in 2014, there is a 0.00000115 probability of getting a 
difference in sample means as large as or larger than 2.94 hours in either direction purely by 
the chance involved in random sampling. With such a small probability of getting a result like 
this just by chance when the null hypothesis is true, we have convincing evidence to reject HO 


5 
-_ 9, _ » 4 
1526 0.200 20.0% Hy, 


We can get more information about the difference between the population mean hours 
worked by Americans in 1975 and 2014 with a confidence interval. Technology gives the 95% 


305 _ 0.200 = 20.0% 
confidence interval for 12014-»)19751526 “#2014 ~ #1975 as 1.76 to 4.12 hours. That 
is, we are 95% confident that the true mean hours worked per week by Americans is between 
1.76 hours and 4.12 hours larger in 2014 than in 1975. This is consistent with our “reject HO 
305 


——= 0.200 = 20.0% Hy, 5 


1526 conclusion because 0 is not included in the interval of plausible values 


305 nw anne 
for 12014—1119751526 ~ 0.200 = 20.0/442914 — 141975 , 


Putting It All Together: Two-Sample t Test for yw — He 


Here is a summary of the details for the two-sample t test for a difference between two 
means. 


TWO-SAMPLE t TEST FOR A DIFFERENCE BETWEEN TWO MEANS 


305, ee 
Suppose the conditions are met. To test the hypothesis HO: p1—p2=01526 ~ Sapna 


Ho : #1 ~ #2 = 9 | compute the standardized test statistic 


< (Z1—F2)—0 


a 
t=(x 1-x 2)-0s12n1+s22n21526~ "200 = 20.0% my 79 


Find the P-value by calculating the probability of getting a t statistic this large or larger in 
= 0.200 = 20.0% Fy 


the direction specified by the alternative hypothesis Ha1526 . Use the t 
distribution with degrees of freedom approximated by Option 1 (technology) or Option 2 


05 
= 0.200 = 20.0%, 


3 ‘ « or 
(the smaller of n1—11526 = Rae eed. +, 


—1 and n2-11526 


Here’s an example that shows how to perform a two-sample t test for a difference between 
two means in a randomized experiment. 


EXAMPLE | Calcium and blood pressure (>) 
Significance test for a difference between two means 


rdrgraphe/Shutterstock 


PROBLEM: Does increasing the amount of calcium in our diet reduce blood pressure? 
Examination of a large sample of people revealed a relationship between calcium intake and 
blood pressure. Such observational studies do not establish causation. Researchers therefore 
designed a randomized comparative experiment. 


The subjects were 21 healthy men who volunteered to take part in the experiment. They were 
randomly assigned to two groups: 10 of the men received a calcium supplement for 12 
weeks, while the control group of 11 men received a placebo pill that looked identical. The 
experiment was double-blind. The response variable is the decrease in systolic (top number) 
blood pressure for a subject after 12 weeks, in millimeters of mercury. An increase appears 
as a negative number.22 Here are the data: 


Group 1 (calcium) ¢ =4 18 1/7  -3- =—S) 1-10 a =2 
Group 2 (placebo) eee ee ber OS kes Oe alter ss 


a. Do the data provide convincing evidence that a calcium supplement reduces blood 
pressure more than a placebo, on average, for subjects like the ones in this study? 


b. Interpret the P-value you got in part (a) in the context of this experiment. 


SOLUTION: 
305 PF re = 
a. STATE: Ho: uC-pP=01526 0.200 = 20.0%Hy) : pac — pp =0 ills ees 
305 sane 
759g = 0-200 = 20.0%H, : pc — up > 0 


305 . — 
where C=1526 0.200 = 20.0c = the true mean decrease in systolic blood pressure for 
healthy men [ike the ones in this study who take a calcium supplement and P= 

305 


—— = 0().200 = 20.0% 
1526 


2 ie the true mean decrease in systolic blood pressure for healthy men 


like the ones in this study who take a placebo. No significance level was given, so we'll use 
305 . Se 
scoostt 0.200 = 20.0%, _ 9.95 


| Note that we did not have to check the 10% condition because the subjects in the | 


| experiment were not sampled without replacement from some larger population. | 


PLAN: Two sample t test for Cw Pi506~ 0.200 = 2000 ic: — Jip 
© Random: The 21 subjects were randomly assigned to the calcium or placebo treatments. 


e Normal/Large Sample: The sample sizes are small, but the dotplots show no strong 
S kewness and no outliers. 


Calciuw GEES oS oe ne e@ 
e 
S gS $ 4 ee ¢ e 
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Decrease in systolic blood pressure 


Based on these graphs, it’s reasonable to believe that the true distributions of 


decrease in systolic blood pressure are Normal for subjects like these when taking 
calcium or the placebo. 


305 ; . a 305 os 
DO: ¢ x C=5.0001526 0.200 = 20.0%F 0 = 5.000 , SC=8.7431526 = 0.200 = 20.0%6-, = 8.743 
305 "305 
* —~ 9.900 = 20.0%, . _ 9 a ; 
nC = 101526 = 920° = 2-0%nc = 10 . x-—p=_o.2731596 = 020° 20.0% p = — 0.273. een 


_ RST 305 . 
{576 7 0200 = 20.0%sp = 5.901 ypairisag = 0200 = 20-%np. = 11 


305 
0.200 = 20.0% 
The sample results give some evidence in favor of Ha: ee UP>01526 © 


——= 0)! 200 = 20.0% 


Ha : Uc ~— Lp > 0 because 5.000-(-0.273) = 5.273>0. ae 
5.000 — (—0.273) = 5.273 > 0. 


305 > eee 
* t=[ 5.000-(-0.273) ] - 08.743210+5.901211 = 5.2733.2878 = 1.6041526 °°"? ~ °°" 
__ {5.000—(—0.273)}-0 5.973 
a (Sra? 50012 | -—«3-2878 = OS 
\ ane 
P-value 


0.200 = 20.0%. ~ — 


Option 1: 2- SampT! Test gives tc conte 1.604 and P-value =0.0644 
305 305 


1526 = 0.200 = 20-07 = 0.0644 using df=15. So. 1526 = 0.200 = 20.0% f = = 15.39. 


305 ? ey 
Option 2: df=9 1526 = 0.200 = 0.0%af — 9 
305 


= 0.200 = 20.07 95 < P-value <0.101526 


305 
nem = 0.200 = 20.0% _ 
Using Table B: 0.05<P1526 < 0.10 


TOE enn ene 
Using technology: tedf(lower:1.604, upper:1000, df:9) =0.07161526 "= -9*— 0.0716 


Refer to Technology Corner 26 below for details on how to do calculations for a 2- 


sample t test on the TI-83/84. 


305 ; ae 

CONCLUDE: Because the P-value of 0.0644>0=0.05,1526 0.200 = 20.00.0644 > a = 0.05, 
305 2 

we fail to reject Ho526 °%Hy The experiment does not provide convincing 

evidence that the true mean decrease in systolic blood pressure 1S higher ‘for men [ike these 


who take calcium than for men like these who take a placebo. 


0.200 = 2 


05 

e Assuming Ho:y1—-p2 _ cia I ba = 0 4, true, there is a 0.0644 
probability of getting a difference (Calcium - Placebo) in mean blood pressure reduction 
for the two groups of 5.273 or greater just by the chance involved in the random 
assignment. 


= 0.200 = 20.0% Hy : ps 


t distribution 
with df =9 


t= 1.604 


FOR PRACTICE, TRY EXERCISE 57 


Notice that technology gives smaller, more accurate P-values for two-sample t tests than 
the conservative method. That’s because calculators and software use the more complicated 
formula on page 651 to obtain a larger number of degrees of freedom. 


Why didn’t researchers find a significant difference in the calcium and blood pressure 
experiment? The difference in mean systolic blood pressures for the two groups was 5.273 
millimeters of mercury. This seems like a fairly large difference. With the small group sizes of 
10 and 11, however, this difference wasn’t large enough to reject HO: pC-pP=0 


305 
= 0.200 = 20.0%H, = pe ,= 0: 
1526 Hy : wc — Hp = 9 in favor of the one-sided alternative. We suspect that larger 


groups might show a similar difference in mean blood pressure reduction, which would 
indicate that calcium has a significant effect. If so, then the researchers in this experiment made 


305 . , ox 
a Type II error—failing to reject a false HOis2s 97"? ~ °°" | In fact, later analysis of data 
yp g J y 


from an experiment with more subjects resulted in a P-value of 0.008. Sample size strongly 
affects the power of a test. It is easier to detect a difference in the effectiveness of two 


treatments if both are applied to large numbers of subjects. 


26. Technology Corner |PERFORMING A SIGNIFICANCE TEST 
FOR A DIFFERENCE IN MEANS (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


You can use the two-sample t test option on the TI-83/84 to do the calculations for a 
significance test about the difference between two means. Let’s confirm the results of the 
two previous examples with this feature. 
1. A longer work week? (page 657) 

e Press |STAT|, then choose TESTS and 2-SampTTest. 


e Inthe 2-SampTTest screen, specify “Stats” and adjust your other settings as shown. For 
Pooled: choose “No.” We’II discuss pooling shortly. 


e Highlight “Calculate” and press [ENTER], 


NORMAL FLOAT AUTO REAL RADIAN MP fl NORMAL FLOAT AUTO REAL RADIAN MP fl 


Inpt:Data Wixb2 
*1:41.91 t=4. 880781802 
Sx1:14.35 p=1.158065087€e -6 
ni:1501 df=1660.580116 
x2:38.97 *x1=41.91 
Sx2:13.13 xX2=38.97 
n2: 764 
H1:EWW <u2 du2 

tPooled:f) Yes 


2. Calcium and blood pressure (page 659) 
e Enter the Group 1 (calcium) data in L1 and the Group 2 (placebo) data in L2. 
e Press [STAT], then choose TESTS and 2-SampTTest. 


e Inthe 2-SampTTest screen, specify “Data” and adjust your other settings as shown. For 
Pooled: choose “No.” We’II discuss pooling shortly. 


e Highlight “Calculate” and press [ENTER], 


NORMAL FLOAT AUTO REAL RADIAN MP fl NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampT Test 2-SampTTest 
Inept: BEL) Stats 1 >iW2 
Listi:Li t=1.603717288 
List2:L2 P=0. 0644196844 


Freai:1 df=15.59051297 
Freq2:1 xi1=5 

W1i:#u2 <u2 X2=-@. 2727272727 
Pooled:[ Yes Sx1=8. 743251366 
Color: Sx 2=5. 900693334 
Calculate Draw 


Note: If you select “Draw” instead of “Calculate,” the appropriate t distribution will be 
displayed, showing the standardized test statistic and the shaded area corresponding to the 
P-value. 


AP® EXAM TIP 

305 
The formula for the two-sample t statistic for a test about uy1—y21526 
#1 ~ #2 often leads to calculation errors by students. Also, the P-value from 
technology is smaller and more accurate than the one obtained using the 
conservative method. As a result, your teacher may recommend using the 
calculator’s 2-SampTTest feature to perform calculations. Be sure to name the 


= 0.200 = 20.0% 


=, 0.200 = 20.0 741) — pg 


a 
and to report the standardized test statistic (t=1.60)is20~ °?0~ 98° (t= 1.60) | p. 


value (0.0644), and df (15.59) in the “Do” step. 


305 
procedure (two-sample t test for u1—y2 1526 


) in the “Plan” step 


Think About It 


WHY DO THE INFERENCE METHODS FOR RANDOM SAMPLING WORK FOR 


RANDOMIZED EXPERIMENTS? Confidence intervals and tests for p1-p12 
305 305 


1526 200 = 20-041 — 2 are based on the sampling distribution of x 1-x 21526 — 
<; — 2 | But in experiments, we aren’t sampling at random from any larger populations. 

We can think about what would happen if the random assignment were repeated many times 
> _ 0.200 = 20.0% Hy : ps 


0.200 = 20.0% 


305 
under the assumption that HO: p1—2=01526 1 ~ #2 = 0 is true. That is, 
we assume that the specific treatment received doesn’t affect an individual subject’s 
response. 

Let’s see what would happen just by chance if we randomly reassign the 21 subjects in 
the calcium and blood pressure experiment to the two groups many times, assuming the drug 
received doesn’t affect each individual’s change in systolic blood pressure. We used 
software to redo the random assignment 1000 times. Figure 10.6 shows the value of 


305 a _ 
x C-x Pisa 700 = 29°“ — Ep in each of the 1000 simulation trials. This distribution 


305 
; —_— ee =A p= 0.200 = 20.0% 
(sometimes referred to as the randomization distribution of x C-x P1526 : 


&c — =P ) has an approximately Normal shape with mean 0 (no difference) and standard 
deviation 3.42. This matches fairly well with the distribution we used to perform 
calculations in the example. 

In the actual experiment, the difference in the mean change in blood pressure in the 


05 aE 
calcium and placebo groups was 5.000-(—0.273)=5.2731526 | '*?) = *0.° 
5.000 — (—0.273) = 5.273 How likely is it that a difference this large or larger would 


305 9 
happen just by chance when H01526 — 0.200 = 20.0% Hp is true? Figure 10.6 provides a rough 


answer: 64 of the 1000 random reassignments (indicated by the red dots) yielded a 
difference in means greater than or equal to 5.273. That is, our estimate of the P-value is 
0.064. This is quite close to the 0.0644 P-value that we obtained in the Technology Corner. 
Neither of these values is the exact P-value. To get the exact P-value in an experiment, you 
would consider the difference in sample means for all possible random assignments. This 
approach is called a permutation test. 


° 
2 : . 64 of the 1000 
a ee.hmU°8f a . 
° se 3 .2 random reassignments 
So 9 08 89 © ees 8 S88 resulted in a difference 
- ecoee 8 @6 © Seeeeeeee 5 
4 ee ee 4 in sample means of 
os 5 ~ 5.273 or greater. 
o 


eee cece cesses w 

J eee cece ee eececoese @¢ 

oe ee es SSOSCOOHSCCHSSESSSSSSECCSESOSSOCSCOEE seceeeeeesesessse‘esse sees °° ee 4 
oo Seecececeacecaeeseoceeascesaooscosecesse eececececeessoeeeeseooosceseo 6 Seee © 

o a Sccssscscssosesessesesssossossesesossossseesossosessessessessesessesses 680860 8009800809 o 4 ee ee 
— : Cees: wneeeeete | 
-10 “8 6 i -2 0 2 4 6 8 10 
¥,.—X, = simulated difference in sample mean decrease in systolic blood pressure 


_ 305 _ 9.200 = 20.0% - 
FIGURE 10.6 Dotplot of the values of x C-x P1526 ~~ “Y."°Ec — EP from each of 1000 


Simulated random reassignments of subjects to treatment groups in the calcium and blood 
pressure experiment, assuming no treatment effect. 


CHECK YOUR UNDERSTANDING 


fy 


y How quickly do synthetic fabrics such as polyester decay in landfills? A researcher 
buried polyester strips in the soil for different lengths of time, then dug up the strips and 
measured the force required to break them. Breaking strength is easy to measure and is a 
good indicator of decay. Lower strength means the fabric has decayed more. 


For one part of the study, the researcher buried 10 strips of polyester fabric in well- 


drained soil in the summer. The strips were randomly assigned to two groups: 5 of them 
were buried for 2 weeks and the other 5 were buried for 16 weeks. Here are the breaking 
strengths in pounds:24 


Group 1 (2 weeks) 118 126 126 120 129 
Group 2 (16 weeks) 124 98 110 140 110 


Do the data give convincing evidence that polyester decays more in 16 weeks than in 2 
weeks, on average? 


THE POOLED TWO-SAMPLE t PROCEDURES (DON’T USE THEM!) Most software 
offers a choice of two-sample t procedures. One is often labeled “unequal” variances; the other, 
“equal” variances. The unequal variance procedure uses our formula for the two-sample t 
interval and test, with df calculated as shown on page 651. This test is valid whether or not the 
population variances are equal. 


Recall that the variance is the square of the standard deviation. 


The other choice is a special version of the two-sample t procedures that assumes the two 
population distributions have the same variance. This procedure combines (the statistical term 
is pools) the two sample variances to estimate the common population variance. The resulting 


Statistic is called the pooled two-sample t statistic. 
305 
The pooled t statistic has exactly the ¢ distribution with n1+n2-21526 — 


nm +2 — 2 degrees of freedom if the two population variances really are equal and the 
population distributions are exactly Normal. This method offers more degrees of freedom than 
Option 1 (Technology), which leads to narrower confidence intervals and smaller P-values. 
The pooled t procedures were in common use before software made it easy to use Option 1 for 
our two-sample t procedures. 


0.200 = 20.0% 


305 
Remember, we always use the pooled sample proportion p4=X1+X2n1+n21526 © 
X\+Xo 305 


0.200 = 20.0% 


: —— = 0.200 = 20.0% F7,, - - 
mi+m2 when performing a test of HO: p1—p2=01526 °Hy : pi — p2 = 0. that’s 
305 _ 9.200 = 20.0% co 


; —— = 0.200 = 20.0%. 
because p11526 Pl and p21526 


"P2 are equal if the null hypothesis is 
305 
true. We estimate this common value using the overall proportion p%1526 — 


0.200 = 20.0%) at 


successes in the two samples combined. 


In the real world, distributions are not exactly Normal, and population variances are not 
exactly equal. In practice, the Option 1 two-sample t procedures are almost always more 
accurate than the pooled procedures. Our advice: Never use the pooled t procedures if you have 
technology that will carry out Option 1. 


Remember, we always use the pooled sample proportion 


a *i+*2 when performing a test of 
pr=X1+X2n1+n2is20 = 0.200 = 20.0% = P g 


1 305 ; , si 
HO: p1-p2= om 0.200 = 20.0%, : p, — po = 0° That's because 1.15267 0-200 = 20.0%p and 


oD = 0.200 = 20.0%», are equal if the null hypothesis is true. We estimate this common 


value using the overall proportion one = 0.200 = 20.0% 45 of successes in the two samples 
020 


combined. 


Think About It 


HOW DOES POOLING WORK? ag we have two Normally distributed populations 


ae = 0.200 = 20.0% Ae 


with the same variance 621526 . If we take independent random samples from 


305 305 
0.200 = 20.09 = 0.200 = 20.0% 
the two populations, both bere variances $121526 si and s22 1526 : 


305 » 
8) estimate 021526 20° = 209%? The best way to combine (pool) these two estimates is to 


average them with weights equal to their degrees of freedom. This gives more weight to the 
information from the larger sample, which is reasonable. The resulting estimator of 02 


305 
> 0.200 = 20.0% 2 


1526 — Oo” is 
7 (ny —1)s?+(ng—1)s3 
305 9 9 oP ae CoS oo ae 
sp2=(n1-1)s12+(n2-1)s22n1+n2-21526 7") = 79-9 ny +ng—2 
305 


0.200 = 20.0% 


This is called the pooled estimator of 621526 — o” because it combines the 


information in both samples. 
305 


To perform inference about 1-p21526 — “M1 — #2 in this setting, we start by 
simplifying our formula for the standard deviation of the statistic: 


0.200 = 20.0% 


305 


ee = ane 08 
> 3 —S= Sp or 
o, 9% o2 ia Piccf ¥ 1 } 1 1 
Os,-3, = \| —+— = = ,/o? | —+— } =o,/—+— 
Sone ny ng = ny Ta \ ny ng \ ny 9 
305 _ 9.200 = 20.0% 305 _ 9.200 = 20.0% 
We then substitute the pooled estimate sp1526 — Wa 8p for A ee. The 
ee 
pooled two-sample 1526 — 0.200 = 0.94¢ interval is 
= = ‘ } 1 1 
(Z, —%2)+t*s,,/ —+— 
0.200 = 20.0% Ry Ng 


(x i= 2etespini-+1n2aee 
305 


The pooled two-sample t1526 
Gan. ma er 
{526 = 0-200 = 20.0% Hy: ry — pep = 0 jy 


~ 0.200 = 2°.9t statistic for a test of HO: p1-p2=0 


4 1 


305 Spy/ a 
_— = —~(é =. % / 7 
t=(x 1-x 2)-Ospin1+1n21526 eee ae: \ 1 "2 


305 
For either procedure, we use a t distribution with n1+n2-21526 — 


degrees of freedom. 


Section 10.2 Summary 


305 


; = 0.200 = 20.0%. ; 
e Choose independent SRSs of size n11526 "’ from Population 1 with mean p11 

305 305 

= 0.200 = 20.0% = ——= 0.200 = 20.0% 

1526 “M1 and standard deviation 011526 "71 and of size n2 

305 305 

——= 0.200 = 20.0%. ; ——= 0.200 = 20.0%), 

1526 "N2 from Population 2 with mean [121526 “#2 and standard 

305 305 


= 0.200 = 20.0%, = 0.200 = 20.0% 


deviation 021526 . The sampling distribution of x 1-x 21526 
1 — #2 has the following properties: 


« Shape: Normal if both population distributions are Normal; approximately Normal if both 


ee 
sample sizes are large (n1>30 and n2>30)is26 °°"? ~ °°." (ni > 30and nz > 30) 


population is Normally distributed and the other sample size is large. 


305 
» Center: Its mean is px 1-x 2=1-1121526 


or if one 


= 0.200 = 20.0% 15, _z, — M1 — pe 


305 

oe a ae ——= 0.200 = 20.0% 

« Variability: The standard deviation is ox 1-x 2=012n1+022n21526 . 
Oz,-z. — ™% "Ra 


as long as the 10% condition is met when sampling without 
305 305 


replacement: n1<0.10N11526 — 0.200 = 20.0%, <0.10M1 and n2<0.10N21526 
ng < 0.10.N> ; 


= 0.200 = 20.0% 


e Confidence intervals and tests for the difference between the means of two populations or 
305 305 
——= 0.200 = 20.0% ——= 0.200 = 20.0%). 
the mean responses to two treatments [111526 “ML and 121526 "2 
305 
based on the difference x 1-x 21526 — 


are 


- 0.200 = 20.0%. _ =. 
%1 — £2 between the sample means. 


305 
e Before estimating or testing a claim about p1—y121526 
conditions are met: 


= 0.200 = 20.0% 4, — jay , check that these 


= Random: The data come from two independent random samples or from two groups in a 
randomized experiment. 
305, cetices ob 
© 10%: When sampling without replacement, n1<0.10N1is26— "70? = 7°-°*n, < 0.10) 
305, aon 
and n2<0.10N21526~ "700 = 79-9%ng < 0.10N2 
: Normal/Large Sample: For each sample, the corresponding population distribution (or the 
true distribution of response to the treatment) is Normal or the sample size is large (n>30) 


305 


——= 0.200 = 20.0% 
1526 a(n 2 30) 


. For each an if the population (treatment) distribution has 


= 0.200 = 20.0%, 


305 
unknown shape and n<301526— n < 30 , confirm that a graph of the sample data 


shows no strong skewness or outliers. 


305 
oe ——= 0.200 = 20.0%), — po - 
e When conditions are met, a C% confidence interval for p1—y121526 ‘#1 ~ #2 is 


S| 85 
eo 305 _ g2q0< nny?) — 22) HE Ot 
x 1-x 2)+t*s12n1+s22n21526 0 aun. 
1 2 12n1+s22n2 aa ny ng» 
sr = 0.200 = ae = 0.200 = 2 


20.0% 20.0.% : 
“t* is the critical value with C,1526 C, of its area 


= 0.200 = 20.0% _ = 0.200 = 2 


where t*1526 — 


- 


305 305 
between —t*1526 t™ and t*1526 im om for the t distribution with degrees 
of freedom from either Option 1 (technology) or Option 2 (the smaller of n1-1 


305 = 
= = 0.200 = 20.0% ~ 0.200 = 20.0% 
1526 "my — 1 and n2-11526 7 = ng 


0.200 = 20.0% 41) — pg 


1 ). This is called a two-sample t 


305 

interval for 1—[121526 — 

ROT ae wee si 7 

© To test HO: p1-p2=01526 7° ~ °9-°"°Ho : Hi — H2 = 9 | use a two-sample t test for 11-12 
305 _ 9.200 = 20.0% 141 — py 


1526 2 . The standardized test statistic is 
4 (1 —T2)—0 
2 2 
5 | 85 
— 30 _o.200=200% \\n, no 
t=(x 1-x 2)-0s12n1+s22n23Re: 7 


P-values are calculated using the t distribution with degrees of freedom from either Option 1 


305 
ccnngleey) or Option 2 (the smaller of n1—11526 © 0200: = 00%, —3 and n2-1 


== 0.200 = 20.0%. 1), 


e Be sure to follow the four-step process whenever you construct a confidence interval or 
perform a significance test for comparing two means. 


10.2 Technology Corners 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


25. Constructing a confidence interval for a difference in means 
26. Performing a significance test for a differenceinmeans Page 654 


Section 10.2 Exercises 


37. PQ 648 |») Cholesterol The level of cholesterol in the blood for all men aged 20 to 34 


oo ., “s 
follows a Normal distribution with mean EM=1881526 — Neoee Stay == ESS milligrams 


a 
per deciliter (mg/dl) and standard deviation OM=41 1526" SEOE NO yg = 41 mg/dl. For 


14-year-old boys, blood cholesterol levels follow a Normal distribution mau mean 


305 ; 
UB=1701526 — ah = Map = 170 mg/dl and standard deviation oB= 301508 
75 = 30 mg/dl. Suppose we select independent SRSs of 25 men aged 20 to 34 and 36 

0.200 = 20.0% y, 


= 0.200 = 20.0% 


305 
boys aged 14 and calculate the sample mean cholesterol levels x M1526— 


05 
= 0.200 = 20.0%= 
and x Bi526 "LB, 


305 
, ; ee — — p= = 0.200 = 20.0%%,, _ x 
a. What is the shape of the sampling distribution of x M-x B1526 2M eB ? 


Why? 
b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling distribution. 


38. How tall? The heights of young men follow a Normal distribution = mean UM=69.3 


= 0.200 = 20.0% 
1526 tm = 69.3 inches and standard deviation oM=2. Signe 


om = 2.8 ee The ea of young women follow a Normal distribution with mean 


0.200 = 20.0% 


uW=64. Sie 0.200 = 20.0 tw = 64.5 inches and standard deviation oW=2.5 
305 . » = 
1526 200 = 20.0%ow = 2.5 inches. Suppose we select independent SRSs of 16 young men 


305 
; —y pees = 0.200 = 20.0% % 
and 9 young women and calculate the sample mean heights x M1526 “M and 


305 
= 0.200 = 20.0%... 
x W1526 "LW 


305 
a. What is the shape of the sampling distribution of x M-x W1526— 
2? Why? 


0.200 = 20.0%% 1, — Fy 


b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling distribution. 


39. Cholesterol Refer to Exercise 37. 
305 a 
a. Find the probability of getting a difference in sample means x M-x B1526 — ape eave 
<M — ©B that’s less than 0 mg/dl. 


b. Should we be surprised if the sample mean cholesterol level for the 14-year-old boys 
exceeds the sample mean cholesterol level for the men? Explain your answer. 


40. How tall? Refer to Exercise 38. 


a. Find the probability of getting a difference in sample means x M-x W 
I men mene a 
1526 9) = 9945 m — Zw that’s greater than 2 inches. 


b. Should we be surprised if the sample mean height for the young men is at least 2 inches 
greater than the sample mean height for the young women? Explain your answer. 


41. pg 649 Shoes How many pairs of shoes do teenagers have? To find out, a group of 
Ne 


42. 


43. 


AP® Statistics students conducted a survey. They selected a random sample of 20 female 
students and a separate random sample of 20 male students from their school. Then they 
recorded the number of pairs of shoes that each student reported having. Here are their 
data: 


Males 14 7 6 5 12 38 8 7 10 10 

Females 50 26 26 31 57 19 24 22 23 38 

Males 10 11 4 5 22 7 5 10 35 7 

Females 13 50 13 34 23 30 49 13 15 51 
305 


0.200 = 20.0”, = the true mean number of pairs of shoes that male students at 


305 
——= 0.200 = 20.0%). 
the school have and 121526 #2 the true mean number of pairs of shoes that 


female students at the school have. Check if the conditions for calculating a confidence 


305 
——= 0.200 = 
interval for 11-p121526 


Let 1111526 — 


20.0% 11, — 
#1 — #2 are met. 


Who texts more? For their final project, a group of AP® Statistics students wanted to 
compare the texting habits of males and females. They asked a random sample of students 
from their school to record the number of text messages sent and received over a 2-day 
period. Here are their data: 


Males rey 44 28 83 0 6 78 6 

Females a he 203 102 54 379 305 179 24 

Males 5 213 73 20 214 28 it 

Females 127 65 41 af 298 6 130 0 
305 


0.200 = 20.0” = the true mean number of texts sent by male stu- dents at the 


0.200 = 


Let 111526 — 


*0-0“t12 = the true mean number of texts sent by female students 


at the school. Check if the conditions for calculating a confidence interval for p1-p2 
305 

—— = 0.200 = 
1526 


305 
school and 121526 


20.0% 1, — tx 
#1 ~ #2 are met. 


Household size How do the numbers of people living in households in the United 
Kingdom (U.K.) and South Africa compare? To help answer this question, we chose 
independent random samples of 50 students from each country. Here is a dotplot of the 
household sizes reported by the students in the survey: 


44. 


45. 


South Africa 


Place 


U.K. 
SSSHSSOHSOHSSHSSSSSESESS | Geoseseseoseseos 


0 5 10 15 2 25 30 
Household size 
305, 
Let pUK1526 
305 _ 
households and pSAis26~ "°°? ~ *°-°*H4s4 = the true mean number of people living in 
South African households. Check if the conditions for calculating a confidence interval for 


~ 0.200 = 200m = the true mean number of people living in U.K. 


305, 
wUK-pSA eae 7 0200 = 20.0! 


“uk — USA are met. 

Long words Mary was interested in comparing the mean word length in articles from a 
medical journal and an airline’s in-flight magazine. She counted the number of letters in 
the first 400 words of an article in the medical journal and in the first 100 words of an 
article in the airline magazine. Mary then used statistical software to produce the 


histograms shown. 


Medical journal Airline magazine 


Percent 


0246 8101214 2 4 6 8 1012 14 


Word length 
305 _ 0.200 = 20.0% . a? 
Let pMJ1526 HMJ = the true mean length of all words in the medical journal 
305 


and pAM1526 — 0.200 = 20.0aM = the true mean length of all words in the airline magazine. 


Check if the conditions for calculating a confidence interval for hWMJ-LpAM 
305 


ti S23 9 =— 9 oo 
1526 200 = 20.0’4a4,7 — HAM are met. 


pg 652 |») Is red wine better than white wine? Observational studies suggest that 


moderate use of alcohol by adults reduces heart attacks and that red wine may have special 


benefits. One reason may be that red wine contains polyphenols, substances that do good 
things to cholesterol in the blood and so may reduce the risk of heart attacks. In an 
experiment, healthy men were assigned at random to drink half a bottle of either red or 
white wine each day for two weeks. The level of polyphenols in their blood was measured 
before and after the 2-week period. Here are the percent changes in polyphenols for the 
subjects in each group:22 


Red wine 3.5 8.1 7.4 4.0 0.7 4.9 8.4 7.0 5.5 
White wine 3.1 0.5 -3.8 4.1 -0.6 2.1 1.9 -5.9 0.1 


a. A dotplot of the data is shown, along with summary statistics. Write a few sentences 
comparing the distributions. 


Red wine es ———— 
White wine ———— : —— - — mad — r 
—10 -5 0 5 10 
Percent change in polyphenols 
Groupname = 1 Mean SD Min Q, Med Qs. Max 
Red wine 9 5.500 2.517 0.7 3.15 5.5 7.75 8.4 
White wine 9 0.233 3.292 =ed -2.20 0.5 2.90 4.1 


b. Construct and interpret a 90% confidence interval for the difference in true mean 
percent change in polyphenol levels for healthy men like the ones in this study when 
drinking red wine versus white wine. 


. Tropical flowers Different varieties of the tropical flower Heliconia are fertilized by 
different species of hummingbirds. Researchers believe that over time, the lengths of the 
flowers and the forms of the hummingbirds’ beaks have evolved to match each other. Here 
are data on the lengths in millimeters for random samples of two color varieties of the 


same species of flower on the island of Dominica:22 


H. caribaea red 


41.90 42.01 41.93 43.09 41.17 41.69 39.78 40.57 
39.63 42.18 40.66 37.87 39.16 37.40 38.20 38.07 
38.10 37.97 38.79 38.23 38.87 37.78 38.01 


H. caribaea yellow 
36.78 37.02 36.52 36,11 36.03 35.45 38.13 37.10 
35.17 36.82 36.66 35.68 36.03 34.57 34.63 


a. A dotplot of the data is shown, along with summary statistics. Write a few sentences 
comparing the distributions. 


Red : 29 . pied 2. * = 
Yellow 477-7 : T l T T l 
34 35 36 37 #38 #%39 «©4006«64406420¢~6U«d43CO44 
Length (mm) 
Group n Mean SD Min Q1 Med Q3 Max 
Red 23 39.698 1.786 37.4 38.07 39.16 41.69 43.09 
Yellow 15 36.18 0.975 34.57 35.45 36.11 36.82 38.13 


Construct and interpret a 95% confidence interval for the difference in the true mean 
lengths of these two varieties of flowers. 


47. Paying for college College financial aid offices expect students to use summer earnings to 


48. 


help pay for college. But how large are these earnings? One large university studied this 
question by asking a random sample of 1296 students who had summer jobs how much 
they earned. The financial aid office separated the responses into two groups based on 
gender, so these can be viewed as independent samples. Here are the data in summary 


form:24 

305 eases 
Group n 159g = 0200 = 20.0 % 35 Sx 
Males 675 $1884.52 $1368.37 
Females 621 $1360.39 $1037.46 
a 


. How can you tell from the summary statistics that the distribution of earnings in each 


group is strongly skewed to the right? The use of two-sample t procedures is still 
justified. Why? 


. Construct and interpret a 90% confidence interval for the difference between the true 
mean summer earnings of male and female students at this university. 


c. Interpret the 90% confidence level in the context of this study. 


Beta blockers In a study of heart surgery, one issue was the effect of drugs called beta 
blockers on the pulse rate of patients during surgery. The available subjects were 
randomly assigned into two groups. One group received a beta blocker; the other group 


received a placebo. The pulse rate of each patient at a critical point during the operation 
was recorded. Here are the data in summary form: 


305 sacae 
Group n MTB Da = 0-200 = 20.0%, Sx 
Beta blocker 30 65.2 7.8 
Placebo 30 70.3 8.3 


a. The distribution of pulse rate in each group is not Normal. The use of two-sample t 
procedures is still justified. Why? 


b. Construct and interpret a 99% confidence interval for the difference in mean pulse rates 
for patients like these who receive a beta blocker or a placebo. 


c. Interpret the 99% confidence level in the context of this study. 


49. Reaction times Catherine and Ana wanted to know if student athletes (students on at least 


50. 


ol. 


one varsity team) have faster reaction times than non-athletes. They took separate random 
samples of 33 athletes and 30 non-athletes from their school and tested their reaction time 
using an online reaction test, which measured the time (in seconds) between when a green 
light went on and the subject pressed a key on the computer keyboard. A 95% confidence 


interval for the difference (Non-athlete — Athlete) in the mean reaction time was 
0.018+0.034i526 "70? = ?°°"9,018 + 0.034 seconds. 


a. Does the interval provide convincing evidence of a difference in the true mean reaction 
time of athletes and non-athletes? Explain your answer. 


b. Does the interval provide convincing evidence that the true mean reaction time of 


athletes and non-athletes is the same? Explain your answer. 


Bird eggs A researcher wants to see if birds that build larger nests lay larger eggs. She 
selects two random samples of nests: one of small nests and the other of large nests. Then 
she weighs one egg (chosen at random if there is more than one egg) from each nest. A 


95% confidence interval for the difference (Large — Small) between the mean mass (in 
EE 
grams) of eggs in small and large nests is 1.6+2.01526. 7°? ~ °°°""1.6 + 2.0 , 


a. Does the interval provide convincing evidence of a difference in the true mean egg 
mass of birds with small nests and birds with large nests? Explain your answer. 


b. Does the interval provide convincing evidence that the true mean egg mass of birds 


with small nests and birds with large nests is the same? Explain your answer. 


pg 656 |») Sorting the music Student researchers Adam, Edward, and Kian wondered 


if music would affect performance for certain tasks. To find out, they had student 
volunteers sort a shuffled set of 26 playing cards by face value and by color. Nineteen of 
the 38 volunteers were randomly assigned to listen to music during the sorting, while the 
others listened to no music. Here are parallel boxplots of the time in seconds that it took to 
sort the cards for the students in each group: 
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53. 


Do these data give convincing evidence of a difference in the true mean sorting times at 
305 


= 0.200 = 20.0% 
the a=0.101526 : 


a = 0.10 significance level? 
a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


Ice cream For a statistics class project, Jonathan and Crystal held an ice-cream-eating 
contest. They randomly selected 29 males and 35 females from their large high school to 
participate. Each student was given a small cup of ice cream and instructed to eat it as fast 
as possible. Jonathan and Crystal then recorded each contestant’s gender and time (in 
seconds), as shown in the dotplots. 


Male ——® oe 


Gender 


15 30 45 60 75 90 105 
Time to eat ice cream (sec) 


Do these data give convincing evidence of a difference in the population means at the 
305 


a=0.101526 


0.200 = 20.0% = 0.10 significance level? 
a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


Happy customers As the Hispanic population in the United States has grown, businesses 
have tried to understand what Hispanics like. One study interviewed separate random 
samples of Hispanic and Anglo customers leaving a bank. Customers were classified as 
Hispanic if they preferred to be interviewed in Spanish or as Anglo if they preferred 
English. Each customer rated the importance of several aspects of bank service on a 10- 
point scale.22 Here are summary results for the importance of “reliability” (the accuracy of 
account records, etc.): 


305, see 
Group n yea = 0-200 = 20.0% 35 Sx 
Anglo 92 6.37 0.60 
Hispanic 86 5.9L 0.93 


Researchers want to know if there is a difference in the mean reliability ratings of all 
Anglo and Hispanic bank customers. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check that the conditions for performing the test are met. 


54. Does music help or hinder memory? Many students at Matt’s school claim they can 
think more clearly while listening to their favorite kind of music. Matt believes that music 
interferes with thinking clearly. To find out which is true, Matt recruits 84 volunteers and 
randomly assigns them to two groups. The “Music” group listens to their favorite music 
while playing a “match the animals” memory game. The “No Music” group plays the 
same game in silence. Here are some descriptive statistics for the number of turns it took 
the subjects in each group to complete the game (fewer turns indicate a better 


performance): 
Group Sample size Mean SI 
Music 305, 9 50.0 % ¢ 305 _ 1) = 20.0 Ser ro = 
11=421596 = 0-200 = 20.0%, = 42 .-4=15 9397596 ~ 0-200 = 20.0%%, = 15.833 s1=3.944... 
5 05 
0 305 _ 0.200 = 305 


0.200 = 20.0%%, — 13.714 $2=3.550.... 


N 20.0%. — A — 305 _ 
music %2=421526 “ng = 42 y-2=13.7141526 


Matt wants to know if listening to music affects the average number of turns required to 
finish the memory game for students like these. 


a. State appropriate hypotheses for performing a significance test. Be sure to define the 
parameters of interest. 


b. Check if the conditions for performing the test are met. 


55. PQ 657 |») Happy customers Refer to Exercise 53. 


a. Explain why the sample results give some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


56. Does music help or hinder memory? Refer to Exercise 54. 
a. Explain why the sample results give some evidence for the alternative hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


57. pg 659 Fish oil To see if fish oil can help reduce blood pressure, males with high 
Ne 


blood pressure were recruited and randomly assigned to different treatments. Seven of the 
men were assigned to a 4-week diet that included fish oil. Seven other men were assigned 
to a 4-week diet that included a mixture of oils that approximated the types of fat in a 
typical diet. Each man’s blood pressure was measured at the beginning of the study. At the 
end of the 4 weeks, each volunteer’s blood pressure was measured again and the reduction 
in diastolic blood pressure was recorded. These differences are shown in the table. Note 


that a negative value means that the subject’s blood pressure increased.2© 


Fish oil 8 12 10 14 2 0 0 


Regular oil -6 0 1 2 =o -4 2 


a. Do these data provide convincing evidence that fish oil helps reduce blood pressure 
more, on average, than regular oil? 


b. Interpret the P-value from part (a) in the context of this study. 


58. Baby birds Do birds learn to time their breeding? Blue titmice eat caterpillars. The birds 


59. 


60. 


would like lots of caterpillars around when they have young to feed, but they must breed 
much earlier. Do the birds learn from one year’s experience when to time their breeding 
next year? Researchers randomly assigned 7 pairs of birds to have the natural caterpillar 
supply supplemented while feeding their young and another 6 pairs to serve as a control 
group relying on natural food supply. The next year, they measured how many days after 
the caterpillar peak the birds produced their nestlings.22 The investigators expected the 
control group to adjust their breeding date the following year, whereas the well-fed 
supplemented group had no reason to change. Here are the data (days after caterpillar 
peak): 


Control 4.6 2.3 7.7 6.0 4.6 =12 
Supplemented 15.5 11.3 5.4 16.5 11.3 11.4 fal 


a. Do the data provide convincing evidence that birds like these that have to rely on the 
natural food supply produce their nestlings closer to the caterpillar peak, on average, 
than birds like these that have the caterpillar supply supplemented? 


b. Interpret the P-value from part (a) in the context of this study. 


Who talks more—men or women? Researchers equipped random samples of 56 male 
and 56 female students from a large university with a small device that secretly records 
sound for a random 30 seconds during each 12.5-minute period over 2 days. Then they 
counted the number of words spoken by each subject during each recording period and, 
from this, estimated how many words per day each subject speaks. The female estimates 
had a mean of 16,177 words per day with a standard deviation of 7520 words per day. For 
the male estimates, the mean was 16,569 and the standard deviation was 9108. Do these 


305 
. ae ——= 0.200 = 20.0% oe 
data provide convincing evidence at the a=0.051526 ‘a = 0.05 significance 


level of a difference in the average number of words spoken in a day by all male and all 
female students at this university? 


Gray squirrel In many parts of the northern United States, two color variants of the 
Eastern Gray Squirrel— gray and black—are found in the same habitats. A scientist 
studying squirrels in a large forest wonders if there is a difference in the sizes of the two 
color variants. He collects random samples of 40 squirrels of each color from a large 
forest and weighs them. The 40 black squirrels have a mean weight of 20.3 ounces and a 
standard deviation of 2.1 ounces. The 40 gray squirrels have a mean weight of 19.2 ounces 
and a standard deviation of 1.9 ounces. Do these data provide convincing evidence at the 


61. 


62. 


63. 


305 
a=0.011526 — 


all gray and black Eastern Gray Squirrels in this forest? 


20 = Ne = OE significance level of a difference in the mean weights of 


Who talks more—men or women? Refer to Exercise 59. 


a. Construct and interpret a 95% confidence interval for the difference between the true 
means. If you already defined parameters and checked conditions in Exercise 59, you 
don’t need to do them again here. 


b. Explain how the confidence interval provides more information than the test in 


Exercise 59. 


Gray squirrel Refer to Exercise 60. 


a. Construct and interpret a 99% confidence interval for the difference between the true 
means. If you already defined parameters and checked conditions in Exercise 60, you 
don’t need to do them again here. 


b. Explain how the confidence interval provides more information than the test in 


Exercise 60. 


Teaching reading An educator believes that new reading activities in the classroom will 
help elementary school pupils improve their reading ability. She recruits 44 third-grade 
students and randomly assigns them into two groups. One group of 21 students does these 
new activities for an 8-week period. A control group of 23 third-graders follows the same 
curriculum without the activities. At the end of the 8 weeks, all students are given the 
Degree of Reading Power (DRP) test, which measures the aspects of reading ability that 


the treatment is designed to improve. Here are parallel boxplots of the data:28 


Activities 


Group 


Control 
i 


0 10 20 30 40 S50 60 #70 80 
DRP score 


a. Write a few sentences comparing the DRP scores for the two groups. 


After checking that the conditions for inference are met, the educator performs a test of 


305 305 

——-= 0.200 = 20.0%, - —= = 0.200 = 20.0% 
HO: pA-pC=01526 °° = 799% Ho: a — Ho = 0 versus Ha: pA-pC>01526 

305 
= =.= 0.200 = 20.0% ; 
Hq: wa — Ho > 9 where pAis26 ‘HA = the true mean DRP score of third- 
305 
; ; = ——= 0.200 = 20.0% 4, 

graders like these who do the new reading activities and C1526 Hc =the 


true mean DRP score of third-graders like these who follow the same curriculum 
without the activities. Computer output from the test is shown. 


Two-sample T for DRP score 


Group N Mean StDev SE Mean 
Activities 21 51.5 cia 2.4 
Control 23 41.5 af a: 3.6 


T-Value = 2.31 P-Value = 0.013 DF = 37 


305, — 
b. What conclusion should the educator make at the a=0.05is20 7"? = 7°°”"°a@ = 0.05 
significance level? 


c. Can we conclude that the new reading activities caused an increase in the mean DRP 
score? Explain your answer. 


d. Based on your conclusion in part (b), which type of error—a Type I error or a Type II 
error—could you have made? Explain your answer. 


64. Does breast-feeding weaken bones? Breast-feeding mothers secrete calcium into their 
milk. Some of the calcium may come from their bones, so mothers may lose bone mineral. 
Researchers compared a random sample of 47 breast-feeding women with a random 
sample of 22 women of similar age who were neither pregnant nor lactating. They 
measured the percent change in the bone mineral content (BMC) of the women’s spines 
over 3 months. Here are comparative boxplots of the data:22 


Breast- 
feeding 


Not : 
pregnant 


AG 8 «6 «4 32 6 2 4 
Percent change in BMC 


a. Write a few sentences comparing the percent changes in BMC for the two groups. 


After checking that the conditions for inference are met, the researchers perform a test 
IT cscmeianeainetic ini 

of HO: pBF-pNP=0is26) °°? = °°-°*Ho: Her — HNP = 9 versus Ha: pBF-pNP<0 

305 nine ‘. 305 _ g. 20 0% 

1526, 700 = 70-.0°Ha: ar — HNP <0 where pBFis26~ "7"? = °°-9*uBF = the true mean 

ae 

percent change in BMC for breast-feeding women and p.NP1526 © wane = ENP = the 

true mean percent change in BMC for women who are not pregnant or lactating. 

Computer output from the test is shown. 


Two-sample T for BMC change 

Group N Mean StDev SE Mean 
Breastfeed 47 -3.59 2,51 0.37 
Notpregnant 22 0.31 1.30 0.28 


T-Value = -8.50 P-Value = 0.000 DF = 66 


305 ; : o 
b. What conclusion should the researchers make at the a=0.05is20 700 = 7°°“a@ = 0.05 


significance level? 


c. Can we conclude that breast-feeding causes a mother’s bones to weaken? Why or why 
not? 


d. Based on your conclusion in part (b), which type of error—a Type I error or a Type II 
error—could you have made? Explain your answer. 


65. A better drug? In a pilot study, a company’s new cholesterol-reducing drug outperforms 


66. 


67. 


the currently available drug. If the data provide convincing evidence that the mean 
cholesterol reduction with the new drug is more than 10 milligrams per deciliter of blood 
(mg/dl) greater than with the current drug, the company will begin the expensive process 
of mass-producing the new drug. For the 14 subjects who were assigned at random to the 
current drug, the mean cholesterol reduction was 54.1 mg/dl with a standard deviation of 
11.93 mg/dl. For the 15 subjects who were randomly assigned to the new drug, the mean 
cholesterol reduction was 68.7 mg/dl with a standard deviation of 13.3 mg/dl. Graphs of 
the data reveal no outliers or strong skewness. 


a. Carry out an appropriate significance test. What conclusion would you draw? (Note that 


305 . » 
the null hypothesis is not HO: p1-p2=O1526 °° = 20-°*Ho + fa — fe = 0 +) 


b. Based on your conclusion in part (a), could you have made a Type I error or a Type II 


error? Justify your answer. 


Down the toilet A company that makes hotel toilets claims that its new pressure-assisted 
toilet reduces the average amount of water used by more than 0.5 gallon per flush when 
compared to its current model. To test this claim, the company randomly selects 30 toilets 
of each type and measures the amount of water that is used when each toilet is flushed 
once. For the current-model toilets, the mean amount of water used is 1.64 gal witha 
standard deviation of 0.29 gal. For the new toilets, the mean amount of water used is 1.09 
gal with a standard deviation of 0.18 gal. 


a. Carry out an appropriate significance vole _ What conclusion would you draw? (Note that 


the null hypothesis is not HO: p1-p2= ~0int" 0.200 = 20.0% Hy : 1 — 2 = 0 .) 


b. Based on your conclusion in part (a), could you have made a Type I error or a Type II 


error? Justify your answer. 


Rewards and creativity Do external rewards—things like money, praise, fame, and 
grades—promote creativity? Researcher Teresa Amabile suspected that the answer is no, 
and that internal motivation enhances creativity. To find out, she recruited 47 experienced 
creative writers who were college students and divided them at random into two groups. 
The students in one group were given a list of statements about extrinsic reasons (E) for 


68. 


writing, such as public recognition, making money, or pleasing their parents. Students in 
the other group were given a list of statements about intrinsic reasons (I) for writing, such 
as expressing yourself and enjoying playing with words. Both groups were then instructed 
to write a poem about wee Each student’s poem was rated separately by 12 different 
poets using a creativity scale.22 These ratings were averaged to obtain an overall creativity 
score for each poem. The table shows summary statistics for the two groups. 


Group name n Mean SD 
Intrinsic 24 19.883 4.44 
Extrinsic 23 15.739 5.255 


We used software to randomly reassign the 47 subjects to the two groups 100 times, 
assuming the treatment received doesn’t affect each individual’s creativity rating. A 
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dotplot of the simulated difference (Intrinsic-Extrinsic) 1526 
(Intrinsic — Extrinsic) jy mean creativity rating is shown. 
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Simulated difference (Intrinsic — Extrinsic) in 
mean creativity rating 


a. Why did researchers randomly assign the subjects to the two treatment groups? 
b. Estimate and interpret the P-value. 
c. What conclusion would you make? 


d. Based on your conclusion in part (c), could you have made a Type I error or a Type II 
error? Justify your answer. 


Sleep deprivation Does sleep deprivation linger for more than a day? Researchers 
designed a study using 21 volunteer subjects between the ages of 18 and 25. All 21 
participants took a computer-based visual discrimination test at the start of the study. Then 
the subjects were randomly assigned into two groups. The 11 subjects in one group were 
deprived of sleep for an entire night in a laboratory setting. The 10 subjects in the other 
group were allowed unrestricted sleep for the night. Both groups were allowed as much 
sleep as they wanted for the next two nights. On Day 4, all the subjects took the same 
visual discrimination test on the computer. Researchers recorded the improvement in time 
(measured in milliseconds) from Day 1 to Day 4 on the test for each subject.2+ The table 
shows summary Statistics for the two groups. 


Group Mean SD 
Unrestricted sleep 19.82 12.17 
Sleep-deprived 3.90 14.73 


We used software to randomly reassign the 21 subjects to the two groups 100 times, 
assuming the treatment received doesn’t affect each individual’s time improvement on the 
test. A dotplot of the simulated difference (Unrestricted — Sleep-deprived) in mean time 
improvement is shown. 


20 -15 -10 -5 0 5 10 15 20 25 


Simulated difference (Unrestricted — Sleep-deprived) 
in mean time improvement 


a. Explain why the researchers didn’t let the subjects choose whether to be in the sleep- 
deprivation group or the unrestricted sleep group. 


b. Etimate and interpret the P-value. 
c. What conclusion would you make? 


d. Based on your conclusion in part (c), could you have made a Type I error or a Type II 
error? Justify your answer. 


Multiple Choice Select the best answer for Exercises 69-72. 


69. A random sample of 30 words from Jane Austen’s Pride and Prejudice had a mean length 
of 4.08 letters with a standard deviation of 2.40. A random sample of 30 words from 
Henry James’s What Maisie Knew had a mean length of 3.85 letters with a standard 
deviation of 2.26. Which of the following is a correct expression for the 95% confidence 
interval for the difference in mean word length for these two novels? 


WS pane. anneal 400-—39.8b) + 2.576 (224. = 
a. (4.08-3.85)+2.576(2.4030+2.2630)isa0~ 020? = 20.0% ) (24 +a ) 


0.200 = 20,094(4-08 — 3.85) + 2.045 ( 242 + 226) 


305 
b. (4.08-3.85)+2.045(2.4030+2.2630)1526 v3" 30 
c. (4.08-3.85)+2.045(2.40229+2. 26229) i538 = 200 = 2000 
(4.08 — 3.85) + 2.045 (yV ee ) 
305 _ 0.200 = 20.0% 
d. (4.08-3.85)+2.045(2.40230+2.26230)i526 °"  ")" 
(4.08 — 3.85) + 2.045 (yV 240° a | 
0.200 = 20.0% 


e. (4.08-3.85)+2.576(2.40230+2. 26230)i500~ 


/ 9407 2.267 
(4.08 — 3.85) + 2.576 316 ( (3 30 30 + 2 | 


Exercises 70—72 refer to the following setting. A study of road rage asked random samples of 
596 men and 523 women about their behavior while driving. Based on their answers, each 
person was assigned a road rage score on a scale of 0 to 20. The participants were chosen by 
random digit dialing of phone numbers. The researchers performed a test of the following 


hypotheses: HO: ;M=F 1526 0.200 = 20.0% Hy : wm = HF versus Ha: UM#uF 1526 
[PM FUE. 


A, 


05 305 
= ” — 9.200 = 20.0% 


70. Which of the following describes a Type II error in the context of this study? 


71, 


72s 


a. Finding convincing evidence that the true means are different for males and females, 
when in reality the true means are the same 


b. Finding convincing evidence that the true means are different for males and females, 
when in reality the true means are different 


c. Not finding convincing evidence that the true means are different for males and 
females, when in reality the true means are the same 


d. Not finding convincing evidence that the true means are different for males and 
females, when in reality the true means are different 


e. Not finding convincing evidence that the true means are different for males and 
females, when in reality there is convincing evidence that the true means are different 


The P-value for the stated hypotheses is 0.002. Interpret this value in the context of 
this study. 


a. Assuming that the true mean road rage score is the same for males and females, there is 
a 0.002 probability of getting a difference in sample means equal to the one observed in 
this study. 


b. Assuming that the true mean road rage score is the same for males and females, there is 
a 0.002 probability of getting a difference in sample means at least as large in either 
direction as the one observed in this study. 


c. Assuming that the true mean road rage score is different for males and females, there is 
a 0.002 probability of getting a difference in sample means at least as large in either 
direction as the one observed in this study. 


d. Assuming that the true mean road rage score is the same for males and females, there is 
a 0.002 probability that the null hypothesis is true. 


e. Assuming that the true mean road rage score is the same for males and females, there is 
a 0.002 probability that the alternative hypothesis is true. 


Based on the P-value in Exercise 71, which of the following must be true? 


305 
; ——= 0.200 = 20.0% 
a. A 90% confidence interval for HM-pF1526 HM 


~ FF will contain 0. 


d. A 99.9% confidence interval for M-pFi526 : 


305 


b. A 95% confidence interval for p»M-pFis26 "> 20.0 — HE will contain 0. 
, 35 _ 0.200 = 20.0% pr, — ; 
c. A 99% confidence interval for pM-pF 1526 HM ~ HF will contain 0. 
305 


0.200 = 20.0% 1. — 
HM ~ HF will contain 0. 


e. It is impossible to determine whether any of these statements is true based only on the 


P-value. 


Recycle and Review 


73. 


74. 


Quality control (2.2, 5.3, 6.3, 7.3) Many manufacturing companies use statistical 
techniques to ensure that the products they make meet standards. One common way to do 
this is to take a random sample of products at regular intervals throughout the production 
shift. Assuming that the process is working properly, the mean measurement x — 


305 
——= 0.200 = 20.0% b cpetinaes 
1526 © from a random sample varies according to a Normal distribution with 


305 305 
——— = 0.200 = 20.0% ,- er ——= 0.200 = 20.0 %q— 
mean [1X 1526 Mz and standard deviation ox 1526 "ome 


question that follows, assume that the process is working properly. 


. For each 


a. What’s the probability that at least one of the next two sample means will fall more 
305 305 


——— = 0.200 = 20.0%2¢- ——— = 0.200 = 20.0% ),— 
than 20x 1526 20% from the target Mean px 1526 Mz ? 


b. What’s the probability that the first sample mean that is greater than px +20x 


305 
1526 


0.200 = 20.0’ + 205 is the one from the fourth sample taken? 


Plant managers are trying to develop a criterion for determining when the process is not 
working properly. One idea they have is to look at the 5 most recent sample means. If at 


oe 
least 4 of the 5 fall outside the interval (jx -ox , px +ox )is2e 27? = °00" 


(Mz — OF, Me +3) | they will conclude that the process isn’t working. 


c. Find the probability that at least 4 of the 5 most recent sample means fall outside the 
interval, assuming the process is working properly. Is this a reasonable criterion? 
Explain your reasoning. 


Stop doing homework! (4.3) Researchers in Spain interviewed 7725 13-year-olds about 
their homework habits—how much time they spent per night on homework and whether 
they got help from their parents or not—and then had them take a test with 24 math 
questions and 24 science questions. They found that students who spent between 90 and 
100 minutes on homework did only a little better on the test than those who spent 60 to 70 
minutes on homework. Beyond 100 minutes, students who spent more time did worse than 
those who spent less time. The researchers concluded that 60 to 70 minutes per night is the 
optimum time for students to spend on homework.22 Is it appropriate to conclude that 
students who reduce their homework time from 120 minutes to 70 minutes will likely 
improve their performance on tests such as those used in this study? Why or why not? 


SECTION 10.3 Comparing Two Means: Paired Data 


LEARNING TARGETS By the end of the section, you should be able to: 


e Analyze the distribution of differences in a paired data set using graphs and 
summary Statistics. 


e Construct and interpret a confidence interval for a mean difference. 
e Perform a significance test about a mean difference. 


e Determine when it is appropriate to use paired t procedures versus two-sample t 
procedures. 


Section 10.2 showed you how to perform inference about the difference between two means 
when data come from two independent random samples or two groups in a randomized 
experiment. What if we want to compare means in a setting that involves measuring a 
quantitative variable twice for the same individual or for two individuals who are much alike? 
For instance, a researcher studied a random sample of identical twins who had been separated 
and adopted at birth. In each case, one twin (Twin A) was adopted by a high-income family and 
the other (Twin B) by a low-income family. Both twins were given an IQ test as adults. Here 


are their scores:22 


Evan Hurd Photography/Getty Images 


Pair 12 3 4 5 6 7 8 9 10 11 12 
Twin A’s IQ (high-income family) 128 104 108 100 116 105 100 100 103 124 114 112 
Twin B’s IQ (low-income family) 120 99 99 94 111 97 99 94 104 114 113 100 


Notice that these two groups of IQ scores did not come from independent samples of people 
who were raised in low-income and high-income families. The data were obtained from pairs 
of very similar people (identical twins), one living with a low-income family and the other 
living with a high-income family. This set of IQ scores is an example of paired data. 


DEFINITION Paired data 


Paired data result from recording two values of the same quantitative variable for 
each individual or for each pair of similar individuals. 


This section focuses on how to analyze paired data and how to perform inference about a 
true mean difference. 


Analyzing Paired Data 


The graph in Figure 10.7 shows a parallel dotplot of the IQ scores from the study of identical 
twins. We - = that the twins raised in high-income households had a higher mean IQ 


(x A=109. sine Tat = NE 5 = 109.5) than the twins raised in low-income households 


(x B=103. 6671506 = 0.200 = 20.0455 = 103. 667), SS is a similar amount of variability in IQ 


0.200 = 20. O%5, = = 9, 4Tand sB=8.66 


05 
scores for these two groups of twins: sA=9. ATiBae 


~ 0.200 = 20.0%s = 8.66 But with so much overlap between the groups, the difference in 


means does not seem statistically significant. 


Twin A ° 
(high-income family) —____*"" * *** __* * _ 
Twin B .: 
° eee ¢@ 


(low-income family) a 
90 95 100105 110 115 120 125 130 


IQ score 
FIGURE 10.7 Parallel dotplots of the IQ scores for pairs of identical twins raised in high-income 
(Twin A) and low-income (Twin B) households. 


The previous analysis ignores the fact that these are paired data. Let’s look at the difference 
in IQ scores for each pair of twins. 


Pair 1 2 3 4 5 6 7 8 9 10 11 12 
Twin A’s IQ (high-income family) 128 104 108 100 116 105 100 100 103 124 114 112 
Twin B’s IQ (low-income family) 120 99 99 94 111 97 99 94 104 114 113 100 
Difference (A — B) 8 5 9 6 5 8 1 6 -1 10 1. 22 


The dotplot in Figure 10.8 displays these differences. Almost all of the differences (11 out of 
12) are positive. This graph gives strong evidence that identical twins raised in a high-income 
household have higher IQ scores as adults, on average, than identical twins raised in a low- 
income household. 


a e o +4 : ° 
—2 0 2 4 6 8 10 12 
Difference (Twin A — Twin B) in IQ scores 


FIGURE 10.8 Dotplot of difference in IQ scores for each pair of twins. 


The mean difference in IQ scores is 
305 
x diff=x A-B=8+5+9+ --- +1+1212=7012=5.833 points1526 
8+5+9+--+14+12 70 A 
——z &x“S 5.833 points 
This value tells us that the IQ score of the twin in each pair who was raised in a high-income 
household is 5.83 points higher than the twin who was raised in a low-income household, on 
average. 


= 0.200 = 20.0% 


Lait = Tap 


305 — 0.200 = 20.0% 
The standard deviation of the difference in IQ scores is sdiff=3.931526 ~“"~ 


Sait = 3.93 points. This value is much smaller than the standard deviations we computed earlier 


305 _ 0.200 = 20.0% 
when we (incorrectly) viewed the two groups of twins as unrelated: sA=9.471526 ~~“ 


5 ° y 
8a = 9.47 points and sB=8.661526 — 0.200 = 20.0%5, = 8.66 points. yp Remember: The proper 


method of analysis depends on how the data are produced. 


ANALYZING PAIRED DATA 


To analyze paired data, start by computing the difference for each pair. Then make a graph 
305 


= — = = 4 Ga 
of the differences. Use the mean difference x diff1526 0.200 = 20-097 nie and the standard 
0.200 


305 
deviation of the differences sdiffi526 — 


= 20.0% ,. ae 
"Sdiff as summary statistics. 


EXAMPLE | Math and music (>) 
Analyzing paired data 


gawrav/Getty Images 
PROBLEM: Does music help or hinder performance in math? Student researchers Abigail, 


Carolyn, and Leah designed an experiment using 30 student volunteers to find out. Each 
subject completed a 50-question single-digit arithmetic test with and without music playing. 


For each subject, the order of the music and no music treatments was randomly assigned, and 
the time to complete the test in seconds was recorded for each treatment. Here are the data, 
along with the difference in time for each subject: 


Student D2 Ss TAS 5 6 7 Be Oy 10) A 12) 13) 14715 
Time with music (Sec) 83 119 77 75 64 106 70 69 60 76 47 97 68 77 48 


Time without music 70 106 71 67 59 112 83 69 65 83 38 90 76 68 50 
(sec) 


Difference (Music — TS ol Si 4b Oh oo oe Or 18S ONES 7 Ora ae oe =o 
Without music) 
Student 169 17 1819 2021 225 23 524-255 26 277 28 29 20 


Time with music (sec) 78 113 71 77 37 50 58 52 47 71146 44 53 57 39 


Time without music 73 93 59 70 39 52 60 54 51 60 141 40 56 53 37 
(sec) 


Difference (Music — O20 ee ieee ase es Poe ls eas 3 A) |e 
Without music) 


a. Make a dotplot of the difference (Music — Without music) in time for each subject to 
complete the test. 


b. Describe what the graph reveals about whether music helps or hinders math performance. 


c. Calculate the mean difference and the standard deviation of the differences. Interpret the 
mean difference. 


SOLUTION: 


a. 


20 -~15 -00 -— 0 § 1 15 20 
Difference (Music — Without music) 
in time (sec) 


b. There is some evidence that music hinders performance on the math test. ie of the 30 
subjects took longer to complete the test when listening to music. 


305 
9 =— 9 | 5 a — — 
c. Mean: x diff=x “Music— Without=2.81526 0.200 = 20.0°F aitt = EMusic — Without = 2-8 
305 
> — 0,200 = 20.0% ¢ 0 — 7 2 
seconds, SD: sdiff=7.49 1526 Saitt = 7-49 seconds, 


The time it took these 30 students to complete the arithmetic quiz with music was 2.8 
seconds longer, on average, than the time it took without the music. 


To get these summary statistics using the TI-83/84, start by typing the difference 


values into L1. Then do 1-Var Stats. 


FOR PRACTICE, TRY EXERCISE 75 


There are two ways that a statistical study involving a single quantitative variable can yield 
paired data: 


.. Researchers can record two values of the variable for each individual. 


’. The researcher can form pairs of similar individuals and record the value of the variable once 
for each individual. 


We have seen one example of each method so far. The observational study of identical twins’ 
IQ scores used Method 2, with the pairs consisting of identical twins who were raised 
separately—one in a high-income household and one in a low-income household. The 
experiment investigating whether music helps or hinders learning used Method 1, with each 
subject taking a 50-question single-digit arithmetic test twice—once with music playing and 
once without—in a random order. We referred to this type of experiment as a matched pairs 
design in Chapter 4. Note that it is also possible to carry out a matched pairs experiment using 
Method 2 if the researcher forms pairs of similar subjects and randomly assigns each treatment 
to exactly one member of every pair. 


I pee 9 20.0 %Au: 
Confidence Intervals for pdiffiszs~ °?°° ~ 7°°*#ir 


When paired data come from a random sample or a randomized experiment, the statistic x diff 


305 305 
———= 0.200 = 20.0% aie: «ppm = 0.200 = 20.0% pp 5:04 
1526 “diff is a point estimate for the true mean difference jdiffi526 "Hedi, 


Before constructing a confidence interval for a mean difference, we must check that the 
conditions for performing inference are met. Aside from the paired data requirement, these 


305 
os ; ——= 0.200 = 20.0% 
conditions are the same as the ones for constructing a confidence interval for 11526 : 


F in Section 8.3. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A MEAN DIFFERENCE 


e Random: Paired data come from a random sample from the population of interest or from 
a randomized experiment. 
a 
© 10%: When sampling without replacement, ndiff<0.1ONdiffi5s26 — Seen See 
naire < 0.10.Nuisr. 


e Normal/Large Sample: The population distribution of differences (or the true 
distribution of differences in response to the treatments) is Normal or the number of 
= 0.200 = 20.0% (naise > 30) 


305 
differences in the sample is large (ndiff=30)1526 
(true) distribution of differences has unknown shape and the number of differences in the 
sample is less than 30, a graph of the sample differences shows no strong skewness or 
outliers. 


. If the population 


The Random condition reminds us that we need paired data to construct a confidence 


interval for dif tiaae ~ 0.200 = 20.0uaite Both the 10% and Normal/Large Sample conditions 
emphasize that the proper method of analyzing paired data is to focus on the differences within 
each pair. When the conditions are met, we can proceed to calculations. 

All of the inference procedures you have learned so far require that individual observations 
be independent. That’s why we check the 10% condition when sampling without replacement. 
For paired data, the two values of the response variable in each pair are generally not 
independent. After all, we are measuring the same quantitative variable twice for one individual 
or for two very similar individuals. Knowing the value of the variable for one of the 
measurements should help us predict the value for the other measurement. What is important is 
that the difference values be independent. 


You learned how to construct a confidence interval for a population mean 


305 
——-= 0.200 = 20.0%), - — 
1526 " in Chapter 8. Assuming that the population standard deviation o 
305 305 
= 0.200 = 20.0%, « ~~ 0.200 = 20.0%, . 
1526 “a is unknown, we can calculate a one-sample t1526 “t interval for 


the mean. The appropriate formula is 
_ Kall 
statistic+(critical value)-(standard deviation of statistic)=x +t*sxn1526 — 
statistic + (critical value) - (standard deviation of statistic) 


0.200 = 20.0% 


8x 


=r it*— 


305 
cs —— 20.0% . 
We find the critical value t*1526 ‘t’ for a given confidence level from a t 


TS enn vee 
distribution with df=n-1is26 °° *8°"df =n —1 using Table B or technology. 


Now we want to estimate the true mean difference pdiffi526 “Haitt based on a 
single sample of differences calculated from paired data. All we have to do is modify the above 
formula to fit the new setting: 


= 0.200 = 


= 0.200 = 20.02 


TT can wre 

statistict(critical value)-(standard deviation of statistic) =x difftt«sdiffndiff 1526 Seen Sey 
305 = Zaire + t* ane 

statistic + (critical value) - (standard deviation of statistic) j-5¢ = 9-200 = 20.0% “ / Ndift 


This can be referred to as a one-sample t interval for a mean difference or as a paired t 
interval for a mean difference. 


ONE-SAMPLE t INTERVAL FOR A MEAN DIFFERENCE (PAIRED t 
INTERVAL FOR A MEAN DIFFERENCE) 


305 

When the conditions are met, a C% confidence interval for pdiff1526 © 
= . diff 

2 HF aman anne al +t 

x diff+tssdiffndiffis2e “~~ ~~" diff 


0.200 = 20.0% aise i 


305 


= 0.200 = 
where t*1526 


20-0" is the critical value with C% of its area between —t* 


305 305 
= 0.200 = 20.0% _4+ = 0.200 = 20.0% 4 ee . ; 
1526 °—t" and t*1526 ‘t” for the t distribution with ndiff-1 
305 
= 0.200 = 20.0% s.<. — 
1526 *naitt — 1 degrees of freedom. 


As with any inference procedure, follow the four-step process. 


EXAMPLE | Which twin is smarter? (>) fi. 
4 STEP 
Confidence interval for a mean difference 


Jodi Cobb/Getty Images 


PROBLEM: The data from the random sample of identical twins are shown again in the 
following table. Construct and interpret a 95% confidence interval for the true mean 
difference in IQ scores among twins raised in high-income and low-income households. 


Pair 1 2 3 4 5 6 7 8 9§ 10 11 12 
Twin A’s IQ (high-income family) 128 104 108 100 116 105 100 100 103 124 114 112 
Twin B’s IQ (low-income family) 120 99 99 94111 97 99 94 104 114 113 100 
Difference (A — B) S70 5) 1 O° OV6 oe So 6 1 10 de 2 
SOLUTION: 


Follow the four-step process! 


305 canes 
STATE: 95% C4 for pdiff 1526 — 0.200 = 20.0enait = the true mean difference (High income — 
Low income) in 1Q scores for pairs of identical twins raised in separate households. 

by fe 


Be sure to indicate the order of subtraction when defining the parameter. Then you 


can mimic the wording in your conclusion. 


a 
PLAN: One-sample t interval for wdiffis26— C200 © SUDO ps set 


© Random? Random sample of 12 pairs of identical twins, one raised in a high-income 
household and the other in a low-income household. / 


© 10%: Assume 12<10% of all pairs of identical twins raised in separate households. 4 


e Normal/Large Sample? The number of differences is small, but the dotplot doesn’t show 


any strong skewness OT outliers. / 


-4 0 2 4 6 g 10 12 
Difference (Twin A — Twin B) in IQ scores 
305, en nem 
DO: x diff=5.833, Sdiff=3.93, ndiff=121526 — 0.200 = 20.003 aire = 5.833, Saree = 3.93, nag = 12 
305 
with 95% confidence and df=12-1=11,t* =2.201.1526 
ay = 12-2 = 31,8 = 2.201; 


The Tinterval function on the Tl-83/84 gives (3.338, 8.329). 


: 83342 2013 te = (2.200 = 20.0 %0-833 + 2.201 — 


= 0.200 = 20.0% 


8 . 
E aig¢ +t se 


305 aud 
x difftt«sdiffndiffiszs 02? = °°” Vl diff 


305 canted 

=5.83342.4971526 0 = 79-9%_ 5833 + 2.497 
ci 

= (3.336, 8.330)1526 

CONCLUDE: We are 95% confident that the interval from 2.230 10 S.230 captures the 

305 
true mean difference (High income—Low income)1526 
(High income — Low income) 


= 0.200 = 20.0%= (3.336, 8.330) 


= 0.200 = 20.0% 


in IQ scores among pairs of identical twins raised in 
separate households. 


FOR PRACTICE, TRY EXERCISE 79 
The 95% confidence interval in the example suggests that IQs are between 3.336 and 8.330 
points higher, on average, for twins raised in high-income households. However, we can’t 


conclude that household income level caused an increase in average IQ score because this was 
an observational study, not an experiment. 


CHECK YOUR UNDERSTANDING 


. 


y The data from the matched pairs experiment investigating whether music helps or 
hinders math performance (page 675) are reproduced here. 


Student 1 2s, A eo OG OPO, Al aie to 14> 15 
Time with music (sec) 83 119 77 75 64106 70 69 60 76 47 97 68 77 48 


Time without music 70 106 71 67 59112 83 69 65 83 38 90 76 68 50 
(sec) 


Difference (Music — 13 16.6178 ) S=6 =1o OY HS 07 9 5O. ih 8 =8— Oe 2 
Without music) 


Student 1o7 4771819 —20 28 22 23 24 525.260 27 228: 29" 730 
Time with music (sec) 78 113 71 77 37 50 58 52 47 71146 44 53 57 39 


Time without music 73, 93 59° 70 39 52 60 54 51 60 141 40 56 53 37 
(sec) 


Difference S20 A237 -=2) 2 = 2 2 4 CS 4 RR a ee 
(Music—Without 
music) 


1. Construct and interpret a 90% confidence interval for the true mean difference. 


2. What does the interval in Question 1 suggest about whether music helps or hinders math 
performance? Explain your answer. 


ae 
Significance Tests for pdiffiszs~ °° = 2°.°*4Hi 


When paired data come from a random sample or a randomized pene we may want to 


perform a significance test about the true mean difference ucitfiaae 0.200 = 20.0% wait The null 
hypothesis has the general form 


305, pein i 
HO: udiff=hypothesized value 1526 ~ 0.200 = 20.0°Ho : Maite = hypothesized value 


We’ll focus on situations where the hypothesized difference is 0. Then the null hypothesis says 
that the true mean difference is 0: 


305 
+ ee _qwoe= 0.200 = 20.0%, = ppace = 
HO: pdiff=01526 0.200 = 20.07 : pratt = 0 


The alternative hypothesis says what kind of difference we expect. 
305 
The conditions for performing a significance test about pdiffi526 — 
same as the ones for constructing a confidence interval for a mean difference. 


| CONDITIONS FOR PERFORMING A SIGNIFICANCE TEST ABOUT A | 


on) — 9 Pee 
0.200 = 20.0% 1 4:4 are the 


MEAN DIFFERENCE 


e Random: Paired data come from a random sample from the population of interest or from 
a randomized experiment. 


305 _ 0.200 = 20.0% 
© 10%: When sampling without replacement, ndiff<0.10Ndiffis26 ~~ ee 


naire < 0. 10.Nair . 


Normal/Large Sample: The population distribution of differences (or the true 
distribution of differences in response to the treatments) is Normal or the number of 


> — 9.200 = 20.0% (naire > 30) 


differences in the sample is large (ndiff=30)1526 . If the population 


(true) distribution of differences has unknown shape and the number of differences in the 
sample is less than 30, a graph of the sample differences shows no strong skewness or 
outliers. 


When conditions are met, we can carry out a one-sample t test for a mean difference (also 
known as a paired t test for a mean difference). The standardized test statistic is 
5 ae" 
Sdiff 


305 sees 
t=x diff-Osdiffndiffisze "2° ~ 7°°” diff 


We can use Table B or technology to find the P-value from the ¢ distribution with df=ndiff-1 
305 


ONE-SAMPLE t TEST FOR A MEAN DIFFERENCE (PAIRED t TEST FOR 
A MEAN DIFFERENCE) 


305 sei 
Suppose the conditions are met. To test the hypothesis HO: pdiff=01526 ~ See Say 
Ho : paitt = 0, compute the standardized test statistic 
i 
Sdiff 


305 . scaaa 
t=x diff-Osdiffndiffisze "2° ~ 7°°” diff 


Find the P-value by calculating the probability of getting a t statistic this large or larger in 


305 
the direction specified by the alternative hypothesis Ha1526 


305 
a pq seme = 0.200 = 20.0%) 
distribution with ndiff-1is2e "7° ~ 7°-°*naite — 1 degrees of freedom. 


= 0.200 = 20.0% F7, _Use the t 


As with any inference procedure, be sure to follow the four-step process when performing a 
test about a mean difference. 


EXAMPLE |Is caffeine dependence real? (>) 
Significance test about a mean difference 


MrPants/Getty Images 


PROBLEM: Researchers designed an experiment to study the effects of caffeine 
withdrawal. They recruited 11 volunteers who were diagnosed as being caffeine dependent 
to serve as subjects. Each subject was barred from coffee, colas, and other substances with 
caffeine for the duration of the experiment. During one 2-day period, subjects took capsules 
containing their normal caffeine intake. During another 2-day period, they took placebo 
capsules. The order in which subjects took caffeine and the placebo was randomized. At the 
end of each 2-day period, a test for depression was given to all 11 subjects. Researchers 
wanted to know whether being deprived of caffeine would lead to an increase in 


depression.24 


The table displays data on the subjects’ depression test scores. Higher scores show more 
symptoms of depression. 


Subject dre 48) 6A 5 On Fel S18 oe Ue 10041 

Depression (caffeine) Be POSS 6 oS Or eo Ounre Os aie ee ene 

Depression (placebo) 16923 57 14 240 6 3 15 12 0 
305 


Do the data provide convincing evidence at the a=0.051526 — See Sa ey = 0,05 
significance level that caffeine withdrawal increases depression score, on average, for 
subjects like the ones in this experiment? 


SOLUTION: 


Follow the four-step process! 


305 


STATE: Ho: pdiff=01526 0.200 = 20.0%Hy : pai = 0 


Start by calculating the difference in depression test scores for each subject. We 


chose the order placebo — caffeine for subtraction so that most values would be 
positive. 


es 
Ha: pidiff+0is26 07° ~ 7°.°°Ha = batt + 0 


305 ff 
where pdiffi526 0.200 = 20.0<uaitt = the true mean difference (Placebo - Caffeine) in 


depression test score for subjects like these. Because no significance level is given, well use 
305 OSES 
= 0.01506 0.200 = 20.0%, 0.05. 
305 


0.200 = 20.0% pp 6c 
PLAN: Paired t test for pdiffis26 Lait 


e Random: Researchers randomly assigned the treatments — placebo then caffeine, 
caffeine then placebo—to the subjects. / 


® Normal/Large Sample: The sample size is small, but the histogram of differences doesn’t 


show any outliers OY strong skewness. / 


Note that we do not have to check the 10% condition because we are not sampling 


without replacement from a finite population. 


Change in depression 
(Placebo - Caffeine) 


Because there is no strong skewness or outliers, it is plausible that the true 


distribution of difference (Placebo — Caffeine) in depression test scores for subjects 
like these is Normal. 


305 305 
neatemnmnen ) — 9 a a val * Bevuderll 2) = D 1 % co — 
DO: x dt =7.3641526 0.200 = 20.0! fare = 7.364 sdiff= 6.9181526 0.200 0.0 Saift 6.918 


oo 
Brostoll 9 50.0 % <= 
ndiff = 111526 0.200 = 20.0% po. = 11 


0.200 = 20.0% 


305 
The sample result gives some evidence in favor of Ha:udiff>01526 


35 =.) | . 
Hq : Hatt > 9 because 7.364>01526 7" = 7°-°"7.364 > 0. 


305 6.918 
—— = 0.200 = 20.0% an 
®@ t=7.364-06.91811 =3.531526 vil 


305 ; ; 
(=x “diff-osdiffndiffis26 = 0.200 = 20.0% 


305 
—— = ().200 = 20.0%, cot can 
e P-value df =11-1=101526 df = 11-—1= 10 


t distribution 
with df = 10 


).200 = 20.0 voy 


305 
The TTest function on the TI-83/84 gives t=3.53i526 = 3.53 and P-value 


305 
Sesuders » — 9) % 
=0.00271526 > 2-700 = 29-9 9.0027 


Using Table B: P-value is between 0.0025 and 0.005. 


rr 

Using Technology: tedf (lower: 3.53, upper: 1000, df: 10)=0.002 71526 — SEEY = SU — 0027 
_, rs 

CONCLUDE: Because the P-value of 0.0027<a = 0.051526 = Mare = 299.0027 <a = 0.05) 


305 sane 
we reject Hoi526 OAH 0, We have convincing evidence that caffeine withdrawal 
Increases depression test score, on average, for subjects like the ones in this experiment. 


= 0.200 = 
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Why did the researchers randomly assign the order in which subjects received placebo and 
caffeine in the example? Researchers want to be able to conclude that any statistically 
significant change in depression score is due to the treatments themselves and not to some other 
variable. One obvious concern is the order of the treatments. Suppose that caffeine were given 
to all the subjects during the first 2-day period. What if the weather were nicer on these 2 days 
than during the second 2-day period when all subjects were given a placebo? Researchers 


wouldn’t be able to tell if a large increase in the mean depression score is due to the difference 
in weather or due to the treatments. Random assignment of the caffeine and placebo to the two 
time periods in the experiment should help ensure that no other variable (like the weather) is 
systematically affecting subjects’ responses. 


Because researchers randomly assigned the treatments, they can make an inference about 
cause and effect. The data from this experiment provide convincing evidence that depriving 
caffeine-dependent subjects like these of caffeine causes an increase in depression scores, on 


average. 
Bll 
The significance test in the example led to a simple decision: reject H0O1526 


We know from past experience that a confidence interval gives more information than a test— 
it provides the entire set of plausible values for the parameter based on the data. A 90% 
0.200 = 20.0% 


0.200 = 20.0% Hy 


305 
confidence interval for pdiff1526 diff js 


Notice that we used a 90% confidence level here, which corresponds to the one-sided 


significance test with a = 0.05. 


305 a 
x difftt«sdiffndiff=7.364+1.8126.91811=7.364+3.780=(3.584,11.144)is26 | 0790 = °°." 


Sdi 6.918 
ee ne 1.812 —— = 7.364 + 3.780 = (3.584,11.144) 
Vaift Vil 


We are 90% confident that the interval from 3.584 to 11.144 captures the true mean difference 
(Placebo — Caffeine) in depression test score for caffeine-dependent individuals like the ones in 
this study. The interval suggests that caffeine deprivation results in an average increase in 
depression test score of between 3.584 and 11.144 points for subjects like these. 


CHECK YOUR UNDERSTANDING 


i. 


y Consumers Union designed an experiment to test whether nitrogen-filled tires would 
maintain pressure better than air-filled tires. They obtained two tires from each of several 
brands and then randomly assigned one tire in each pair to be filled with air and the other to 
be filled with nitrogen. All tires were inflated to the same pressure and placed outside for a 
year. At the end of the year, Consumers Union measured the pressure in each tire. The 
pressure loss (in pounds per square inch) during the year for the tires of each brand is shown 
in the table.22 


Courtesy of Judy Starnes 


Brand Air Nitrogen (|Brand Air Nitrogen 
BF Goodrich Traction T/A 7.6 ae Pirelli P6 Four Seasons 4.4 4.2 
HR 
Bridgestone HP50 (Sears) 3.8 Ze Sumitomo HTR H4 1.4 2A 
Bridgestone Potenza GO09 3.7 1.6 Yokohama Avid H4S 4.3 3.0 
Bridgestone Potenza RE950 4.7 gE BF Goodrich Traction T/A V 5.5 3.4 
Bridgestone Potenza EL400 2.1 1.0 Bridgestone Potenza RE950 4.1 2.8 
Continental Premier Contact 4.9 cs Continental ContiExtreme 5.0 3.4 
H Contact 
Cooper Lifeliner Touring 5.2 3:5 Continental ContiProContact 4.8 So 
SLE 
Dayton Daytona HR 3.4 3.2 Cooper Lifeliner Touring oz 2.5 
SLE 
Falken Ziex ZE-512 4.1 ree) General Exclaim UHP 6.8 aT 
Fuzion Hrl Zon Ze Hankook Ventus V4 H105 SL 1.4 
General Exclaim Sut 3.4 Michelin Energy MXV4 Plus 25 5 
Goodyear Assurance 3.8 C2 Michelin Pilot Exalto A/S 6.6 22 
Tripletred 
Hankook Optimo H418 3.0 0.9 Michelin Pilot HX MXM4 2.2 2.0 
Kumho Solus KH16 6.2 3.4 Pirelli P6 Four Seasons 25 27 
Michelin Energy MXV4 2.0 1.8 Sumitomo HTR+ 4.4 Sal 
Michelin Pilot XGT H4 1.1 0.7 


Paired Data or Two Samples? 


In Section 10.2, we used two-sample t procedures to compare the mean change in blood 
pressure for healthy men who take calcium and for healthy men who take a placebo. These 
methods require data that come from independent random samples from the two populations of 
interest or from two groups in a randomized experiment (as was the case in the preceding 
example). When the conditions are met, we can perform inference about the difference 1-2 


0.200 = 20.0%), — to - 
1 — #2 in the true means. 


1526 


In this section, we used paired t procedures to compare the mean change in depression 


scores for caffeine-dependent individuals when taking caffeine versus when taking a placebo. 
These methods require paired data that come from a random sample from the population of 
interest or from a randomized experiment (as was the case in this example because the same 11 


subjects received both treatments). When the conditions are met, we can perform inference 
305 


= 0.200 = 20.0% 
about the true mean difference pdiffi526 Hl 


diff, Gj The proper inference method 


depends on how the data were produced. 


EXAMPLE | Are you all wet? (>) 
Two samples or paired data? 


PROBLEM: In each of the following settings, decide whether you should use two-sample t 
procedures to perform inference about a difference in means or paired t procedures to 
perform inference about a mean difference. Explain your choice. 


a. Before exiting the water, scuba divers remove their fins. A maker of scuba equipment 
advertises a new style of fins that is supposed to be faster to remove. A consumer 
advocacy group suspects that the time to remove the new fins may be no different than the 
time required to remove old fins, on average. Twenty experienced scuba divers are 
recruited to test the new fins. Each diver flips a coin to determine if they wear the new fin 
on the left foot and the old fin on the right foot, or vice versa. The time to remove each 
type of fin is recorded for every diver. 


b. To study the health of aquatic life, scientists gathered a random sample of 60 White 
Piranha fish from a tributary of the Amazon River during one year. The average length of 
these fish was compared to a random sample of 82 White Piranha from the same tributary 
a decade ago. 


c. Can a wetsuit deter shark attacks? A researcher has designed a new wetsuit with color 
variations that are suspected to deter shark attacks. To test this idea, she fills two identical 
drums with bait and covers one in the standard black neoprene wetsuit and the other in the 
new suit. Over a period of one week, she selects 16 two-hour time periods and randomly 
assigns 8 of them to the drum in the black wetsuit. The other 8 are assigned to the drum 
with the new suit. During each time period, the appropriate drum is submerged in waters 
that sharks frequent and the number of times a shark bites the drum is recorded. 


SOLUTION: 


a. Paired t procedures. The data come from two measurements of the same variable (time 
to remove fin) for each diver. 


b. Tr wo-sample € procedures. The data come from independent random samples of White 
Piranha in two different years. 


If the sample sizes are different, it can’t be paired data. 


c. 1) wo-sample if procedures. The data come from two groups in a randomized experiment, 
with each group consisting of 8 time periods in which a drum with a specific wetsuit 
(standard or new) was randomly assigned to be submerged. 


FOR PRACTICE, TRY EXERCISE 91 


When designing an experiment to compare two means, a completely randomized design 
may not be the best option. A matched pairs design might be a better choice, as the following 
activity shows. 


ACTIVITY Get your heart beating! 


Are standing pulse rates higher, on average, than sitting pulse rates? In this activity, you will 
perform two experiments to try to answer this question. 


Experiment #1: Completely randomized design 


1. Your teacher will randomly assign half of the students in your class to stand and the other 
half to sit. Once the two treatment groups have been formed, students should stand or sit 
as required. Then they should measure their pulses for 1 minute. Have the subjects in each 
group record their data on the board. 


2. Analyze the data for the completely randomized design. Make parallel dotplots and 
calculate the mean pulse rate for each group. Is there some evidence that standing pulse 
rates are higher, on average? Explain your answer. 


Experiment #2: Matched pairs design 


3. To produce paired data in this setting, each student should receive both treatments in a 
random order. Because you already sat or stood in Step 1, do the opposite now. As before, 
everyone should measure his or her pulse for 1 minute after the treatment is imposed (i.e., 
once everyone is standing or sitting). Then each subject should calculate his or her 
difference (Standing — Sitting) in pulse rate and record this value on the board. 


4. Analyze the data for the matched pairs design. Make a dotplot of these differences and 
calculate their mean. Is there some evidence that standing pulse rates are higher, on 
average? Explain your answer. 


5. Which design provides more convincing evidence that standing pulse rates are higher, on 
average, than sitting pulse rates? Justify your answer. 


A statistics class with 24 students performed the “Get your heart beating” activity. Figure 
10.9 shows a dotplot of the pulse rates for their completely randomized design. The mean pulse 


305 
rate for the standing group is x 1=74.835;1526 — 
= 0.200 = 20.0% %. 


0.200 = 20.0) = 74.83; the mean for the sitting 


305 
group is x 2=68.331526 = 68.33 So the average pulse rate is 6.5 beats per 
minute higher in the standing group. However, the variability in pulse rates for the two groups 


305 ; . or 
creates a lot of overlap in the dotplots. A two-sample t test of HO: p1—p2=01526 ~ Siaye se Sean 


a 7 
Ho : #1 — #2 = 0 versus Ha: p1—2>01526 70° = 7-9" Ha oi — M2 > O yields t=1.42 
305 
1526 1.42 and a P-value of 0.09. These data do not provide convincing evidence 


that standing pulse rates are higher, on average, than sitting pulse rates for people like the 
students in this class. 
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FIGURE 10.9 Parallel dotplots of the pulse rates for the standing and sitting groups ina 
statistics class’s completely randomized design. 


What about the class’s matched pairs design? Figure 10.10 shows a dotplot of the 
difference (Standing — Sitting) in pulse rate for each of the 24 students. We can see that 21 of 


the 24 students recorded a positive difference, indicating that their standing pulse rate was 
305 


higher. The mean difference is x diff=6.831526 


305 

-c¢_p——= 0.200 = 20.0% FH], - 

one-sample t test of HO: pdiff=01526 *H : 
305 305 

= 0.200 = 20.0% = pace > ——= 0.200 = 20.0% 

1526 Ha: paitt > 9 gives t=6.4831526 °°" ~ °°-°*t = 6.483 and a P-value of 


approximately 0. These data provide very convincing evidence that standing pulse rates are 
higher, on average, than sitting pulse rates for people like the students in this class. 
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Difference (Standing — Sitting) in pulse rate 
FIGURE 10.10 Dotplot of the difference (Standing — Sitting) in pulse rate for each student in a 
statistics class’s matched pairs design. 


Let’s take one more look at the two figures. Notice that we used the same scale for both 
graphs. The matched pairs design greatly reduced the variability in the response variable by 
accounting for a big source of variability—the differences between individual students. That 
made it easier to detect the fact that standing causes an increase in the average pulse rate. In 
other words, using a paired design resulted in more power. With the large amount of variability 
in the completely randomized design, we could not draw such a conclusion. 


Section 10.3 Summary 


e Paired data result from recording two values of the same quantitative variable for each 
individual or for each pair of similar individuals. 


e To analyze paired data, start by computing the difference for each pair. Then make a graph 


305, ne 
of the differences. Use the mean difference x diffis2e 7°” ~ *°°“@aitr and the standard 


).200 = 
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ae ; = ( 20.0% ¢ ,. = 
deviation of the differences sdiff1526 Sdiff as Summary Statistics. 


0.200 = 20.0/ 


305 a 
e Before estimating or testing a claim about pidiff1526 ~ "diff, check that these 


conditions are met: 


« Random: Paired data come from a random sample from the population of interest or from 
a randomized experiment. 
a 
2 10%: When sampling without replacement, ndiff<0.10Ndiffis26 — See = Sees 
naise < 0.10Naisr. 


« Normal/Large Sample: The population distribution of differences (or the true distribution 
of differences in response to the treatments) is Normal or the number of differences in the 


TB ean en ney 
sample is large (ndiff=30)1526 ~ 0.200 = 20.0%(naitr > 30) Tf the population (true) distribution 


of differences has unknown shape and the number of differences in the sample is less than 
30, a graph of the sample differences shows no strong skewness or outliers. 


e When the conditions are met, a C% confidence interval for the true mean difference diff 
305 _ 0.200 - 20.0% 4 


1526 diff js 
8 ° 
= — Sift 
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x diff+t«sdiffndiffis26 “~~ ~~~ J diff 
305 _ 6.200 = 20.0% - ie ee ; ; 
where t*1526 t* is the critical value for a t distribution with df=ndiff—1 


305 305 
= 0.200 = 20.0% af — m.. ; 
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t* and t* 


. This is called a one-sample t interval for a mean difference or a paired 


t interval for a mean difference. 
ks 
e A significance test of HO: pdiff=01526 Haitt = 9 is called a one-sample t 


test for a mean difference or a paired t test for a mean difference. The standardized test 
Statistic is 


1.200 = 20.0% A, : 


, “diff? 
S diff 
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t=x diff—Osdiffndiffises = °20°=209* J" diff 


When the Normal/Large Sample condition is met, find the P-value using the t distribution 
ee 

with degrees of freedom equal to ndiff—11526 ~ See age — 

e The proper inference method depends on how the data were produced. For paired data, use 


).200 = 20.0 Tous 


5 
one-sample t procedures for pdiffi526 diff, For quantitative data that come from 


independent random samples from two populations of interest or from two groups in a 


05 
; ——= 0.200 = 20.0%), — yy. 
randomized experiment, use two-sample t procedures for 1-12 1526 Hi ~ #2, 


Section 10.3 Exercises 


75. PQ 674 |») Groovy tires Researchers were interested in comparing two methods for 


estimating tire wear. The first method used the amount of weight lost by a tire. The second 
method used the amount of wear in the grooves of the tire. A random sample of 16 tires 
was obtained. Both methods were used to estimate the total distance traveled by each tire. 
The table provides the two estimates (in thousands of miles) for each tire 26 


Tire Weight Groove Diff. Tire Weight Groove Diff. 
aT 45.9 35.7 10.2 9 30.4 22-2 7.3 
2 41.9 39.2 2.1 10 27.3 23.1 3.6 
3 37.0 gi.1 6.4 11 20.4 20.9 =0.5 
4 33.4 28.1 5.2 IZ 24.5 16.1 8.4 
5 31.0 24.0 7.0 13 20.9 19.9 1.0 
6 30.5 28.7 1.8 14 18.9 15.2 S21 
7 30.9 25.0 5.0 15 13.7 11.5 22 
8 31.9 23.3 8.6 16 11.4 11.2 0.2 


a. Make a dotplot of the difference (Weight — Groove) in the estimate of wear for each tire 
using the two methods. 


b. Describe what the graph reveals about whether the two methods give similar estimates 
of tire wear, on average. 


c. Calculate the mean difference and the standard deviation of the differences. Interpret 
the mean difference. 


76. Well water Trace metals found in wells affect the taste of drinking water, and high 


concentrations can pose a health risk. Researchers measured the concentration of zinc (in 
milligrams/liter) near the top and the bottom of 10 randomly selected wells in a large 
region. The data are provided in the following table.2Z 


Well 1 2 3 4 5 6 7 8 9 10 
Bottom 0.430 0.266 0.567 0.531 0.707 0.716 0.651 0.589 0.469 0.723 
Top 0.415 0.238 0.390 0.410 0.605 0.609 0.632 0.523 0.411 0.612 


Difference 0.015 0.028 0.177 0.121 0.102 0.107 0.019 0.066 0.058 0.111 


a. Make a dotplot of the difference (Bottom — Top) in the zinc concentrations for these 10 
wells. 


b. Describe what the graph reveals about whether the amount of zinc at the top and bottom 
of the wells is the same, on average. 


c. Calculate the mean difference and the standard deviation of the differences. Interpret 
the mean difference. 


77. Flight times Emirates Airline offers one outbound flight from Dubai, United Arab 
Emirates, to Doha, Qatar, and one return flight from Doha to Dubai each day. An 
experienced Emirates pilot suspects that the Dubai-to-Doha outbound flight typically takes 
longer. To find out, the pilot collects data about these flights on a random sample of 12 
days. The table displays the flight times in minutes.28 


Day 1 2 3 4 5 6 7 8 9 10 11 12 

Outbound (Dubai 75 42 62 63 54 46 52 50 42 46 43 52 
305 eireris 

_ Teng = 0-200 = 20.0% _, Doha) 

Return (Doha 42 37 37 44 42 40 41 44 41 42 48 48 
305 _ 0.200 = 20.0% _. 

Dubai) 


a. Explain why these are paired data. 


b. A dotplot of the difference (Outbound — Return) in flight time for each day is shown. 
Describe what the graph reveals about whether the outbound or return flight takes 
longer, on average. 


—5 0 5 10 15 20 2 30 35 


Difference (Outbound — Return) 
in flight time (min) 


c. Calculate the mean difference and the standard deviation of the differences. Interpret 
the standard deviation. 


78. Literacy Do males have higher literacy rates than females, on average, in Islamic 
countries? The following table shows the percent of men and women who were literate in 
24 largely Muslim nations at the time of this writing.22 


Country Male literacy (%) Female literacy (%) 
Afghanistan 43.1 12.6 
Algeria 86.0 86.0 
Azerbaijan 99.9 99.7 
Bangladesh 62.0 53.4 
Egypt 82.2 65.4 
Indonesia 97.0 89.6 
Iran 91.2 82.5 


lraq 89.0 73.6 


ri: B 


80. 


81. 


Jordan 96.6 90.2 


Kazakhstan 99.8 99.3 
Kyrgyzstan 99.3 98.1 
Lebanon 93.4 86.0 
Libya 98.6 90.7 
Malaysia 95.4 90.7 
Morocco 76.1 57.6 
Pakistan 67.0 42.0 
Saudi Arabia 90.4 81.3 
Syria 86.0 73.6 
Tajikistan 99.8 99.6 
Tunisia 95.1 80.3 
Turkey 99.3 98.2 
Turkmenistan 99.3 98.3 
Uzbekistan 99.6 99.0 
Yemen 81.2 46.8 


a. Explain why these are paired data. 


b. A dotplot of the difference (Male — Female) in literacy rate for each country is shown. 
Describe what the graph reveals about whether males have higher literacy rates than 
females in these countries, on average. 
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Difference (Male — Female) in literacy rate 


c. Calculate the mean difference and the standard deviation of the differences. Interpret 
the standard deviation. 


pq 677 |») Groovy tires Refer to Exercise 75. Construct and interpret a 95% 


confidence interval for the true mean difference (Weight — Groove) in the estimates of tire 
wear using these two methods in the population of tires. 


Well water Refer to Exercise 76. Construct and interpret a 95% confidence interval for 
the true mean difference (Bottom — Top) in the zinc concentrations of the wells in this 
region. 


Does playing the piano make you smarter? Do piano lessons improve the spatial- 
temporal reasoning of preschool children? A study designed to investigate this question 
measured the spatial-temporal reasoning of a random sample of 34 preschool children 
before and after 6 months of piano lessons. The difference (After — Before) in the 


reasoning scores for each student has mean 3.618 and standard deviation 3.055.42 


a. Construct and interpret a 90% confidence interval for the true mean difference. 


b. Based on your interval from part (a), can you conclude that taking 6 months of piano 


82. 


83. 


84. 


85. 


lessons would cause an increase in preschool students’ average reasoning scores? Why 
or why not? 


No annual fee? A bank wonders if omitting the annual credit card fee for customers who 
charge at least $2400 in a year will increase the amount charged on its credit cards. The 
bank makes this offer to an SRS of 200 of its credit card customers. It then compares how 
much these customers charge this year with the amount that they charged last year. The 
mean increase in the sample is $332, and the standard deviation is $108. 


a. Construct and interpret a 99% confidence interval for the true mean increase. 


b. Based on the interval from (a), can you conclude that dropping the annual fee would 
cause an increase in the average amount spent by this bank’s credit card customers? 
Why or why not? 


Flight times Refer to Exercise 77. Explain why it is not appropriate to perform a paired t 
test about the true mean difference (Outbound — Return) in flight times between Dubai 
and Doha. Note: See Exercise 94 for an alternative approach to perform a significance test 
in this setting. 


Literacy Refer to Exercise 78. Explain why it is not appropriate to perform a paired t test 


id ‘ 20.0%(Malo _ : . 
about the true mean difference (Male - Female)is26_ °°"? ~ °° ‘(Male — Female) j, 


literacy rates in Islamic countries. 


pg 680 |») Drive-thru or go inside? Many people think it’s faster to order at the drive- 


thru than to order inside at fast-food restaurants. To find out, Patrick and William used a 
random number generator to select 10 times over a 2-week period to visit a local Dunkin’ 
Donuts restaurant. At each of these times, one boy ordered an iced coffee at the drive-thru 
and the other ordered an iced coffee at the counter inside. A coin flip determined who 
went inside and who went to the drive-thru. The table shows the times, in seconds, that it 
took for each boy to receive his iced coffee after he placed the order.44 


Visit Inside time Drive-thru time 
1 62 55 
2 63 50 
S 320 321 
4 105 110 
5 135 124 
6 55 54 
7 92 90 
8 75 69 
9 203 200 

10 100 103 


S05 wenn mer 
Do these data provide convincing evidence at the a=0.051526 0.200 = 20-9 = 0.05 level of 


a difference in the true mean service time inside and at the drive-thru for this Dunkin’ 


86. 


87. 


Donuts restaurant? 


Better barley Does drying barley seeds in a kiln increase the yield of barley? A famous 
experiment by William S. Gosset (who discovered the t distributions) investigated this 
question. Eleven pairs of adjacent plots were marked out in a large field. For each pair, 
regular barley seeds were planted in one plot and kiln-dried seeds were planted in the 
other. A coin flip was used to determine which plot in each pair got the regular barley seed 
and which got the kiln-dried seed. The following table displays the data on barley yield 
(pound per acre) for each plot.42 


Plot Regular Kiln 
1 1903 2009 
2 1935 1915 
3 1910 2011 
4 2496 2463 
5 2108 2180 
6 1961 1925 
7 2060 2122 
8 1444 1482 
9 1612 1542 

10 1316 1443 

11 1511 1535 

305 " 
Do these data provide convincing evidence at the a=0.05i526 "°° 20.0% a = 0.05 level 


that drying barley seeds in a kiln increases the yield of barley, on average? 


Music and memory Does listening to music while studying help or hinder students’ 
learning? Two statistics students designed an experiment to find out. They selected a 
random sample of 30 students from their medium-sized high school to participate. Each 
subject was given 10 minutes to memorize two different lists of 20 words, once while 
listening to music and once in silence. The order of the two word lists was determined at 
random; so was the order of the treatments. The difference (Silence — Music) 


305 wits, 
1596 ~ 0-200 = 20.0%(Silence — Music) 


in the number of words recalled was recorded for each 
subject. The mean difference was 1.57 and the standard deviation of the differences was 
2,70; 


a. If the result of this study is statistically significant, can you conclude that the difference 
in the ability to memorize words was caused by whether students were performing the 
task in silence or with music playing? Why or why not? 


305 
; fe ——= 0.200 = 20.0% 
b. Do the data provide convincing evidence at the a=0.01 1526 a= 0.01 


significance level that the number of words recalled in silence or when listening to 
music differs, on average, for students at this school? 


c. Based on your conclusion in part (a), which type of error—a Type I error or a Type II 
error—could you have made? Explain your answer. 


88. Friday the 13th Do people behave differently on Friday the 13th? Researchers collected 


89. 


90. 


91. 


data on the number of shoppers at a random sample of 45 grocery stores on Friday the 6th 
and Friday the 13th in the same month. Then they calculated the difference (subtracting in 
the order 6th minus 13th) in the number of shoppers at each store on these 2 days. The 

305 


mean difference is —46.51526 "2"? = 2°°—46.5 and the standard deviation of the 


differences is 178.0.42 


a. If the result of this study is statistically significant, can you conclude that the difference 
in shopping behavior is due to the effect of Friday the 13th on people’s behavior? Why 
or why not? 


305 
; a ——= 0.200 = 20.0% 
b. Do these data provide convincing evidence at the a=0.051526 a = 0.05 level 


that the number of shoppers at grocery stores on these 2 days differs, on average? 


c. Based on your conclusion in part (a), which type of error—a Type I error or a Type II 
error—could you have made? Explain your answer. 


Music and memory Refer to Exercise 87. 
a. Construct and interpret a 99% confidence interval for the true mean difference. If you 


already defined the parameter and checked conditions in Exercise 87, you don’t need to 
do them again here. 


b. Explain how the confidence interval provides more information than the test in 


Exercise 87. 
Friday the 13th Refer to Exercise 88. 
a. Construct and interpret a 90% confidence interval for the true mean difference. If you 


already defined parameters and checked conditions in Exercise 88, you don’t need to do 
them again here. 


b. Explain how the confidence interval provides more information than the test in 


Exercise 88. 


pg 683 |») Two samples or paired data? In each of the following settings, decide 


whether you should use two-sample t procedures to perform inference about a difference 
in means or paired t procedures to perform inference about a mean difference. Explain 


your choice. 


a. To test the wear characteristics of two tire brands, A and B, each of 50 cars of the same 
make and model is randomly assigned Brand A tires or Brand B tires. 


b. To test the effect of background music on productivity, factory workers are observed. 


For one month, each subject works without music. For another month, the subject 


works while listening to music on an MP3 player. The month in which each subject 
listens to music is determined by a coin toss. 


c. How do young adults look back on adolescent romance? Investigators interviewed a 
random sample of 40 couples in their mid-twenties. The female and male partners were 
interviewed separately. Each was asked about his or her current relationship and also 
about a romantic relationship that lasted at least 2 months when they were aged 15 or 
16. One response variable was a measure on a numerical scale of how much the 
attractiveness of the adolescent partner mattered. You want to find out how much men 
and women differ on this measure. 


92. Two samples or paired data? In each of the following settings, decide whether you 
should use two-sample t procedures to perform inference about a difference in means or 
paired t procedures to perform inference about a mean difference. Explain your choice.*2 


a. To compare the average weight gain of pigs fed two different diets, nine pairs of pigs 
were used. The pigs in each pair were littermates. A coin toss was used to decide which 
pig in each pair got Diet A and which got Diet B. 


b. Separate random samples of male and female college professors are taken. We wish to 
compare the average salaries of male and female teachers. 


c. To test the effects of a new fertilizer, 100 plots are treated with the new fertilizer, and 
100 plots are treated with another fertilizer. A computer’s random number generator is 
used to determine which plots get which fertilizer. 


93. Have a ball! Can students throw a baseball farther than a softball? To find out, 
researchers conducted a study involving 24 randomly selected students from a large high 
school. After warming up, each student threw a baseball as far as he or she could and 
threw a softball as far as he she could, in a random order. The distance in yards for each 
throw was recorded. Here are the data, along with the difference (Baseball — Softball) in 
distance thrown, for each student: 


Student Baseball Softball Difference (Baseball — Softball) 
1 65 ow 8 
2 90 58 32 
3 75 66 9 
4 {3 61 12 
5 79 65 14 
6 68 56 12 
ri 58 53 5 
8 41 41 0 
9 56 44 12 

10 70 65 
11 64 Bi 
12 62 60 


13 73 55 18 
14 50 53 = 
15 63 34 9 
16 48 42 6 
i? 34 a2 2 
18 49 48 A 
19 48 45 3 
20 68 67 1 
21. 30 27 3 
22 26 25 1 
23 28 25 3 
24 26 aL =5 


a. Explain why these are paired data. 


b. A boxplot of the differences is shown. Explain how the graph gives some evidence that 
students like these can throw a baseball farther than a softball. 
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Difference (Baseball — Softball) 
in distance thrown (yd) 


c. State appropriate hypotheses for performing a test about the true mean difference. Be 
sure to define any parameter(s) you use. 


d. Explain why the Normal/Large Sample condition is not met in this case. 
IO re eve 
The mean difference (Baseball-Softball)is20 °°? ~ 2°-°*(Baseball — Softball) jy 
SOB cman meee. 

distance thrown for these 24 students is x diff=6.541526 °°? ~ °°°"*@airr = 6.54 yards. 
Is this a surprisingly large result if the null hypothesis is true? To find out, we can 
perform a simulation assuming that students have the same ability to throw a baseball 
and a softball. For each student, write the two distances thrown on different note cards. 
Shuffle the two cards and designate one distance to baseball and one distance to 


305, seein 
softball. Then subtract the two distances (Baseball—Softball) 1526 — ite aes 


(Baseball — Softball) Do this for all the students and find the simulated mean 
difference. Repeat many times. Here are the results of 100 trials of this simulation: 


-5 0 5 


Simulated mean difference (Baseball — Softball) 
in distance thrown (yd) 


e. Use the results of the simulation to estimate the P-value. What conclusion would you 
draw? 


94. Flight times Refer to Exercise 77. The pilot suspects that the Dubai-to-Doha outbound 


BOB etn ees 
flight typically takes longer. If so, the difference (Outbound—Return) 1526 — al 


in flight times will be positive on more days than it is negative. What if either flight is 
equally likely to take longer? Then we can model the outcome on a randomly selected day 
with a coin toss. Heads means the Dubai-to-Doha outbound flight lasts longer; tails means 
the Doha-to-Dubai return flight lasts longer. To imitate a random sample of 12 days, 
imagine tossing a fair coin 12 times. 


a. Find the probability of getting 11 or more heads in 12 tosses of a fair coin. 


b. The outbound flight took longer on 11 of the 12 days. Based on your result in part (a), 
what conclusion would you make about the Emirates pilot’s suspicion? 


Multiple Choice Select the best answer for Exercises 95-98. 


95. There are two common methods for measuring the concentration of a pollutant in fish 


96. 


tissue. Do the two methods differ, on average? You apply both methods to each fish in a 
random sample of 18 carp and use 


305 
i «ogee = 0.200 = 20.0% p) ac 6¢ 
a. the paired t test for pdiffis26 Lait. 


b. the one-sample z test for p. 


305 


c. the two-sample t test for p1—p2i526— 9700 = 2-01 — pe, 


305 


d. the two-sample z test for pl1—p2i526. °°" Map — Be. 


e. none of these. 


A study of the impact of caffeine consumption on reaction time was designed to correct 
for the impact of subjects’ prior sleep deprivation by dividing the 24 subjects into 12 pairs 
on the basis of the average hours of sleep they had had for the previous 5 nights. That is, 
the two with the highest average sleep were a pair, then the two with the next highest 
average sleep, and so on. One randomly assigned member of each pair drank 2 cups of 
caffeinated coffee, and the other drank 2 cups of decaf. Each subject’s performance on a 


standard reaction-time test was recorded. Which of the following is the correct check of 
the “Normal/Large Sample” condition for this significance test? 


I. 


II. 


III. 


d. 


e. 


Confirm graphically that the scores of the caffeine drinkers could have come from a 
Normal distribution. 


Confirm graphically that the scores of the decaf drinkers could have come from a 
Normal distribution. 


Confirm graphically that the differences in scores within each pair of subjects could 
have come from a Normal distribution. 


. Tonly 
. ILonly 


. I only 


I and II only 


I, I, and III 


Exercises 97 and 98 refer to the following setting. Researchers suspect that Variety A tomato 
plants have a different average yield than Variety B tomato plants. To find out, researchers 
randomly select 10 Variety A and 10 Variety B tomato plants. Then the researchers divide in 
half each of 10 small plots of land in different locations. For each plot, a coin toss determines 
which half of the plot gets a Variety A plant; a Variety B plant goes in the other half. After 
harvest, they compare the yield in pounds for the plants at each location. The 10 differences 
(Variety A — Variety B) in yield are recorded. A graph of the differences looks roughly 
symmetric and single-peaked with no outliers. The mean difference is x A-B=0.34 


305 
1526 
305 


1526 


= 0.200 = 20.0%F 5 _ = 


= 0.200 = 20.0% ? — 0,200 = 


= 0.34 and the standard deviation of the differences is sA-B=0.83 


305 
is = aes 20.0% 
Sap = 0.83 Tet UAB 1526 #A-B= the true mean difference 


(Variety A — Variety B) in yield for tomato plants of these two varieties. 


30 
97. A 95% confidence interval for pA — B1526 


a. 


5 a 
= 0.200 = 20.0% 14 B jc given by 


0.3421.96(0.83)i5% 0.200 = 20.00.34 = 1.96 (0.83) 


305 ee oe 0.83 
0.34#1.96(0.8310)i526 = 0.200 = 20.0960.34 = 1.96 ( a) 


Co 


305 pang on ned. d4+ 1.812 ( 
0.34+1.812(0.8310)is26 °°? ~ °°." ( a) 


0.3422,262(0.83)i5% = 0.200 = 20.00.34 + 2.262 (0.83) 


0.83 
0.200 = 20,0%0-34 + 2.262 (28) 


V10 


305 
0.3442.262(0.8310)1526 


0.200 = 20.0% Hy . 


98. The P-value oy a test of HO: pA-B= ones” Ha-B = 0 versus 


Ha: pA- B#0i526 0.200 = 20.0%H, : a—B # 0 is 0.227. Which of the following is the 
correct interpretation of this P-value? 


| _ 
a. The probability that pA-B=01526 ~ See = Sea BaD is 0.227, 


b. Given that the true mean difference (Variety A — Variety B) in yield for these two 
varieties of tomato plants is 0, the probability of getting a sample mean difference of 
0.34 is 0.227. 


c. Given that the true mean difference (Variety A — Variety B) in yield for these two 
varieties of tomato plants is 0, the probability of getting a sample mean difference of 
0.34 or greater is 0.227. 


d. Given that the true mean difference (Variety A — Variety B) in yield for these two 
varieties of tomato plants is 0, the probability of getting a sample mean difference 


305 . » 
greater than or equal to 0.34 or less than or equal to —0.341526 9200 = 0950.36 is 
0.227. 


e. Given that the true mean difference (Variety A — Variety B) in yield for these two 
varieties of tomato plants is not 0, the probability of getting a sample mean difference 


greater than or equal to 0.34 or less than or equal to —0. 3416967 0200 = 2095 —0.34 is 
0.227, 


Recycle and Review 


In each part of Exercises 99 and 100, state which inference procedure from Chapter 8, 9, or 10 
you would use. Be specific. For example, you might say, “Two-sample z test for the difference 
between two proportions.” You do not have to carry out any procedures. 


99. Which inference method? 


a. Drowning in bathtubs is a major cause of death in children less than 5 years old. A 
random sample of parents was asked many questions related to bathtub safety. Overall, 
85% of the sample said they used baby bathtubs for infants. Estimate the percent of all 
parents of young children who use baby bathtubs. 


b. How seriously do people view speeding in comparison with other annoying behaviors? 
A large random sample of adults was asked to rate a number of behaviors on a scale of 
1 (mo problem at all) to 5 (very severe problem). Do speeding drivers get a higher 
average rating than noisy neighbors? 


c. You have data from interviews with a random sample of students who failed to 
graduate from a particular college in 7 years and also from a random sample of students 
who entered at the same time and did graduate within 7 years. You will use these data 


to estimate the difference in the percents of students from rural backgrounds among 
dropouts and graduates. 


d. Do experienced computer-game players earn higher scores when they play with 
someone present to cheer them on or when they play alone? Fifty teenagers with 
experience playing a particular computer game have volunteered for a study. We 
randomly assign 25 of them to play the game alone and the other 25 to play the game 
with a supporter present. Each player’s score is recorded. 


100. Which inference method? 


a. A city planner wants to determine if there is convincing evidence of a difference in the 
average number of cars passing through two different intersections. He randomly 
selects 12 times between 6:00 a.m. and 10:00 p.m., and he and his assistant count the 
number of cars passing through each intersection during the 10-minute interval that 
begins at that time. 


b. Are more than 75% of Toyota owners generally satisfied with their vehicles? Let’s 
design a study to find out. We’|l select a random sample of 400 Toyota owners. Then 
we'll ask each individual in the sample, “Would you say that you are generally satisfied 
with your Toyota vehicle>” 


c. Are male college students more likely to binge drink than female college students? The 
Harvard School of Public Health surveys random samples of male and female 
undergraduates at four-year colleges and universities about whether they have engaged 
in binge drinking. 


d. A bank wants to know which of two incentive plans will most increase the use of its 
credit cards and by how much. It offers each incentive to a group of current credit card 
customers, determined at random, and compares the amount charged during the 
following 6 months. 


Exercises 101 and 102 refer to the following setting. Coaching companies claim that their 
courses can raise the SAT scores of high school students. Of course, students who retake the 
SAT without paying for coaching generally raise their scores. A random sample of students 
who took the SAT twice found 427 who were coached and 2733 who were uncoached.22 
Starting with their Verbal scores on the first and second tries, we have these summary statistics: 


Try 1 Try 2 Gain 
n x | SX | xX | SX | xX | SX 
305 305 305 305 305 5 
0.200 - 0.200 - 0.200 - 0.200 = ——= 0.200 = T526 = 9-200 = 20.0% 
1526 1526 1526 1526 1526 Rs 
_ _ = Sz 
r Sx H ») Sx r 
Coached 427 59 
Uncoached 2733 506 101 527 101 21 52 


101. Coaching and SAT scores Let’s first ask if students who are coached increased their 


scores significantly, on average. 


a. You could use the information on the Coached line to carry out either a two-sample t 
test comparing Try 1 with Try 2 or a paired t test using Gain. Which is the correct 


test? Why? 
b. Carry out the proper test. What do you conclude? 


102. Coaching and SAT scores What we really want to know is whether coached students 
improve more than uncoached students, on average, and whether any advantage is large 
enough to be worth paying for. Use the information above to answer these questions: 


a. How much more do coached students gain, on average, compared to uncoached 
students? Construct and interpret a 99% confidence interval. 


b. Does the interval in part (a) give convincing evidence that coached students gain 
more, on average, than uncoached students? Explain your answer. 


c. Based on your work, what is your opinion: Do you think coaching courses are worth 


paying for? 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Will using name-brand microwave popcorn result in a greater percentage of popped 
kernels than using store-brand microwave popcorn? To find out, Briana and Maggie 
randomly selected 10 bags of name-brand microwave popcorn and 10 bags of store-brand 
microwave popcorn. The chosen bags were arranged in a random order. Then each bag 
was popped for 3.5 minutes, and the percentage of popped kernels was calculated. The 
results are displayed in the following table. 


Name-brand 95 88 84 94 81 90 97 93 91 86 
Store-brand 91 89 82 82 di 78 84 86 86 90 


Do the data provide convincing evidence that using name-brand microwave popcorn will 
result in a greater mean percentage of popped kernels? 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tps6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 10 Review 


Section 10.1: Comparing Two Proportions 


In this section, you learned how to construct confidence intervals and perform significance tests 

for a difference between two proportions. Inference for a difference in proportions is based on 
BO ann meravein 

the sampling distribution of p41—p421526 ~ aaa 
305 

the sampling distribution of p41—p421526 © 


P2, When the conditions are met, 


0.200 = 20°), — P2 ig approximately Normal with a 


305 : , se _ en 
mean of pp41—p2=p1-p21526 © 0.200 = 20.0°Hp,—p, — P1 ~ P2 and a standard deviation of 
Scena ones. = y/ SA ae 
op\1-pA2=p1(1-p1)n1+p2(1—p2)n21526 Pp -p2 ~ Vm n2 


The conditions for inference about a difference in proportions are the same for confidence 
intervals and significance tests. The Random condition says that the data must be from two 
independent random samples or two groups in a randomized experiment. The 10% condition 
says that each sample size should be less than 10% of the corresponding population size when 
sampling without replacement. The Large Counts condition says that the number of successes 


and number of failures from each sample/group should be at least 10. That is, n1p/1 
305 305 305 
= 0.200 = 20.0%, H —— = 0.200 = 20.0%. _-“ ~ 0.200 = 20.0% a 
6 Py n1(1—p41)1526 0.200 == 20.0.%r, (1 — pj ), n2 p21526 0.200 na Po , 
305 305 
——-= 0.200 = 20.0%n» (1 — p = 0,200 = 20.0% 
n2(1—p/2)1526 °° nz (1 — 2) are all >101526 > 10, 


A confidence interval for a difference between two proportions provides an interval of 
plausible values for the true difference in proportions. The formula is 


305 
——= 0.200 = 20.0% 


(p41-p42)+z*p41(1-p41)n1+p/2(1—p’2)n2 1526 
.  .\, » /P1(i-b1) , Bg (1-2) 
(pb, —p2) +z —_ aa 

| ny ng 


The logic of confidence intervals, including how to interpret the confidence interval and the 
confidence level, is the same as it was in Chapter 8, when you first learned about confidence 
intervals. 


Likewise, a significance test for a difference between two proportions uses the same logic 
as the significance tests you learned about in Chapter 9. We start by assuming the null 


305 ; , e 
hypothesis is true and asking how likely it would be to get evidence for Hai526 — Hue So 


as strong as or stronger than the observed result in a study by chance alone. If it is plausible 
that a difference in proportions could be the result of sampling variability or the chance 
variation due to random assignment, we do not have convincing evidence that the alternative 
hypothesis is true. However, if the difference is too big to attribute to chance, there is 


convincing evidence that the alternative hypothesis is true. For a test of HO: p1—p2=0 
305 


——= 0.200 = 20.0% 77, . a ; eee 
1526 Ho : pi — p2 = 0 the standardized test statistic is 


(pap) pO-P) 

305 _ 0.200 = 20.0% \ a sara 

zZ=(p\1—p’2)-Op\(1-pA)n1+p\(1-p)n21526 2790 = 79.0%" 1 3 
305 


——= 0.200 = 20.0% 5 . 

where p/\1526 ae P is the combined (pooled) proportion of successes: 
. xX +X 
305 _ 0.200 = 20.0% ~ ny+no 
pA=X1+X2n1+n2152% 0 °° ~—O ny +ng 


Finally, you learned that the inference techniques used for analyzing a difference in 
proportions from two independent random samples work very well for analyzing a difference in 
proportions from two groups in a completely randomized experiment when the conditions are 
met. 


Section 10.2: Comparing Two Means 


In this section, you learned how to construct confidence intervals and perform significance tests 


for a difference between two means. Inference for a difference in means is based on the 
sampling distribution of x1-x 2806 =e 
sampling distribution of x1-x 2806 
mean of wx ix O= tee : 


20.0%". — =, eo 
©) — £2, When the conditions are met, the 


= 0.200 = 20.0%F, — Fo : 
“1 ~ #2 is (approximately) Normal with a 


0.200 = 20.0% Hz, —z_ = Mi ~ F2 and a standard deviation of 
P [My 5 &% 

ox 1x 2=012n1+022n2i5H6 = 0200 = 20.087 1-H = YT 

The conditions for inference about a difference in means are the same for confidence 

intervals and significance tests. The Random condition says that the data must be from two 

independent random samples or two groups in a randomized experiment. The 10% condition 

says that each sample size should be less than 10% of the corresponding population size when 

sampling without replacement. For each sample, the corresponding population distribution (or 


the true distribution of response to the treatment) is Normal or the sample size is large (n => 
305 


= 0.200 = 
1526 


aS 30). If either population (treatment) distribution has unknown shape and n < 
30, confirm that a graph of the sample data shows no strong skewness or outliers. 


As in Chapters 8 and 9, inference techniques for means are based on the t distributions. 
There are two options for calculating the number of degrees of freedom to use. The first option 
is to use technology to calculate the degrees of freedom. The second option is to use the smaller 


305 305 
——-= 0.200 = 20.0% ——= 0.200 = 20.0%). lent 
of n1-11526 m —1 and n2-11526 ‘n2 — 1, The technology option is 


preferred because it is more accurate and produces a larger number of degrees of freedom, 
resulting in narrower confidence intervals and smaller P-values. If you are using technology, 
always choose the unpooled option when performing inference about a difference between two 
means. 


A confidence interval for a difference between two means provides an interval of plausible 
values for the true difference in means. The formula is 


a 305 _ 9.200 = 20 Pca — £2) + t"4) —+— 
(x 1-x 2)+t*s12n1+s22n2152%6 0°" = 
Use a significance test to decide between two competing hypotheses about a true difference in 


ne 
means. For a test of HO: p1-p2=01526 70 = °° Ho = be 


Statistic is 


H2 = 0 the standardized test 


_ (Z1—Tq)—0 
305 ; ‘i 2 
- 9 20.0% cate Waa 
t=(x 1-x 2)—0s12n1+s22n215a6~ 700 = 20.0 my ng 


Section 10.3: Comparing Two Means: Paired Data 


In this section, you learned how to analyze paired data, which result from measuring the same 
quantitative variable twice for one individual or for two very similar individuals. Start by 


finding the difference between the values in each pair. Then make a graph of the differences. 
305, elas 
Use the mean difference x diffis2e "7"? ~ *°°”°*faisr and the standard deviation of the 
305 


= 0.200 = 20.0% ¢ ,., — 
differences sdiffi526 "Sdiff as summary statistics. 


You can perform inference about the true mean difference pdiff1526 “Haitt when 
the conditions are met. The conditions are the same for confidence intervals and significance 
tests. The Random condition says that paired data must come from a random sample from the 
population of interest or from a randomized experiment. The 10% condition says that the 


. 


305 
sample size ndiff1526 Naift should be less than 10% of the corresponding population 


of differences when sampling without replacement. The Normal/Large Sample condition says 


that the population distribution of differences is Normal or that the sample size is large (ndiff 
305 


> 30)1sa6— = 20.0%(naitt > 30) Tf the sample size is small and the population shape is 
unknown, graph the difference values to make sure there is no strong skewness or outliers. 


= 0.200 = 20.0% 


= 0.200 = 20.0% 


As in Chapters 8 and 9, inference techniques for means are based on the t distributions. The 
305, cianien 
appropriate number of degrees of freedom is df=ndiff—11526 ~ Tae = RAE = Tage — 1 


A confidence interval for a mean difference provides an interval of plausible values for the 


305 
= 0.200 = 20.0% pp. 1:06 ; 
true mean difference pdiff1526 Hdiff, The formula is 
= . diff 
_ 305 p29 < 200% Mt +t — 
x difftt«sdiffndiffis2e ~~" ~~" v “diff 


Use a significance test to decide between two competing hypotheses about a mean difference. 


305 , 
0.200 = 20.0% Hy, 


For a test of HO: pdiff=01526 ~ : Haitt = 9 the standardized test statistic is 


a i 
i S diff 
t=x diff—-Osdiffndiffisas~ °20°= 209% "diff 


To decide whether two-sample t procedures for a difference between two means or paired t 
procedures for a mean difference are appropriate, look at how the data were produced. 


305 
Difference in proportions p1 — p2is20. °°” ~ ONG — te 


Conditions . Random: The data come from two independent random samples or from 
two groups in a randomized experiment. 


305 
o 10%: When sampling without replacement, n1<0.10N11526 T= OF 


305 
and n2<0.10N21526~ 0:70 = 20-9% nz < 0.10N2 , 
e Large Counts: The counts of “successes” and “failures” in each sample or 
305 ; 305 7 : 
group—n1p’11526~ °0= 2.9% By, n1(1-p1)1526~ 92 2.0% (1 — P,), 


N2 pr22%- 0200-20092 Pa 2(1-p2)isa6 9200 = 20-0%nz (1 — P2)__are all at least 


10. 
Name Two-sample . interval for p1- ote = 0.200 = 20.0%p, — po 
(TI-83/84) (2- ee 
Formula (p*1-p"2)4z*p"1(1-p"1)n1+p2(1- _pr2yn2i536 = 9200 20.0% p, — p,) +2) BE) , a) 
Name Two-sample , test for p1- _p2igae = 0-200 = 20.0%p, — po 
(TI-83/84) is PropZTest) 
Null = 0.200 = 20.0% F7, - = 
hypothesis HO: p1-p2= Oise O° Pi 
Formula __(s)-0 


i 


=0.200= 20.0% — 2) , rt) 


Z=(p*1-p*2)—-0p*(1-p*)n1+p*(1- o\n2anee 


where 305 _ 7 -~ _ *i+X2 -yalue from standard 
p®=X1+X2n1+n21526 ~ 0290 == NRE = sina P 


Normal distribution 


Relevant 


Learning Target 


Describe the shape, center, and variability of the 
sampling distribution of 

305 - : 
pA —pA25eae 0.200 = 20.0%, — py: 
Determine whether the conditions are met for 
doing inference about a difference between two 
proportions. 


Construct and interpret a confidence interval for a 
difference between two proportions. 


Calculate the standardized test statistic and P- 
value for a test about a difference between two 
proportions. 


Perform a significance test about a difference 
between two proportions. 


Describe the shape, center, and variability of the 
sampling distribution of 

ee 0.200 = 20.0%F, — Fy° 

Determine whether the conditions are met for 
doing inference about a difference between two 
means. 


Construct and interpret a confidence interval for a 
difference between two means. 


Calculate the standardized test statistic and P- 
value for a test about a difference between two 
means. 


Perform a significance test about a difference 
between two means. 


Analyze the distribution of differences in a paired 
data set using graphs and summary statistics. 


Construct and interpret a confidence interval for a 
mean difference. 


Perform a significance test about a mean 
difference. 


Determine when it is appropriate to use paired t 
procedures versus two-sample t procedures. 
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R10.1 


Chapter 10 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R10.1 |») American-made cars Nathan and Kyle both work for the Department of Motor 


Vehicles (DMV), but they live in different states. In Nathan’s state, 80% of the registered 
cars are made by American manufacturers. In Kyle’s state, only 60% of the registered 
cars are made by American manufacturers. Nathan selects a random sample of 100 cars 
a state and Kyle selects a random sample of 70 cars in his state. Let p\N-pAK 


i526 ”?00 = 20.0""Dy — Px be the difference (Nathan’s state — Kyle’s state) in the sample 


proportion of cars made by American manufacturers. 


ne . 
a. What is the shape of the sampling distribution of p\N-p\Kis26 07"? ~ *°°"Py — Pc? 
Why? 


b. Find the mean of the sampling distribution. 
c. Calculate and interpret the standard deviation of the sampling distribution. 


R10.2 |») Facebook As part of the Pew Internet and American Life Project, researchers 


conducted two surveys. The first survey asked a random sample of 1060 U.S. teens about 
their use of social media. A second survey posed similar questions to a random sample of 
2003 U.S. adults. In these two studies, 71.0% of teens and 58.0% of adults used 


305, aaa 
Facebook.“ Let pT 1526 — ave = pF — the true proportion of all U.S. teens who use 


305, — 
Facebook and pA1526— 0.200 = 20-.°D 4 = the true proportion of all U.S. adults who use 


Facebook. Calculate and interpret a 99% confidence interval for the difference in the true 
proportions of U.S. teens and adults who use Facebook. 


R10.3 |») Treating AIDS The drug AZT was the first drug that seemed effective in delaying 


the onset of AIDS. Evidence for AZT’s effectiveness came from a large randomized 
comparative experiment. The subjects were 870 volunteers who were infected with HIV, 
the virus that causes AIDS, but did not yet have AIDS. The study assigned 435 of the 
subjects at random to take 500 milligrams of AZT each day and another 435 to take a 
placebo. At the end of the study, 38 of the placebo subjects and 17 of the AZT subjects 
had developed AIDS. 


305 
; a ——= 0.200 = 20.0% 
a. Do the data provide convincing evidence at the a=0.051526 a = 0.05 level 


that taking AZT lowers the proportion of infected people like the ones in this study 
who will develop AIDS in a given period of time? 


b. Describe a Type I error and a Type II error in this setting and give a consequence of 


each error. 


R10.4 |») Candles A company produces candles. Machine 1 makes candles with a mean 


length of 15 cm and a standard deviation of 0.15 cm. Machine 2 makes candles with a 
mean length of 15 cm and a standard deviation of 0.10 cm. A random sample of 49 


305 , ‘ or -_ 
candles is taken from each machine. Let x 1-x 2is20) *"?~ 7"°"%1 — Fo be the 
difference (Machine 1 — Machine 2) in the sample mean length of candles. Describe the 
P 305 
shape, center, and variability of the sampling distribution of x 1-x 21526— ae ane 
V1 —F 2. 


R10.5 |») Men versus women The National Assessment of Educational Progress (NAEP) 


Young Adult Literacy Assessment Survey aNadaa separate random samples of 840 
men and 1077 women aged 21 to 25 years.*2 The mean me standard deviation of scores 


05 


on the NAEP’s test of quantitative skills were x 1=272.4015 no 0.200 = 20.0%3, = 272.40 


- 0.200 = 20.0% aw FE 
and s1=59.21526 ‘81 = 59.2 for the men in the sample. For the women, the 
305 


>) » oy. 305 ) ) oF 
results were x 2=274.731520 700 = °8-9"°F_y = 274.73 and 52=57. sims a oe en 
$9 => 57.5. 


a. Construct and interpret a 90% confidence interval for the difference in mean score for 
male and female young adults. 


b. Based only on the interval from part (a), is there convincing evidence of a difference 
in mean score for male and female young adults? 


R10.6 |») Each day I am getting better in math A “subliminal” message is below our 


threshold of awareness but may nonetheless influence us. Can subliminal messages help 
students learn math? A group of 18 students who had failed the mathematics part of the 
City University of New York Skills Assessment Test agreed to participate in a study to 
find out. All received a daily subliminal message, flashed on a screen too rapidly to be 
consciously read. The treatment group of 10 students (assigned at random) was exposed 
to “Each day I am getting better in math.” The control group of 8 students was exposed 
to a neutral message, “People are walking on the street.” All 18 students participated in a 
summer program designed to improve their math skills, and all took the assessment test 
again at the end of the program. The following table gives data on the subjects’ scores 


before and after the program.42 
Treatment group Control group 
Pretest Posttest Difference Pretest Posttest Difference 
18 24 6 18 29 11 
18 25 7 24 29 5 
21 33 12 20 24 
18 29 11 18 26 8 


18 33 15 24 38 14 


a. Explain why a two-sample t test and not a paired t test is the appropriate inference 
procedure in this setting. 


b. The following boxplots display the differences in pretest and posttest scores for the 
students in the control (C) and treatment (T) groups. Write a few sentences comparing 
the performance of these two groups. 


|, =a 
'—_ i 


a a a (a (ce Ce (a (MC 
0 2 4 6 8 10 12 14 16 18 
Difference (Posttest — Pretest) 

in test score 


Group 


Treatment Control 


305 
c. Do the data provide convincing evidence at the a=0.01 1526 


significance level that subliminal messages help students like the ones in this study 
learn math, on average? 


11) — 20.0% 
0.200 20.04, — (0.01 


d. Can we generalize these results to the population of all students who failed the 
mathematics part of the City University of New York Skills Assessment Test? Why 
or why not? 


R10.7 |») On your mark In track, sprinters typically use starting blocks because they think it 


will help them run a faster race. To test this belief, an experiment was designed where 
each sprinter on a track team ran a 50-meter dash two times, once using starting blocks 
and once with a standing start. The order of the two different types of starts was 
determined at random for each sprinter. The times (in seconds) for 8 different sprinters 
are shown in the table. 


Sprinter With blocks Standing start 
cl 6.12 6.38 
2 6.42 6.52 
3 5.98 6.09 
4 6.80 6.72 
5 5.73 5.98 
6 6.04 6.27 
7 6.55 6.71 
8 6.78 6.80 


. Make a dotplot of the difference (Standing — Blocks) in 50-meter run time for each 
sprinter. What does the graph suggest about whether starting blocks are helpful? 


. Calculate the mean difference and the standard deviation of the differences. Explain 
why the mean difference gives some evidence that starting blocks are helpful. 


. Do the data provide convincing evidence that sprinters like these run a faster race 
when using starting blocks, on average? 


. Construct and interpret a 90% confidence interval for the true mean difference. 
Explain how the confidence interval gives more information than the test in part (b). 


Chapter 10 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T10.1 A study of road rage asked separate random samples of 596 men and 523 women 
about their behavior while driving. Based on their answers, each respondent was 
assigned a road rage score on a scale of 0 to 20. Are the conditions for performing a 
two-sample t test satisfied? 


a. Maybe; we have independent random samples, but we should look at the data to 
check Normality. 


b. No; road rage scores on a scale from 0 to 20 can’t be Normal. 
c. No; we don’t know the population standard deviations. 


d. Yes; the large sample sizes guarantee that the corresponding population 
distributions will be Normal. 


e. Yes; we have two independent random samples and large sample sizes. 


T10.2 Thirty-five people from a random sample of 125 workers from Company A admitted 
to using sick leave when they weren’t really ill. Seventeen employees from a random 
sample of 68 workers from Company B admitted that they had used sick leave when 
they weren’t ill. Which of the following is a 95% confidence interval for the difference 
in the proportions of workers at the two companies who would admit to using sick 
leave when they weren’t ill? 


= 0.200 = 20.0% 
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a. 0.03+(0.28)(0.72)125+(0.25)(0.75)681526 


{[0.28)(0.72) (0.25)(0.75) 


0.03 + VV 328 68 


305 . menscei 
b. 0.03+1.96(0.28)(0.72)125+(0.25)(0.75)68i526 °° = 0" 


0.03 + 1.96,/ —— + 


305 . mene 
c, 0.03+1.645(0.28)(0.72)125+(0.25)(0.75)68i526~ 700 = 20.0% 


0.03 + 1.645,/ + <= 
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d. 0.03+1.96(0.269)(0.731)125+(0.269)(0.731)68i526 "= *°" 


/ (0.269)(0.731) ‘. (0.269)(0.731) 
0.03 + 1.96)/ + 


305, sacs 
e. 0.08+1.645(0.269)(0.731)125+(0.269)(0.731)68i526 | °° = 700% 


/(0.269)(0.731) (0.269)(0.731) 


0.03 + 1.645,/———— + 


T10.3 The power takeoff driveline on tractors used in agriculture can be a serious hazard to 
operators of farm equipment. The driveline is covered by a shield in new tractors, but 
the shield is often missing on older tractors. Two types of shields are the bolt-on and 
the flip-up. It was believed that the bolt-on shield was perceived as a nuisance by the 
operators and deliberately removed, but the flip-up shield is easily lifted for inspection 
and maintenance and may be left in place. In a study by the U.S. National Safety 
Council, random samples of older tractors with both types of shields were taken to see 
what proportion of shields were removed. Of 183 tractors designed to have bolt-on 
shields, 35 had been removed. Of the 136 tractors with flip-up shields, 15 were 


removed. We wish to perform a test of HO: pB=pF1526 0.200 = 20.0% Hy : PB = PF versus 


305 305 
——= 0.200 = 20.0% + on > vp —— = 0.200 = 20.0% 
Ha: pB>pF 1526 H, : PB > Pr, where pB 1526 PB and pF 


305 _ 6.200 = 20.0% 
mama aaa 1, 


F are the proportions of all tractors with the bolt-on and flip-up 
shields removed, respectively. Which of the following is not a condition for performing 
the significance test? 

a. Both populations are Normally distributed. 
b. The data come from two independent samples. 
c. Both samples were chosen at random. 
d. The counts of successes and failures are large enough to use Normal calculations. 
e. Both populations are more than 10 times the corresponding sample sizes. 
305, intica 

T10.4 A quiz question gives random samples of n=101526 0.200= 20-9" = 10 observations 

from each of two Normally distributed populations. Tom uses a table of t distribution 

critical values and 9 degrees of freedom to calculate a 95% confidence interval for the 

difference in the two population means. Janelle uses her calculator’s two-sample t 

interval with 16.87 degrees of freedom to compute the 95% confidence interval. 


Assume that both students calculate the intervals correctly. Which of the following is 
true? 


a. Tom’s confidence interval is wider. 
b. Janelle’s confidence interval is wider. 


Both confidence intervals are the same width. 


a 9 


There is insufficient information to determine which confidence interval is wider. 


e. Janelle made a mistake; degrees of freedom has to be a whole number. 


Exercises T10.5 and T10.6 refer to the following setting. A researcher wished to compare the 
average amount of time spent in extracurricular activities by high school students in a suburban 
school district with that in a school district of a large city. The researcher obtained an SRS of 
60 high school students in a large suburban school district and found the mean time spent in 
extracurricular activities per week to be 6 hours with a standard deviation of 3 hours. The 
researcher also obtained an independent SRS of 40 high school students in a large city school 


district and found the mean time spent in extracurricular activities per week to be 5 hours with 
a standard deviation of 2 hours. Suppose that the researcher decides to carry out a significance 


305 
test of HO: psuburban=icityis26 97° = 20.0% : 


alternative. 


Msuburban ~ Meity versus a two-sided 


T10.5 Which is the correct standardized test statistic? 


_ —_ (6-5)-0 
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a. Z=(6-5)—0360+2401526 = 0.200 = 20.0% \ Go * 40 
, — _(6—-5)-0 
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wna 2 9 oF 7 a 2 
c. t=(6-5)-0360+240i526~ 70° = 70.0% Van” a0 
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d. t=(6—5)-0360+2401526~ °-200= 20.0% = y wo Ta 
i (6—5)—0 
ae 9 0) 0% [32 22 
e. t=(6-5)-03260+22401526  °700= 200% = Vw" a 


T10.6 The P-value for the test is 0.048. A correct conclusion is to 


305 _ 0.200 = 20.04 
a. fail to reject H01526 ~ ~ 20.0 Hy because 0.048<a=0. Osine= 


0.048 < a = 0.05, There is convincing evidence of a difference in the average time 
spent on extracurricular activities by students in the suburban and city school 
districts. 


0.200 = 20.0% 


305 nn 305 
b. fail to reject H01526 — 0.200 = 20.0 Hy because 0.048<a=0.051526 


0.048 < a = 0.05, There is not convincing evidence of a difference in the average 
time spent on extracurricular activities by students in the suburban and city school 
districts. 


= 0.200 = 20.0% 


305 _ 0.200 = 20.04 
c. fail to reject H01526 ~ ~ 20.0 Hy because 0.048<a=0. osinas 


0.048 < a = 0.05, There is convincing evidence that the average time spent on 
extracurricular activities by students in the suburban and city school districts is the 
same. 


= 0.200 = 20.0% 


305, 305 
d. reject H01526 — 0.200 = 20.0 Hy because 0.048<ca=0.051526 


0.048 < a = 0.05, There is not convincing evidence of a difference in the average 
time spent on extracurricular activities by students in the suburban and city school 
districts. 


0.200 = 20.0% 


305 305 
: —— = 0.200 = 20.0% ——= 
e. reject H01526 °Ho because 0.048<a=0.051526 


0.048 < a = 0.05, There is convincing evidence of a difference in the average time 
spent on extracurricular activities by students in the suburban and city school 


0.200 = 20.0% 


districts. 


T10.7 At a baseball game, 42 of 65 randomly selected people own an iPod. At a rock concert 
occurring at the same time across town, 34 of 52 randomly selected people own an 
iPod. A researcher wants to test the claim that the proportion of iPod owners at the two 


venues is different. A 90% confidence interval for the difference (Game — Concert) in 
ee | 
population proportions is (—0.154, 0.138) 1526 ~ 0.200 = 20.0%(—0.154, 0.138) Which of the 


following gives the correct outcome of the researcher’s test of the claim? 


a. Because the confidence interval includes 0, the researcher can conclude that the 
proportion of iPod owners at the two venues is the same. 


ane ane 
b. Because the center of the interval is —0.0081526 °°"? ~ *°-°°—0.008 | the researcher 


can conclude that a higher proportion of people at the rock concert own iPods than at 
the baseball game. 


c. Because the confidence interval includes 0, the researcher cannot conclude that the 
proportion of iPod owners at the two venues is different. 


d. Because the confidence interval includes more negative than positive values, the 
researcher can conclude that a higher proportion of people at the rock concert own 
iPods than at the baseball game. 


e. The researcher cannot draw a conclusion about a claim without performing a 
significance test. 


T10.8 An SRS of size 100 is taken from Population A with proportion 0.8 of successes. An 
independent SRS of size 400 is taken from Population B with proportion 0.5 of 
successes. The sampling distribution of the difference (A — B) in sample proportions 
has what mean and standard deviation? 


a. mean=0.3;3008 ~ 0.200 = 20.0%mean = 0.3; standard deviation =1,305m8 =a = abe 
— SB | 
305 : 305 7 
b. mean=0.3;1526 — 0.200 = 20.0mean = 0.3; standard deviation =0.40i520 70? = 70.9 
= 0.40 
C. mean=0.3:3858 = 0.200 = 20.0“mean = 0.3; standard deviation -0.047 i538 = 280 = 2000 
= 0.047 
305 : 305 . 
d. mean=0.3;1526 0.200 = 20.0mean = 0.3; standard deviation =0.00221526 "20° = 20.0% 
= 0.0022 
305 a 305 . 
e. mean=0.3;1526 0.200 = 20.0mean = 0.3; standard deviation =0.00021526 "0 = 20.0% 
= 0.0002 


T10.9 Are TV commercials louder than their surrounding programs? To find out, researchers 
collected data on 50 randomly selected commercials in a given week. With the 
television’s volume at a fixed setting, they measured the maximum loudness of each 


commercial and the maximum loudness in the first 30 seconds of regular programming 
that followed. Assuming conditions for inference are met, the most appropriate method 
for answering the question of interest is 


a. atwo-sample t test for a difference in means. 

b. atwo-sample t interval for a difference in means. 
c. a paired t test for a mean difference. 

d. a paired t interval for a mean difference. 

e. a two-sample z test for a difference in proportions. 


T10.10 Researchers want to evaluate the effect of a natural product on reducing blood 
pressure. They plan to carry out a randomized experiment to compare the mean 
reduction in blood pressure of a treatment (natural product) group and a placebo group. 

= 0.200 = 20.0% 


Then they will use the data to perform a test of HO: pT-pP= =v 26 
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Hy: wr — hp = 9 versus Ha: pT-pP>01526 
= 0.200 = 20.0% yp 


0.200 = 20.0% - — 
H,: ur — hp > 9 where uT 


= the true mean reduction in blood pressure when taking the natural 


305 
—— = 0.200 = 20.0% 4 p a, 
product and [P1526 P= the true mean reduction in blood pressure when 


taking a placebo for subjects like the ones in the experiment. The researchers would like 
to detect whether the natural product reduces blood pressure by at least 7 points more, 
on average, than the pcre: If groups of size 50 are used in the experiment, a two- 


sample ¢ test using a=0. O1i667 0.200 = 20.0% — 0.01 will have a power of 80% to detect 


a 7-point difference in mean blood pressure reduction. If the researchers want to be able 
to detect a 5-point difference instead, then the power of the test 


a. would be less than 80%. 
b. would be greater than 80%. 
would still be 80%. 


a 9 


could be either less than or greater than 80%. 


e. would vary depending on the standard deviation of the data. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T10.11 Researchers wondered whether maintaining a patient’s body temperature close to 
normal by heating the patient during surgery would affect rates of infection of wounds. 
Patients were assigned at random to two groups: the normothermic group (core 


JUO 
bee ——= 0.200 = 20.0% 
temperatures were maintained at near normal, 36.5°C 1526 


heating blankets) and the San group (core temperatures were allowed to 


36.5°C, using 


= 0.200 = 


decrease to about 34. S°Cisas 26 *0.0°34.5°C), If keeping patients warm during 


surgery alters the chance of infection, patients in the two groups should show a 
difference in the average length of their hospital stays. Here are summary statistics on 
hospital stay (in number of days) for the two groups: 


Group n = wal = 0.200 = 20.0%— ki 0.200 = 20.0% ¢ 
X 1526 z SX 1526 2 

Normothermic 104 12.1 4.4 

Hypothermic 96 14.7 6.5 


a. Construct and interpret a 95% confidence interval for the difference in the true mean 
length of hospital stay for normothermic and hypothermic patients like these. 


b. Does your interval in part (a) suggest that keeping patients warm during surgery 
affects the average length of patients’ hospital stays? Justify your answer. 


c. Interpret the meaning of “95% confidence” in the context of this study. 


T10.12 A random sample of 100 of last year’s model of a certain popular car found that 20 
had a specific minor defect in the brakes. The automaker adjusted the production 
process to try to reduce the proportion of cars with the brake problem. A random 
sample of 350 of this year’s model found that 50 had the minor brake defect. 


a. Was the company’s adjustment successful? Carry out an appropriate test to support 
your answer. 


b. Based on your conclusion in part (a), which mistake—a Type I error or a Type II 
error—could have been made? Describe a possible consequence of this error. 


T10.13 “I can’t get through my day without coffee” is a common statement from many 
college students. They assume that the benefits of coffee include staying awake during 
lectures and remaining more alert during exams and tests. Students in a statistics class 
designed an experiment to measure memory retention with and without drinking a cup 
of coffee 1 hour before a test. This experiment took place on two different days in the 
same week (Monday and Wednesday). Ten students were used. Each student received 
no coffee or one cup of coffee 1 hour before the test on a particular day. The test 
consisted of a series of words flashed on a screen, after which the student had to write 
down as many of the words as possible. On the other day, each student received a 
different amount of coffee (none or one cup). 


a. One of the researchers suggested that all the subjects in the experiment drink no 
coffee before Monday’s test and one cup of coffee before Wednesday’s test. Explain 
to the researcher why this is a bad idea and suggest a better method of deciding 
when each subject receives the two treatments. 


b. The researchers actually used the better method of deciding when each subject 
receives the two treatments that you identified in part (a). For each subject, the 
number of words recalled when drinking no coffee and when drinking one cup of 
coffee is recorded in the table. Carry out an appropriate test to determine whether 
there is convincing evidence that drinking coffee improves memory, on average, for 


students like the ones in this study. 


Student No cup One cup 
1 24 25 
2 30 31 
3 22 23 
4 24 24 
5 26 27 
6 23 25 
7 26 28 
8 20 20 
9 27 27 


10 28 30 


Chapter 10 Project Which Costs More: Diesel or 
Unleaded? 


When buying or leasing a new car, one of the factors that customers consider is the type of fuel 
it uses. Some people prefer vehicles that use diesel fuel, while others favor vehicles that use 
regular unleaded gasoline. Which of these two types of fuel costs more at the pump, on 
average? Does the answer to this question depend on the state where you live? 


Researchers collected data on the price per gallon (in dollars) of diesel fuel and regular 
unleaded gasoline from a random sample of gas stations in 6 states: Colorado, Illinois, Indiana, 
Kansas, Missouri, and Ohio. The file gas prices ch 10 project.xls, which can be accessed from 
the book’s website at highschool.bfwpub.com/tps6e, contains data from a total of 82 gas 
stations. Download the file to a computer for further analysis using the application specified by 
your teacher. Use the file provided to answer the following questions. 


. Start by calculating the difference (Diesel — Unleaded) in gas prices for all 82 stations, and 
store these values in a new column titled Difference. 


’. Make a graph to display the distribution of difference in gas prices. Describe the shape, 
center, and variability of the distribution. Are there any outliers? 


3. Construct and interpret a 95% confidence interval for the true mean difference. Does the 
interval provide convincing evidence of a difference in the mean price per gallon of diesel 
fuel and regular unleaded gasoline? 


|. Make a graph that compares the difference (Diesel — Unleaded) in gas prices for the 6 states 
from which the data were collected. Write a few sentences comparing the distributions. 


». We might expect the mean difference (Diesel — Unleaded) in gas prices to be the same for 
adjacent states. Choose 2 adjacent states from the data set. Then carry out an appropriate test 
to see if the data provide convincing evidence to contradict this expectation. 


Cumulative AP® Practice Test 3 


Section I: Multiple Choice Choose the best answer. 


AP3.1 Suppose the probability that a softball player gets a hit in any single at-bat is 0.300. 
Assuming that her chance of getting a hit on a particular time at bat is independent of 
her other times at bat, what is the probability that she will not get a hit until her fourth 
time at bat in a game? 
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AP3.2 A survey asked a random sample of U.S. adults about their political party affiliation 
and how long they thought they would survive compared to most people in their 
community if an apocalyptic disaster were to strike. The responses are summarized in 
the following two-way table. 


Political affiliation 


Democrat Independent | Republican Total 

Longer 314 

Expected = About as long 416 
survival 

length Not as long 115 

Not sure 153 

Total 998 


Suppose we select one of the survey respondents at random. Which of the following 
probabilities is the largest? 


a. P(Independent and Longer) 
b. P(Independent or Not as long) 


305 

——-= 0.200 = 20.0% 
c. P(Democrat |1526 | Not as long) 
305 


d. P(About as long [1526 °°? > 


aa | Democrat) 


e. P(About as long) 


AP3.3 Sports Illustrated planned to ask a random sample of Division I college athletes, “Do 
you believe performance-enhancing drugs are a problem in college sports?” Which of 
the following is the smallest number of athletes that must be interviewed to estimate the 


true proportion who believe performance-enhancing drugs are a problem within +2% 


305 
= 0.200 = 20.0% 507 _ 
1526 +2% with 90% confidence? 


c. 1680 
d. 1702 
e. 2401 


AP3.4 The distribution of grade point averages (GPAs) for a certain college is approximately 
Normal with a mean of 2.5 and a standard deviation of 0.6. The minimum possible GPA 
is 0.0 and the maximum possible GPA is 4.33. Any student with a GPA less than 1.0 is 
put on probation, while any student with a GPA of 3.5 or higher is on the dean’s list. 
About what percent of students at the college are on probation or on the dean’s list? 


a. 0.6 
b. 4.7 
c. 5.4 
d. 94.6 
e. 95.3 
AP3.5 Which of the following will increase the power of a significance test? 
a. Increase the Type II error probability. 
b. Decrease the sample size. 


c. Reject the null hypothesis only if the P-value is less than the significance level. 
305 


eer ——= 0.200 = 20.0% 
d. Increase the significance level 01526 a, 


e. Select a value for the alternative hypothesis closer to the value of the null 
hypothesis. 


AP3.6 You can find some interesting polls online. Anyone can become part of the sample just 
by clicking on a response. One such poll asked, “Do you prefer watching first-run 
movies at a movie theater, or waiting until they are available to watch at home or ona 
digital device?” In all, 8896 people responded, with only 12% (1118 people) saying 
they preferred theaters. You can conclude that 


a. American adults strongly prefer watching movies at home or on their digital devices. 
b. the high nonresponse rate prevents us from drawing a conclusion. 


c. the sample is too small to draw any conclusion. 


d. the poll uses voluntary response, so the results tell us little about all American 
adults. 


e. American adults strongly prefer seeing movies at a movie theater. 


AP3.7 A certain candy has different wrappers for various holidays. During Holiday 1, the 
candy wrappers are 30% silver, 30% red, and 40% pink. During Holiday 2, the 
wrappers are 50% silver and 50% blue. In separate random samples of 40 candies on 
Holiday 1 and 40 candies on Holiday 2, what are the mean and standard deviation of the 
total number of silver wrappers? 


a. 32, 18.4 
b. 32, 6.06 
c. 32, 4.29 
d. 80, 18.4 
e. 80, 4.29 


AP3.8 A beef rancher randomly sampled 42 cattle from her large herd to obtain a 95% 
confidence interval for the mean weight (in pounds) of the cattle in the herd. The 
interval obtained was (1010, 1321). If the rancher had used a 98% confidence interval 
instead, the interval would have been 


a. wider with less precision than the original estimate. 

b. wider with more precision than the original estimate. 

c. wider with the same precision as the original estimate. 
d. narrower with less precision than the original estimate. 
e. narrower with more precision than the original estimate. 


AP3.9 School A has 400 students and School B has 2700 students. A local newspaper wants 
to compare the distributions of SAT scores for the two schools. Which of the following 
would be the most useful for making this comparison? 


a. Back-to-back stemplots for A and B 
b. A scatterplot of A versus B 
c. Two dotplots for A and B drawn on the same scale 
d. Two relative frequency histograms of A and B drawn on the same scale 
e. Two bar graphs for A and B drawn on the same scale 
AP3.10 Let X represent the outcome when a fair six-sided die is rolled. For this random 


305 ; a 305 see? 
variable, pX=3,51626 9 ~ °*#x = 8.5 and oX=1.71 1826 = ox = 1.71, TF 
the die is rolled 100 times, what is the approximate probability that the sum is at least 
375? 


a. 0.0000 


b. 0.0017 
c. 0.0721 
d. 0.4420 
e. 0.9279 


AP3.11 An agricultural station is testing the yields for six different varieties of seed corn. The 
station has four large fields available, located in four distinctly different parts of the 
county. The agricultural researchers consider the climatic and soil conditions in the four 
parts of the county as being quite different, but are reasonably confident that the 
conditions within each field are fairly similar throughout. The researchers divide each 
field into six sections and then randomly assign one variety of corn seed to each section 
in that field. This procedure is done for each field. At the end of the growing season, the 
corn will be harvested, and the yield (measured in tons per acre) will be compared. 
Which one of the following statements about the design is correct? 


a. This is an observational study because the researchers are watching the corn grow. 
b. This a randomized block design with fields as blocks and seed types as treatments. 


c. This is a randomized block design with seed types as blocks and fields as treatments. 


jae 


. This is a completely randomized design because the six seed types were randomly 
assigned to the four fields. 


e. This is a completely randomized design with 24 treatments—6 seed types and 4 
fields. 


AP3.12 The correlation between the heights of fathers and the heights of their grownup sons, 
both measured in inches, is r=0. S21 26 "200 = 20.0%, — 0.52. If fathers’ heights were 

measured in feet instead, the correlation between heights of fathers and heights of sons 

would be 

a. much smaller than 0.52. 

b. slightly smaller than 0.52. 

c. unchanged; equal to 0.52. 

d. slightly larger than 0.52. 

e. much larger than 0.52. 


AP3.13 A random sample of 200 New York State voters included 88 Republicans, while a 
random sample of 300 California voters produced 141 Republicans. Which of the 
following represents the 95% confidence interval for the true difference in the 
proportion of Republicans in New York State and California? 
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AP3.14 Which of the following is not a property of a binomial setting? 
a. Outcomes of different trials are independent. 
b. The chance process consists of a fixed number of trials, n. 
c. The probability of success is the same for each trial. 
d. Trials are repeated until a success occurs. 
e. Each trial can result in either a success or a failure. 


AP3.15 Mrs. Woods and Mrs. Bryan are avid vegetable gardeners. They use different 
fertilizers, and each claims that hers is the best fertilizer to use when growing tomatoes. 
Both agree to do a study using the weight of their tomatoes as the response variable. 
Each planted the same varieties of tomatoes on the same day and fertilized the plants on 
the same schedule throughout the growing season. At harvest time, each randomly 
selects 15 tomatoes from her garden and weighs them. After performing a two-sample t 


305 ; : or 
test on the difference in mean weights of tomatoes, they get t=5.241526 — eee Sas 


305 9 20.0% 
t = 5.24 and P=0.0008i526 °° = *°°*P = 0.0008, Can the gardener with the larger 
mean claim that her fertilizer caused her tomatoes to be heavier? 


a. Yes, because a different fertilizer was used on each garden. 
b. Yes, because random samples were taken from each garden. 
c. Yes, because the P-value is so small. 


d. No, because the condition of the soil in the two gardens is a potential confounding 
variable. 
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AP3.16 The Environmental Protection Agency (EPA) is charged with monitoring industrial 
emissions that pollute the atmosphere and water. So long as emission levels stay within 
specified guidelines, the EPA does not take action against the polluter. If the polluter 


violates regulations, the offender can be fined, forced to clean up the problem, or 
possibly closed. Suppose that for a particular industry the acceptable emission level has 
been set at no more than parts per million (S ppm). The null and alternative 

05 305 9 or 
nypotneses are HO: p= saad 26 0-200 = 20.0% Ho: = 5 versus Ha: U>51526 See Sa 
Hq: > 5 Which of the following describes a Type II error? 


a. The EPA fails to find convincing evidence that emissions exceed acceptable limits 
when, in fact, they are within acceptable limits. 


b. The EPA finds convincing evidence that emissions exceed acceptable limits when, 
in fact, they are within acceptable limits. 


c. The EPA fails to find convincing evidence that emissions exceed acceptable limits 
when, in fact, they do exceed acceptable limits. 


d. The EPA finds convincing evidence that emissions exceed acceptable limits when, 
in fact, they do exceed acceptable limits. 


e. The EPA takes more samples to ensure that they make the correct decision. 
AP3.17 Which of the following statements is false? 


a. A measure of center alone does not completely summarize a distribution of 
quantitative data. 


b. If the original measurements are in inches, converting them to centimeters will not 
change the mean or standard deviation. 


c. One of the disadvantages of a histogram is that it doesn’t show each data value. 


d. Ina quantitative data set, adding a new data value equal to the mean will decrease 
the standard deviation. 


e. If a distribution of quantitative data is strongly skewed, the median and interquartile 
range should be reported rather than the mean and standard deviation. 


AP3.18 A 96% confidence interval for the proportion of the labor force that is unemployed in 
a certain city is (0.07, 0.10). Which of the following statements is true? 


a. The probability is 0.96 that between 7% and 10% of the labor force is unemployed. 


b. About 96% of the intervals constructed by this method will contain the true 
proportion of the labor force that is unemployed in the city. 


c. In repeated samples of the same size, there is a 96% chance that the sample 
proportion will fall between 0.07 and 0.10. 


d. The true rate of unemployment in the labor force lies within this interval 96% of the 
time. 


e. Between 7% and 10% of the labor force is unemployed 96% of the time. 


AP3.19 A large toy company introduces many new toys to its product line each year. The 
company wants to predict the demand as measured by y, first-year sales (in millions of 


dollars) using x, awareness of the product (as measured by the percent of customers 
who had heard of the product by the end of the second month after its introduction). A 
random sample of 65 new products was taken, and a correlation of 0.96 was computed. 
Which of the following is true? 


a. The least-squares regression line accurately predicts first-year sales 96% of the time. 


b. About 92% of the time, the percent of people who have heard of the product by the 
end of the second month will correctly predict first-year sales. 

c. About 92% of first-year sales can be accounted for by the percent of people who 
have heard of the product by the end of the second month. 

d. For each increase of 1% in awareness of the new product, the predicted sales will go 
up by 0.96 million dollars. 


e. About 92% of the variation in first-year sales can be accounted for by the least- 
squares regression line with the percent of people who have heard of the product by 
the end of the second month as the explanatory variable. 


AP3.20 Final grades for a class are approximately Normally distributed with a mean of 76 and 
a standard deviation of 8. A professor says that the top 10% of the class will receive an 
A, the next 20% a B, the next 40% a C, the next 20% a D, and the bottom 10% an F. 
What is the approximate maximum grade a student could attain and still receive an F for 
the course? 


a. 70 
b. 69.27 
c. 65.75 
d. 62.84 
e. 57 


AP3.21 National Park rangers keep data on the bears that inhabit their park. Here is a 
histogram of the weights of 143 bears measured in a recent year: 
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Which of the following statements is correct? 


a. The median will lie in the interval (140, 180), and the mean will lie in the interval 


(180, 220). 


b. The median will lie in the interval (140, 180), and the mean will lie in the interval 
(260, 300). 


c. The median will lie in the interval (100, 140), and the mean will lie in the interval 
(180, 220). 


d. The mean will lie in the interval (140, 180), and the median will lie in the interval 
(260, 300). 


e. The mean will lie in the interval (100, 140), and the median will lie in the interval 
(180, 220). 


AP3.22 A random sample of size n will be selected from a population, and the proportion p/ 
305 


1526 of those in the sample who have a Facebook page will be calculated. 
How would the margin of error for a 95% confidence interval be affected if the sample 
size were increased from 50 to 200 and the sample proportion of people who have a 
Facebook page is unchanged? 


0.200 = 20.0% 


a. It remains the same. 
b. It is multiplied by 2. 
c. It is multiplied by 4. 
d. It is divided by 2. 
e. It is divided by 4. 


AP3.23 A scatterplot and a least-squares regression line are shown in the figure. What effect 
does point P have on the slope of the regression line and the correlation? 


Point P increases the slope and increases the correlation. 


oT p 


. Point P increases the slope and decreases the correlation. 
c. Point P decreases the slope and decreases the correlation. 
d. Point P decreases the slope and increases the correlation. 
e. No conclusion can be drawn because the other coordinates are unknown. 


AP3.24 The following dotplots show the average high temperatures (in degrees Celsius) for a 
sample of tourist cities from around the world. Both the January and July average high 
temperatures are shown. What is one statement that can be made with certainty from an 


analysis of the graphical display? 
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a. Every city has a larger average high temperature in July than in January. 


d. 


e. 


. The distribution of temperatures in July is skewed right, while the distribution of 


temperatures in January is skewed left. 


The median average high temperature for January is higher than the median average 
high temperature for July. 


There appear to be outliers in the average high temperatures for January and July. 


There is more variability in average high temperatures in January than in July. 


AP3.25 Suppose the null and alternative hypotheses for a significance test are defined as 
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AP3.26 A large university is considering the establishment of a schoolwide recycling 
program. To gauge interest in the program by means of a questionnaire, the university 
takes separate random samples of undergraduate students, graduate students, faculty, 
and staff. This is an example of what type of sampling design? 


jan 


a. Simple random sample 
b. 


C. 


Stratified random sample 


Convenience sample 


. Cluster sample 


Randomized block design 


AP3.27 Suppose the true proportion of people who use public transportation to get to work in 
the Washington, D.C., area is 0.45. In a simple random sample of 250 people who work 
in Washington, about how far do you expect the sample proportion to be from the true 
proportion? 

a. 0.4975 
b. 0.2475 
c. 0.0315 
d. 0.0009 


e. 0 


Questions 28 and 29 refer to the following setting. According to sleep researchers, if you are 
between the ages of 12 and 18 years old, you need 9 hours of sleep to function well. A simple 
random sample of 28 students was chosen from a large high school, and these students were 
asked how much sleep they got the previous night. The mean of the responses was 7.9 hours 
with a standard deviation of 2.1 hours. 


AP3.28 If we are interested in whether students at this high school are getting too little sleep, 
which of the following represents the appropriate null and alternative hypotheses? 
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AP3.29 Which of the following is the standardized test statistic for the hypothesis test? 
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AP3.30 Shortly before the 2012 presidential election, a survey was taken by the school 
newspaper at a very large state university. Randomly selected students were asked, 


“Whom do you plan to vote for in the upcoming presidential election?” Here is a two- 
way table of the responses by political persuasion for 1850 students: 


Political persuasion 


Democrat Republican | Independent Total 
Obama 1029 
Candidate of Romney 695 
choice —— Other ai 
Undecided 105 
Total 1850 


Which of the following statements about these data is true? 
a. The percent of Republicans among the respondents is 41%. 


b. The marginal relative frequencies for the variable choice of candidate are given by 
Obama: 55.6%; Romney: 37.6%; Other: 1.1%; Undecided: 5.7%. 


c. About 11.2% of Democrats reported that they planned to vote for Romney. 
d. About 44.6% of those who are undecided are Independents. 


e. The distribution of political persuasion among those for whom Romney is the 
candidate of choice is Democrat: 7.5%; Republican: 84.0%; Independent: 18.8%. 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


AP3.31 A researcher wants to determine whether or not a 5-week crash diet is effective over a 
long period of time. A random sample of 15 five-week crash dieters is selected. Each 
person’s weight (in pounds) is recorded before starting the diet and 1 year after it is 
concluded. Do the data provide convincing evidence that 5-week crash dieters weigh 
less, on average, 1 year after finishing the diet? 


Dieter 1 2 2 4 5 6 7 8 
Before 158 185 176 172 164 234 258 200 
After 163 182 188 150 161 220 235 191 
Dieter 9 10 11 12 13 14 15 
Before 228 246 198 221 236 255 231 
After 228 237 209 220 222 268 234 


AP3.32 Starting in the 1970s, medical technology has enabled babies with very low birth 
weight (VLBW, less than 1500 grams, or about 3.3 pounds) to survive without major 
handicaps. It was noticed that these children nonetheless had difficulties in school and 
as adults. A long-term study has followed 242 randomly selected VLBW babies to age 


20 years, along with a control group of 233 randomly selected babies from the same 
population who had normal birth weight.22 


a. Is this an experiment or an observational study? Why? 


b. At age 20, 179 of the VLBW group and 193 of the control group had graduated from 
high school. Do these data provide convincing evidence at the a=0.05 


S05 enn anne 
i526 0 = 7°." — 0.05 significance level that the graduation rate anong VLBW 


babies is less than for normal-birth-weight babies? 


AP3.33 A nuclear power plant releases water into a nearby lake every afternoon at 4:51 p.m. 
Environmental researchers are concerned that fish are being driven away from the area 
around the plant. They believe that the temperature of the water discharged may be a 
factor. The scatterplot shows the temperature of the water (in degrees Celsius) released 
by the plant and the measured distance (in meters) from the outflow pipe of the plant to 
the nearest fish found in the water on eight randomly chosen afternoons. 


Distance to nearest fish (m) 


15 20 25 30 35 40 
Temperature (°C) 


Here are computer output from a least-squares regression analysis on these data and a 
residual plot: 


Predictor Coef SE Coef T P 
Constant -73.64 15.48 -4.76 0.003 
Temperature 5.7188 0.5612 -10.19 0.000 

S = R - Sq = 94.5% R - Sq(adj) = 93.6% 


11.4175 


b. 
c. 
d. 
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Explain why a linear model is appropriate for describing the relationship between 
temperature and distance to the nearest fish. 


Write the equation of the least-squares regression line. Define any variables you use. 
Interpret the slope of the regression line. 


Compute the residual for the point (29, 78). Interpret this residual. 


AP3.34 The Candy Shoppe assembles gift boxes that contain 8 chocolate truffles and 2 
handmade caramel nougats. The truffles have a mean weight of 2 ounces with a 
standard deviation of 0.5 ounce, and the nougats have a mean weight of 4 ounces with a 
standard deviation of 1 ounce. The empty boxes have mean weight 3 ounces with a 
standard deviation of 0.2 ounce. 


a. 


Assuming that the weights of the truffles, nougats, and boxes are independent, what 
are the mean and standard deviation of the weight of a box of candy? 


. Assuming that the weights of the truffles, nougats, and boxes are approximately 


Normally distributed, what is the probability that a randomly selected box of candy 
will weigh more than 30 ounces? 


If five gift boxes are randomly selected, what is the probability that at least one of 
them will weigh more than 30 ounces? 


. If five gift boxes are randomly selected, what is the probability that the mean weight 


of the five boxes will be more than 30 ounces? 


AP3.35 An investor is comparing two stocks, A and B. She wants to know if over the long 
run, there is a significant difference in the return on investment as measured by the 
percent increase or decrease in the price of the stock from its date of purchase. The 
investor takes a random sample of 50 annualized daily returns over the past 5 years for 
each stock. The data are summarized in the table. 


Stock Mean return Standard deviation 
A 11.8% 12.9% 
B 7.1% 9.6% 


a. The investor uses the data to perform a two-sample ¢ test of HO: pA-pB=0 
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return for Stock A and »B1526 *0.0tB = the true mean annualized daily 
return for Stock B. The resulting P-value is 0.042. Interpret this value in context. 
What conclusion would you make? 


b. The investor believes that although the return on investment for Stock A usually 
exceeds that of Stock B, Stock A represents a riskier investment, where the risk is 
measured by the price volatility of the stock. The sample variance sx2 
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1526 Sz is a statistical measure of the price volatility and indicates how 


much an investment’s actual performance during a specified period varies from its 
average performance over a longer period. Do the price fluctuations in Stock A 
significantly exceed those of Stock B, as measured by their variances? State an 
appropriate set of hypotheses that the investor is interested in testing. 


c. To measure this, we will construct a test statistic defined as 
305 - 
_ ; ; =.= 0.200 = 20.0% 
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larger sample variance 


smaller sample variance 


Calculate the value of the F statistic using the information given in the table. Explain 
how the value of the statistic provides some evidence for the alternative hypothesis 
you stated in part (b). 


d. Two hundred simulated values of this test statistic, F, were calculated assuming that 
the two stocks have the same variance in daily price. The results of the simulation 
are displayed in the following dotplot. 
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Simulated value of F statistic 


Use these simulated values and the test statistic that you calculated in part (c) to 
determine whether the observed data provide convincing evidence that Stock A is a 
riskier investment than Stock B. Explain your reasoning. 


Inference for 


Distributions of Categorical Data 


Andrey Gudkov/Alamy 


INTRODUCTION 


In Chapter 9, we discussed tests for the proportion of successes in a single population. These 
tests were based on a single categorical variable with values that were divided into two 
categories: success and failure. Sometimes we want to perform a test for the distribution of a 
categorical variable with two or more categories. The chi-square test for goodness of fit allows 
us to determine whether a hypothesized distribution seems valid. This test is useful in a field 
like genetics, where the laws of probability give the expected proportion of outcomes in each 
category. It will also help us decide if the birthdays of NHL players are uniformly distributed 
throughout the year. 


In Chapter 10, we discussed tests for a difference in proportions for two populations or 
treatments. Sometimes we’d like to compare the distribution of a categorical variable for two or 
more populations or treatments, where the variable can have two or more categories. We can 
decide whether the distribution of a categorical variable differs for two or more populations or 
treatments using a chi-square test for homogeneity. This test will help us answer the question: 
Does background music influence customer purchases? 


Tests for homogeneity use data summarized in a two-way table. It is also possible to use the 
information in a two-way table to study the relationship between two categorical variables. The 
chi-square test for independence allows us to determine if there is convincing evidence of an 
association between two variables in a population, such as anger level and heart disease status. 


Here’s an activity that gives you a taste (pun intended) of what lies ahead. 


ACTIVITY The candy man can 


Ramon Rivera Moret 


Mars, Inc., is famous for its milk chocolate candies. Here’s what the company’s Consumer 
Affairs Department says about the distribution of color for M&M’S® Milk Chocolate 
Candies produced at its Hackettstown, New Jersey, factory: 


Brown: 12.5% — Red: 12.5% Yellow: 12.5% 
Green: 12.5% Orange: 25% —_ Blue: 25% 


The purpose of this activity is to investigate if the distribution of color in a large bag of 
M&M’S Milk Chocolate Candies differs from the claimed distribution. 


1. Your teacher will take a random sample of 60 M&M’S Milk Chocolate Candies from a 
large bag and give one or more pieces of candy to each student. As a class, count the 
number of candies of each color. Make a table on the board that summarizes these 
observed counts. 


2. How can you tell if the sample data give convincing evidence to dispute the company’s 
claim? Each team of 3 or 4 students should discuss this question and devise a formula for 
a test statistic that measures the difference between the observed and expected color 
distributions. The test statistic should yield a single number when the observed and 
expected values are plugged in. Also, larger differences between the observed and 
expected distributions should result in a larger value for the statistic. 


3. Each team will share its proposed test statistic with the class. Your teacher will then reveal 
a7 
how the chi-square test statisticy2X is calculated. 


4. Discuss as a class: If your sample is consistent with the company’s claimed distribution of 
M&M’S® Milk Chocolate Candies colors, will the value of 2x” be large or small? If 
your sample is not consistent with the company’s claimed color distribution, will the value 
of y2x" be large or small? 


5. Compute the value of the chi-square test statistic for the class’s data. 


We can use simulation to determine if your class’s chi-square test statistic is large 
enough to provide convincing evidence that the distribution of colors in the large bag is 
different from the company’s claim. To conduct the simulation, 100 random samples of size 
60 were selected from a population of M&M’S Milk Chocolate Candies that matches the 
company’s claim. For each random sample, the value of the y2x” test statistic was calculated 
and plotted on the dotplot. 


6. There is one dot at y2=16X° = 16 Explain what this dot represents. 


7. Where does your class’s value of y2x° fall relative to the other dots on the dotplot? What 
conclusion can you make about the distribution of colors in the large bag? 
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Simulated 7? test statistic 


The dotplot at the end of the activity shows the values of the chi-square test statistic that are 
likely to occur by chance alone when sampling from the company’s claimed M&M’S Milk 
Chocolate Candies color distribution. You may have noticed that the shape of the distribution 
isn’t approximately Normal. Will it always look like this? You will learn more about the 
sampling distribution of the chi-square test statistic shortly. 


SECTION 11.1 Chi-Square Tests for Goodness of Fit 


LEARNING TARGETS By the end of the section, you should be able to: 


e State appropriate hypotheses and compute the expected counts and chi-square 
test statistic for a chi-square test for goodness of fit. 


State and check the Random, 10%, and Large Counts conditions for performing 
a chi-square test for goodness of fit. 


Calculate the degrees of freedom and P-value for a chi-square test for goodness 
of fit. 


Perform a chi-square test for goodness of fit. 


Conduct a follow-up analysis when the results of a chi-square test are 
Statistically significant. 


Jerome’s class did the “Candy man can” activity using a bag from the Hackettstown factory. 
The one-way table summarizes the data from the class’s sample of M&M’S Milk Chocolate 
Candies. 


Color Brown Red Yellow Green Orange Blue Total 
Count 12 3 7 9 9 20 ~=60 
~ 12 
The sample proportion of brown candies is p=1260=0.20" ae ee Because the 
company claims that 12.5% of M&M’S® Milk Chocolate Candies are brown, Jerome might 
believe that something fishy is going on. He could use the one-sample z test for a proportion 
from Chapter 9 to test the hypotheses 


HO: p=0.125 Ho: p = 0.125 


where p is the true proportion of M&M?’S Milk Chocolate Candies in the large bag that are 
brown. He could then perform additional significance tests for each of the remaining colors. 


Note that the correct alternative hypothesis Hatta is two-sided. A sample proportion of brown 


candies much higher or much lower than 0.125 would give Jerome reason to be suspicious about 


the company’s claim. It’s not appropriate to adjust Hata after looking at the sample data! 


Besides being fairly inefficient, this method would also lead to the problem of multiple 
tests, which we discussed in Section 9.3. More important, this approach wouldn’t tell us how 
likely it is to get a random sample of 60 candies with a color distribution that differs as much 


from the one claimed by the company as the class’s sample does, taking all the colors into 
consideration at one time. For that, we use a new kind of significance test, called a chi-square 
test for goodness of fit. 


Stating Hypotheses 


As with any significance test, we begin by stating hypotheses. The null hypothesis in a chi- 
square test for goodness of fit should state a claim about the distribution of a single categorical 
variable in the population of interest. In the “Candy man can” activity, the categorical variable 
we’re measuring is color and the population of interest is all the M&M’S Milk Chocolate 
Candies in the large bag. The appropriate null hypothesis is: 


HO0:4o : The distribution of color in the large bag of M&M’S Milk Chocolate Candies is 
the same as the claimed distribution. 


The alternative hypothesis in a chi-square test for goodness of fit is that the categorical 
variable does not have the specified distribution. For the “Candy man can” activity, our 
alternative hypothesis is 


Ha:/4e * The distribution of color in the large bag of M&M’S Milk Chocolate Candies is 
not the same as the claimed distribution. 


Although we usually write the hypotheses in words, we can also write them in symbols. For 
example, here are the hypotheses for the “Candy man can” activity: 


HO: pbrown=0.125, pred=0.125, pyellow=0.125, pgreen=0.125, 
Ho: Porown = 0-125, Pred = 0-125, Pyetlow = 0-125, Pgreen = 0.125, 


porange=0.25, pblue =0.25 — Potange = 0-29, Phiue = 0.20 


where Pcojor = the true proportion of M&M’S Milk Chocolate Candies in the large bag of that 
color. 


Why don’t we write the alternative hypothesis as “Ha:/0 : At least one of the pi’sPi § is 
incorrect” instead? If the stated proportion in one category is wrong, then the stated proportion 
in at least one other category must be wrong because the sum of the pi’s?i $ must be 1. 

Oo Don’t state the alternative hypothesis in a way that suggests that all the 
proportions in the hypothesized distribution are wrong. For instance, it would be incorrect 
to write 


Ha: pbrown#0.125, pred#0.125, pyellow#0.125, pgreen#0.125 porange#0.25, pblue#0.25 
Ha: Prbrown # 0-125, Pred F 0.125, Pyetow F 0-125, Pgreen # 0.125 


Porange # 0.25, Prine ee 0.25 


Comparing Observed and Expected Counts: The Chi-Square Test 


Statistic 


The idea of the chi-square test for goodness of fit is this: we compare the observed counts from 
our sample with the counts that would be expected if HO is true. (Remember: we always 
assume that HO/0 is true when performing a significance test.) The more the observed counts 
differ from the expected counts, the more evidence we have against the null hypothesis and for 
the alternative hypothesis. 


Recall that Jerome’s class collected data from a random sample of M&M’S® Milk 
Chocolate Candies. How many candies of each color should they expect to find in their sample 
of 60 candies? Assuming that the color distribution stated by Mars, Inc., is true, 12.5% of the 
candies are brown. For random samples of 60 candies, the average number of brown candies 
should be (60)(0.125)=7.5(69) (0.125) = 7.5 This is our expected count of bbown M&M’S Milk 
Chocolate Candies. Using this same method, we find the expected counts for the other color 
categories: 


Red: (60)(0.125)=7.5 Yellow: (60)(0.125)=7.5Green: (60)(0.125)=7.5 Orange: (60) 
Red : (60) (0.125) = 7.5 


Yellow : (60) (0.125) = 7.5 
Green: (60) (0.125) = 7.5 
Orange : (60) (0.25) = 7.5 
(0.25)=7.5 Blue: (60)(0.25)=7.5 Blue: (60) (0.25) = 7.5 


ite 


CALCULATING EXPECTED COUNTS IN A CHI-SQUARE TEST FOR 
GOODNESS OF FIT 


Ramon Rivera Moret 


The expected count for category i in the distribution of a categorical variable is 


npi”’Pi 


where pi? is the relative frequency for category i specified by the null hypothesis. 


Did you notice that the expected count sounds a lot like the expected value of a random 
variable from Chapter 6? That’s no coincidence. The number of M&M’S Milk Chocolate 
Candies of a specific color in a random sample of 60 candies is a binomial random variable. Its 
expected value is np, the average number of candies of this color in many samples of 60 


M&M’S Milk Chocolate Candies. The expected count is not likely to be a whole number and 
shouldn’t be rounded to a whole number. 


To see if the data give convincing evidence for the alternative hypothesis, we compare the 
observed counts from our sample with the expected counts. If the observed counts are far from 
the expected counts, that’s the evidence we were seeking. The table gives the observed and 


expected counts for the sample of 60 candies from Jerome’s class. Figure 11.1 shows these 
counts as a side-by-side bar graph. 


Color Observed Expected 
Brown 12 fe 
Red 3 7.5 
Yellow 7 i 
Green 9 to 
Orange 9 15.0 
Blue 20 15.0 


C] Observed 
fz Expected 


Brown Red Yellow Green Orange Blue 
Color 


FIGURE 11.1 Bar graph comparing observed and expected counts for Jerome’s class sample of 
60 M&M'S® Milk Chocolate Candies. 


We see some fairly large differences between the observed and expected counts in several 
color categories. How likely is it that differences this large or larger would occur just by chance 
in random samples of size 60 from the population distribution claimed by Mars, Inc.? To 
answer this question, we calculate a statistic that measures how far apart the observed and 
expected counts are. The statistic we use to make the comparison is the chi-square test 
statistic 2x” (The symbol yx is the lowercase Greek letter chi, pronounced “kye” like “rye.”) 


AP® EXAM TIP 


The formula for the chi-square test statistic is included on the formula sheet that is 
provided on the AP® Statistics exam. However, it doesn’t include the word count: 


¥ (Observed—Expected 7 
Expected 


on 
X2=)> (Observed-Expected)2Expected* - 


We included the word count to emphasize that you must use the observed and 
expected counts—not the observed and expected proportions—when calculating 


the chi-square test statistic. 


DEFINITION Chi-square test statistic 


The chi-square test statistic is a measure of how far the observed counts are 
from the expected counts. The formula for the statistic is 


x2=); (Observed count-Expected count)2Expected count 
(Observed count —Expected count)” 


) yy | 
= 5 Expected count 


where the sum is over all possible values of the categorical variable. 


For Jerome’s data, we add six terms—one for each color category: 
X2=(12-7.5)27.5+(3-7.5)27.5+(7-7.5)27.5+(9-7.5)27.5+(9-15)215+ 
(20-15)215=2.7+2.7+0.03+0.30+2.4+1.67=9.8 
(12—7.5)2  (3-7.5)2 (7—7.5)” (9—-7.5)2 (9-15) (20-15)? 
an 7: 7.5 76° #76 8 4 
= 2.7+2.7-+ 0.03 + 0.30 + 2.4 + 1.67 = 9.8 


| 


Here’s an example to help you practice what you have learned so far. 


EXAMPLE |A fair die @& 
Hypotheses, expected counts, and the chi-square statistic 


PROBLEM: Carrie made a 6-sided die in her ceramics class and rolled it 90 times to test if 
each side was equally likely to show up. The table summarizes the outcomes of her 90 rolls. 


a. State the hypotheses that Carrie should test. 
b. Calculate the expected count for each of the possible outcomes. 


c. Calculate the value of the chi-square test statistic. 


Outcome of roll 1 2 3 4 5) 6 Total 
Frequency 12 28 12 13 10 15 90 
SOLUTION: 


a. Holo : The sides of Carrie’s die are equally likely to show up. 


Ha:Ha : The sides of Carrie’s die are not equally likely to show up. 
b. as Hoo is true, each of the 6 sides should show up 1/6 of the time. The expected count 


is 90(1/6) = 1599 (1/6) = 15 for each side. 


Outcome of roll i 2 B 4 5 6 Total 
Expected count 15 15 15 15 15 15 90 


You can also state the null hypothesis as: 
Ho: Carrie’s die is fair. 

or 
Ho: The distribution of outcome for Carrie’s 


die is uniform. 
or 


Ho: P1 = P2 = P3 = P4 = Ps = Pg = 1/6 


G 32202 1) IOS — ig a6 Ge = NIGMS = Is oiGe = IS pigedag = 
fis) bails =0,6+11.27+0.6+0.27+1.67+0 =14.41 
9 _ (12-15)? (28-15)? (12-15)? (13-15)? (10-15)? (15-15)? 
io 15 15 s © & © 15 
= 0.64 11.274+ 0.6+ 0.27+ 1.67+0 
= 1441 


X2=> (Observed count-Expected count)2Expected count 
9 ~ (Observed count—Expected count)* 
C= 


Expected count 


FOR PRACTICE, TRY EXERCISE 1 


Think About It 


WHY DO WE DIVIDE BY THE EXPECTED COUNT WHEN CALCULATING THE 
CHI-SQUARE TEST STATISTIC? In Jerome’s class sample, they got 4.5 more browns 
than expected (12-7.5)(12 ~ 7-5) and 5 more blues than expected (20-15)(20 — 15) Which 
of these is more surprising? 


Ramon Rivera Moret 


In both cases, the number of M&M’S® Milk Chocolate Candies in the sample exceeds 
the expected count by about the same amount. But it’s much more surprising to be off by 4.5 
out of an expected 7.5 brown candies (a 60% discrepancy) than to be off by 5 out of an 
expected 15 blue candies (a 33% discrepancy). For that reason, we want the category with a 
larger relative difference to contribute more heavily to the evidence against HO/0 and in 
favor of Ha/fe measured by the y2x” test statistic. 

If we just computed (Observed Count—Expected Count)2 


2 
(ObservedCount — ExpectedCount)” for each category instead, the contributions of these 


two color categories would be about the same (with the contribution from blue being slightly 
larger): 


Brown: (12-7.5)2=20.25 Blue: (20-15)2=25 
Brown : (12 — 7.5)” = 20.25 Blue : (20 — 15)? = 25 


By using (Observed count-Expected count)2/Expected count 


5) 
(Observed count — Expected count)* /Expected count we guarantee that the color category 


with the larger relative difference will contribute more heavily to the total: 


Brown: (12-7.5)27.5=2.7Blue: (20-15)215=1.67 
(12—7.5)2 (20-15)? 


B ; —————-= 2.7 Blue: ————= 1. 
rown 75 2.7 ue 5 67 


CHECK YOUR UNDERSTANDING 


i. 


J Mars, Inc., reports that the M&M’S® Peanut Chocolate Candies from their Cleveland 
factory have the following distribution of color: 20% blue, 20% orange, 20% green, 20% 
yellow, 10% red, and 10% brown. Joey bought a large bag of them and selected a random 
sample of 65 candies. He found 14 blue, 9 orange, 15 green, 14 yellow, 5 red, and 8 brown. 


1. State appropriate hypotheses for testing the company’s claim about the color distribution 
of M&M’S Peanut Chocolate Candies in Joey’s large bag. 


2. Calculate the expected count for each color. 


3. Calculate the chi-square test statistic for Joey’s sample. 


The Chi-Square Distributions and P-Values 


Think of y2X" as a measure of the distance the observed counts are from the expected counts. 
Like any distance, it is always zero or positive, and it is zero only when the observed counts are 
exactly equal to the expected counts. Large values of y2x” are stronger evidence for Ha/a 
because they say that the observed counts are far from what we would expect if HO0 were 
true. Small values of y2x” suggest that the data are consistent with the null hypothesis. Is the 
value from Jerome’s class, y2=9.8X = 9.8 | a large value? You know the drill: compare the 
observed value 9.8 against the sampling distribution that shows how 2x” would vary in 
repeated random sampling if the null hypothesis were true. 

We used software to simulate taking 1000 random samples of size 60 from the population 
distribution of M&M’S Milk Chocolate Candies given by Mars, Inc. Figure 11.2 shows a 
dotplot of the values of the chi-square test statistic for these 1000 samples. 

In 87 of the 1000 simulated 
samples, the value of the 


chi-square test statistic was 
at least 9.8—the observed 


test statistic from Jerome’s 
| class. 
4 | in. ee ee ° 


Simulated 7” test statistic 


FIGURE 11.2 Dotplot showing values of the chi-square test statistic in 1000 simulated samples 
of size n=60n = 60 from the population distribution of M&M’S Milk Chocolate Candies stated by 
the company. 


Recall that larger values of 2x” give more convincing evidence against HO and in favor 
of Ha/a. According to the dotplot, 87 of the 1000 simulated samples resulted in a chi-square 
test statistic of 9.8 or higher. Our estimated P-value is 87/ 1000=0.08787/ 1000 = 0.087 Because 
the P-value exceeds the default a=0.05e = 0.05 significance level, we fail to reject HO#0. We 
do not have convincing evidence that the color distribution in Jerome’s bag is different from 
the distribution claimed by the company. 

As Figure 11.2 suggests, the sampling distribution of the chi-square test statistic is not a 
Normal distribution. It is a right-skewed distribution that allows only nonnegative values 


because 2x” can never be negative. The sampling distribution of y2x” differs depending on 
the number of possible values for the categorical variable (i.e., on the number of categories). 


When the expected counts are all at least 5, the sampling distribution of the yx” test 
Statistic is modeled well by a chi-square distribution with degrees of freedom (df) equal to the 
number of categories minus 1. As with the ¢t distributions, there is a different chi-square 
distribution for each possible value of df. 


DEFINITION Chi-square distribution 


A chi-square distribution is defined by a density curve that takes only 
nonnegative values and is skewed to the right. A particular chi-square distribution 
is specified by its degrees of freedom. 


Figure 11.3 shows the density curves for three members of the chi-square family of 
distributions. As the degrees of freedom (df) increase, the density curves become less skewed, 
and larger values become more probable. 


Chi-square distribution 
with df = 1 


Chi-square distribution 


pe with df =4 


Chi-square distribution 
with df =8 


ic) 
0 2 4 6 8 10 12 14 16 


FIGURE 11.3 The density curves for three members of the chi-square family of distributions. 
Here are two other interesting facts about the chi-square distributions: 
e The mean of a particular chi-square distribution is equal to its degrees of freedom. 


e For df>2df > 2 the mode (peak) of the chi-square density curve is at df-24f — 2 


For example, when df=8df = 8 the chi-square distribution has a mean of 8 and a mode of 6. 


To get P-values from a chi-square distribution, we can use technology or Table C in the 


back of the book. For Jerome’s class data, y2=9.8X" = 9.8 Because all the expected counts are 
at least 5, the y2x” test statistic will be modeled well by a chi-square distribution when HOM 
is true. There are 6 color categories for M&M’S® Milk Chocolate Candies, so df=6-1=5 
df=6-1=5. 

The P-value is the probability of getting a value of yx” as large as or larger than 9.8 when 
HO is true. Figure 11.4 shows this probability as an area under the chi-square density curve 
with 5 degrees of freedom. 


Chi-square 
distribution 
with df =5 


=98 
FIGURE 11.4 The P-value for a chi-square test for goodness of fit using Jerome’s M&M’S Milk 
Chocolate Candies class data. 


To find the P-value using Table C, look in the df=5df = 5 row. The value y2=9.8X" = 9.8 
falls between the critical values 9.24 and 11.07. The corresponding areas in the right tail of the 
chi-square distribution with 5 degrees of freedom are 0.10 and 0.05. So the P-value for a test 
based on Jerome’s data is between 0.05 and 0.10. 


Now let’s look at how to find the P-value with your calculator. 


Pp 

df 15 .10 .05 
4 6.74 7.78 9.49 
8.12 9.24 £107 

6 9.45 10.64 17.59 


27. Technology Corner | FINDING P-VALUES FOR CHI-SQUARE 
TESTS @& 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 
To find the P-value in the M&M’S® Milk Chocolate Candies example with your calculator, 


use the y2x” cdf command. We ask for the area between y2=9.8X" = 9.8 and a very large 
number (we’Il use 10,000) under the chi-square density curve with 5 degrees of freedom. 


x*cdf(9.8,10006,.5) 
@.0811046958 


Press (DISTR) and choose yox” cdf(. 
OS 2.55 or later: In the dialog box, enter these values: lower:9.8, upper:10000, df:5, 


choose Paste, and then press |ENTER|. 


Older OS: Complete the command y2x° cdf(9.8,10000,5) and press [ENTER|. 


As the calculator screen shot shows, this method gives a more precise P-value than Table C. 


' 

Table C gives us an interval in which the P-value falls. The calculator’s y2X cdf command 
gives a result that is consistent with Table C but more precise. For that reason, we recommend 
using your calculator to compute P-values from a chi-square distribution. 


Based on Jerome’s sample, what conclusion should we draw about H0:/40 : The 
distribution of color in the large bag of M&M’S Milk Chocolate Candies is the same as the 
claimed distribution? Because our P-value of 0.081 is greater than a=0.05a = 0.05 we fail to 
reject HOM0. We don’t have convincing evidence that the distribution of color in the large bag 
of M&M’S Milk Chocolate Candies differs from the claimed distribution. This is consistent 
with the results of the simulation from the activity. 

o@ Failing to reject Hg does not mean that the null hypothesis is true! That is, we can’t 


conclude that the bag has the color distribution claimed by Mars, Inc. All we can say is that the 
sample data did not provide convincing evidence to reject HO/0. 


EXAMPLE | Return of the fair die (») 
Finding a P-value 


PROBLEM: Carrie made a 6-sided die in her ceramics class and rolled it 90 times to test if 
each side was equally likely to show up. The table summarizes the observed and expected 
counts. 


Outcome of roll 1 2 3 4 5 6 Total 
Observed count 12 28 12 13 10 15 90 
Expected count 15 15 15 15 15 15 90 


In the preceding example, we calculated y2=14.41X° = 14.41 Find the P-value using Table 
C. Then calculate a more precise value using technology. 

SOLUTION: 

df=6-1=sif =6-1=5 

Using Table C: the P-value is between 0.01 and 0.02. 


Using technology: x2 cdf(lower:14.41, wpper:10000, df:5)=0.0132 
x” cdf(lower:14.41, upper:10000, df:5) = 0.0132. 


df 5 # of categories -1 


Chi-square distribution 
with df =s5 


=H 


FOR PRACTICE, TRY EXERCISE 3 


Because the P-value is less than a=0.05a = 0.05 we should reject HO/#0. There is 
convincing evidence that Carrie’s die is unfair. 


CHECK YOUR UNDERSTANDING 


\ 


y Let’s continue our analysis of Joey’s sample of M&M’S® Peanut Chocolate Candies 
from the preceding Check Your Understanding (page 714). 


1. Confirm that the expected counts are large enough to use a chi-square distribution to 
calculate the P-value. Which degrees of freedom should you use? 


2. Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 


3. What conclusion would you draw about the company’s claimed color distribution for 
M&M’S Peanut Chocolate Candies? 


Carrying Out a Test 


Like our test for a population proportion, the chi-square test for goodness of fit uses an 
approximation that becomes more accurate as we take larger samples. The Large Counts 
condition says that all expected counts must be at least 5. Before performing a test, we must 
also check that the Random and 10% conditions are met. 


CONDITIONS FOR PERFORMING A CHI-SQUARE TEST FOR 
INDEPENDENCE 


e Random: The data come from a random sample from the population of interest. 


o 10%: When sampling without replacement, n<0.10 Nn < 0.10 N 


| e Large Counts: All expected counts are at least 5. | 


AP® EXAM TIP 


When checking the Large Counts condition, be sure to examine the expected 
counts, not the observed counts. And make sure to write and label the expected 
counts on your paper or you won't receive credit. 


When we want to compare the distribution of a categorical variable in one population to a 
claimed distribution, we use a chi-square test for goodness of fit. 


THE CHI-SQUARE TEST FOR GOODNESS OF FIT 


Suppose the conditions are met. To perform a test of 


H0:40 : The stated distribution of a categorical variable in the population of interest is 
correct 


compute the chi-square test statistic: 


x2=)' (Observed count — Expected count)2Expected count 


— (Observed count — Expected count)? 
= Expected count 


where the sum is over the k different categories. The P-value is the area to the right of y2x” 
under the chi-square density curve with k-1* — | degrees of freedom. 


The next example shows the chi-square test for goodness of fit in action. As always, we 
follow the four-step process when performing inference. 


EXAMPLE | Birthdays in hockey (>) 
A test for equal proportions 


Canva Pty Ltd/Alamy 


PROBLEM: In his book Outliers, Malcolm Gladwell suggests that a hockey player’s birth 
month has a big influence on his chance to make it to the highest levels of the game. 
Specifically, because January 1 is the cut-off date for youth leagues in Canada [where many 
National Hockey League (NHL) players come from], players born in January will be 
competing against players up to 12 months younger. The older players tend to be bigger, 
stronger, and more coordinated and hence get more playing time, more coaching, and have a 
better chance of being successful. 

To see if birth date is related to success (judged by whether a player makes it into the 
NHL), a random sample of 80 NHL players from a recent season was selected and their 
birthdays were recorded. The one-way table summarizes the data on birthdays for these 80 
players. 


Birthday Jan—Mar Apr—Jun Jul-Sep Oct—Dec 
Number of players 32 20 16 42 


Do these data provide convincing evidence that the birthdays of NHL players are not 
uniformly distributed across the four quarters of the year? 


SOLUTION: 


STATE: 


Ho:Ho : The birthdays of all NHL players are uniformly distributed across the four 
y jes) 4) 
quarters of the year. 


Haze : The birthdays of all NHL players are not uniformly distributed across the four 
iy Jay uy 
quarters of the year. 


Follow the four-step process! 


We could write the hypotheses in symbols as 
HO:pJan—Mar=pApr—Jun=pJul-Sep=pOct-—Dec=1/4 
Hp > PJan—Mar = PApr—Jun >= PJul—Sep = POct—Dec = 1/4 


| H,;: At least two of the proportions are not 1/4 | 


We'll use « = 0.05. = 0.05. 
PLAN: Chi-square test ‘for goodness of fit 


e Random: The data came from a random sample of NHL players. / 


© 10%: We must assume that 80 is less than 10% of all NHL players. / 
counts = 80(1/4) =20>5 y 


e Large Counts: All expected counts = 80(1/4) = 2025 


There were 879 NHL players in the population from which we selected our sample. 


There is some evidence in favor of Ha/Z« because the observed counts differ from the 


expected counts. 


DO: 
e Test statistic: 


y2=(32 — 20)220+(20 — 20)220+(16 — 20)220+(12 — 20)220=7.24+0+0.843.2 = 11.2 
(32 —20)2 (20-20)? (16-20)? (12-20)? 
— 9 2 20 20 
= 7.2+04+0.84+3.2 = 11.2 
* P-value: df=4-1=34f =4-1=3 
Using Table C: The P-value is between 0.01 and 0.02. 


2 


Using technology: y2cdf(lower:11.2, upper:10000, df:3)=0.011 
x’ cdf(lower : 11.2, upper : 10000, df: 3) = 0.011 


CONCLUDE: Because the P-value of 0.011<a = 0.05,9-011 <a=0.05, we reject Hoo. 
We have convincing evidence that the birthdays of NHL players are not uniformly 
distributed across the four quarters of the year. 


Chi-square distribution 
with df =3 


0 10} 20 


FOR PRACTICE, TRY EXERCISE 9 


You can use your calculator to carry out the “Do” step for a chi-square test for goodness of 
fit. Remember that using your calculator comes with potential benefits and risks on the AP® 


Statistics exam. 


28. Technology Corner |PERFORMING A CHI-SQUARE TEST FOR 


GOODNESS OF FI (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool. bfwpub.com/tps6e. 


You can use the TI-84 to perform the calculations for a chi-square test for goodness of fit. 
We’ll use the data from the hockey and birthdays example to illustrate the steps. 


Birthday 
Jan—Mar 
Apr-Jun 
Jul-Sep 
Oct—Dec 


Observed 
32 
20 
16 
12 


Expected 
20 
20 
20 
20 


1. Enter the observed counts in L1 and the expected counts in L2. 


2. Press |STAT|, arrow over to TESTS, and choose yx GOF-Test... . 


Note: TI-83s and some older TI-84s don’t have this test. TI-84 users can get this 
functionality by upgrading their operating systems. 


3. Enter the inputs shown. If you choose Calculate, you’ ll get a screen with the test statistic, 
P-value, and df. If you choose the Draw option, you’!! get a picture of the appropriate chi- 
square distribution with the test statistic marked and shaded area corresponding to the P- 


value. 


Observed:Li 
Expected:L2 
df:3 


eae BLUE 


Calculate Draw 


ORMAL FLOAT AUTO REAL RADIAN CL fl JORMAL FLOAT AUTO REAL RADIAN CL 


X2GOF-Test 
X2=11.2 
p=.0106921291 


df= 
CNTRB={7.2 @ .8 3.2} 


We’ll discuss the CNTRB results shortly. 


AP® EXAM TIP 


ORMAL FLOAT AUTO REAL RADIAN CL i 


You can use your calculator to carry out the mechanics of a significance test on the 
AP® Statistics exam. But there’s a risk involved. If you just give the calculator 
answer with no work, and one or more of your values is incorrect, you will likely get 


no credit for the “Do” step. We recommend writing out the first few terms of the chi- 
square calculation followed by “...”. This approach might help you earn partial credit 
if you enter a number incorrectly. Be sure to name the procedure (chi-square test 
for goodness of fit) and to report the test statistic (x2=11.2) (x’ = 11.2) degrees of 
freedom (df=3) (at = 3) and P-value (0.011). 


FOLLOW-UP ANALYSIS In the chi-square test for goodness of fit, we test the null 
hypothesis that a categorical variable has a specified distribution in the population of interest. If 
the sample data lead to a statistically significant result, we can conclude that our variable has a 
distribution different from the one stated. To investigate how the distribution is different, start 
by identifying the categories that contribute the most to the chi-square statistic. Then describe 
how the observed and expected counts differ in those categories, noting the direction of the 
difference. 


belchonock /Depositphoto 


Let’s return to the hockey and birthdays example. The table of observed and expected 
counts for the 80 randomly selected NHL players is repeated on the following page. The last 
column shows the contributions (also called components) of the chi-square test statistic. The 
two biggest contributions to the chi-square statistic came from Jan—Mar and Oct—Dec. In 
January through March, 12 more players were born than expected. In October through 
December, 8 fewer players were born than expected. These results support Malcolm Gladwell’s 
claim that NHL players are more likely to be born early in the year. 


Birthday Observed Expected O-E (O - E)7/E 
Jan—Mar eZ 20 12 7.2 
Apr—Jun 20 20 0 0.0 
Jul-Sep 16 20 -4 0.8 
Oct-Dec 12 20 =3 Se 


Note: When we ran the chi-square test for goodness of fit on the calculator, a list of these 
individual components was produced and stored in the list menu. On the TI-84, the list is called 
CNTRB (for contribution). 


CHECK YOUR UNDERSTANDING 


. 


y Does the warm, sunny weather in Arizona affect a driver’s choice of car color? Cass 
thinks that Arizona drivers might opt for a lighter color with the hope that it will reflect some 
of the heat from the sun. To see if the distribution of car colors in Oro Valley, near Tucson, 
is different from the distribution of car colors across North America, she selected a random 
sample of 300 cars in Oro Valley. The table shows the distribution of car color for Cass’s 
sample in Oro Valley and the distribution of car color in North America, according to 


www.ppg.com.4 


Color White Black Gray’ Silver Red Blue Green Other Total 


Oro Valley 84 38 31 46 20 29 6 39 300 
sample 
North America 23% 18% 16% 15% 10% 9% 2% 7% 100% 


1. Do these data provide convincing evidence that the distribution of car color in Oro Valley 
differs from the North American distribution? 


2. If there is convincing evidence of a difference in the distribution of car color, perform a 
follow-up analysis. 


Section 11.1 Summary 


e The chi-square test for goodness of fit tests the null hypothesis that a categorical variable 
has a specified distribution in the population of interest. The alternative hypothesis is that the 
variable does not have the specified distribution in the population of interest. 


e This test compares the observed count in each category with the counts that would be 
expected if HO“0 were true. The expected count for any category is found by multiplying 
the sample size by the proportion in each category according to the null hypothesis. 


e The chi-square test statistic is 


Xx2=); (Observed count — Expected count)2Expected count 


" (Observed count — Expected count) 2 


Expected count 
where the sum is over all possible categories. 
e The conditions for performing a chi-square test for goodness of fit are: 
= Random: The data come from a random sample from the population of interest. 


© 10%: When sampling without replacement, n<0.10 N.n < 0.10 .N. 


« Large Counts: All expected counts are at least 5. 


2 — 
e When the conditions are met, the sampling distribution of the y2* test statistic can be 
modeled by a chi-square distribution. 


e Large values of 2x” are evidence against HO/0 and in favor of Ha/fc. The P-value is the 


area to the right of y2X under the chi-square density curve with degrees of freedom 
df=number of categories—14f = number of categories —1 


e If the test finds a statistically significant result, consider doing a follow-up analysis that 
looks for the largest contributions to the chi-square test statistic and compares the observed 
and expected counts. 


11.1 Technology Corners | @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


27. Finding P-values for chi-square tests Page 716 
28. Performing a chi-square test for goodness of fit Page 719 


Section 11.1 Exercises 


1. pg 712 |») Aw, nuts! A company claims that each batch of its deluxe mixed nuts 


contains 52% cashews, 27% almonds, 13% macadamia nuts, and 8% Brazil nuts. To test 
this claim, a quality-control inspector takes a random sample of 150 nuts from the latest 
batch. The table displays the sample data. 


Type of nut Cashew Almond Macadamia Brazil 
Count 83 29 20 18 


a. State appropriate hypotheses for performing a test of the company’s claim. 
b. Calculate the expected count for each type of nut. 


c. Calculate the value of the chi-square test statistic. 

2. Roulette Casinos are required to verify that their games operate as advertised. American 
roulette wheels have 38 slots—18 red, 18 black, and 2 green. In one casino, managers 
record data from a random sample of 200 spins of one of their American roulette wheels. 
The table displays the results. 


Color Red Black Green 
Count 85 99 16 


8. 


a. State appropriate hypotheses for testing whether these data give convincing evidence 
that the distribution of outcomes on this wheel is not what it should be. 


b. Calculate the expected count for each color. 
c. Calculate the value of the chi-square test statistic. 
pg 716 |») P-values For each of the following, find the P-value using Table C. Then 


calculate a more precise value using technology. 
a. x2=19.03, df=11X° = 19.03, df= 11 


b. x2=19.03, df=3x" = 19.03, df = 3 
More P-values For each of the following, find the P-value using Table C. Then calculate a 
more precise value using technology. 


a. y2=4.49, df=5x° = 4.49, df = 5 


b. x2=4.49, df=1x" = 4.49, df = 1 
Aw, nuts! Refer to Exercise 1. 


a. Confirm that the expected counts are large enough to use a chi-square distribution to 
calculate the P-value. What degrees of freedom should you use? 


b. Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 


c. What conclusion would you draw about the company’s claimed distribution for its 
deluxe mixed nuts? 
Roulette Refer to Exercise 2. 


a. Confirm that the expected counts are large enough to use a chi-square distribution to 
calculate the P-value. What degrees of freedom should you use? 


b. Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 


c. What conclusion would you draw about whether or not the roulette wheel is operating 
correctly? 

No chi-square A school’s principal wants to know if students spend about the same 

amount of time on homework each night of the week. She asks a random sample of 50 

students to keep track of their homework time for a week. The following table displays the 

average amount of time (in minutes) students reported per night. 


Night Sunday Monday Tuesday Wednesday Thursday Friday Saturday 
Average time 130 108 115 104 99 37 62 


Explain carefully why it would not be appropriate to perform a chi-square test for goodness 
of fit using these data. 

No chi-square The principal in Exercise 7 also asked the random sample of students to 
record whether they did all of the homework that was assigned on each of the five school 
days that week. Here are the data: 


School day Monday Tuesday Wednesday’ Thursday _ Friday 


Number who did all homework 34 29 32 28 19 


Explain carefully why it would not be appropriate to perform a chi-square test for goodness 
of fit using these data. 
9. pg 718 |») Munching Froot Loops Kellogg’s Froot Loops cereal comes in six colors: 


orange, yellow, purple, red, blue, and green. Charise randomly selected 120 loops and 
noted the color of each. Here are her data: 


Color Orange Yellow Purple Red Blue Green 
Count 28 21 16 25 14 16 


Do these data provide convincing evidence at the 5% significance level that Kellogg’s 
Froot Loops do not contain an equal proportion of each color? 

10. What’s your sign? The University of Chicago’s General Social Survey (GSS) is the 
nation’s most important social science sample survey. For reasons known only to social 
scientists, the GSS regularly asks a random sample of people their astrological sign. Here 
are the counts of responses from a recent GSS of 4344 people: 


Sign Aries Taurus Gemini Cancer Leo Virgo 
Count o21 360 367 374 383 402 
Sign Libra Scorpio Sagittarius Capricorn Aquarius Pisces 
Count 392 329 331 354 376 355 


If births are spread uniformly across the year, we expect all 12 signs to be equally likely. 
Do these data provide convincing evidence at the 1% significance level that all 12 signs are 
not equally likely? 

11. Fruit flies Biologists wish to mate pairs of fruit flies having genetic makeup RrCc, 
indicating that each has one dominant gene (R) and one recessive gene (r) for eye color, 
along with one dominant (C) and one recessive (c) gene for wing type. Each offspring will 
receive one gene for each of the two traits from each parent, so the biologists predict that 
the following phenotypes should occur in a ratio of 9:3:3:1: 


Phenotype Red eyes and Red eyes and White eyes and White eyes and 
straight wings curly wings straight wings curly wings 
Frequency 99 42 49 10 


Assume that the conditions for inference are met. Carry out a test at the a=0.05a = 0.05 
significance level of the proposed genetic model. 

12. You say tomato The paper “Linkage Studies of the Tomato” (Transactions of the 
Canadian Institute, 1931) reported the following data on phenotypes resulting from 
crossing tall cut-leaf tomatoes with dwarf potato-leaf tomatoes. We wish to investigate 
whether the following frequencies are consistent with genetic laws, which state that the 
phenotypes should occur in the ratio 9:3:3:1. 


Phenotype Tall cut Tall potato Dwarf cut Dwarf potato 


Frequency 926 288 293 104 


Assume that the conditions for inference are met. Carry out a test at the a=0.05a = 0.05 
significance level of the proposed genetic model. 

13. Birds in the trees Researchers studied the behavior of birds that were searching for seeds 
and insects in an Oregon forest. In this forest, 54% of the trees are Douglas firs, 40% are 
ponderosa pines, and 6% are other types of trees. At a randomly selected time during the 
day, the researchers observed 156 red-breasted nuthatches: 70 were seen in Douglas firs, 79 


in ponderosa pines, and 7 in other types of trees.? 


a. Do these data provide convincing evidence that nuthatches prefer particular types of 
trees when they’re searching for seeds and insects? 


b. Relative to the proportion of each tree type in the forest, which type of trees do the 
nuthatches seem to prefer the most? The least? 

14. Seagulls by the seashore Do seagulls show a preference for where they land? To answer 
this question, biologists conducted a study in an enclosed outdoor space with a piece of 
shore whose area was made up of 56% sand, 29% mud, and 15% rocks. The biologists 
chose 200 seagulls at random. Each seagull was released into the outdoor space on its own 
and observed until it landed somewhere on the piece of shore. In all, 128 seagulls landed on 
the sand, 61 landed in the mud, and 11 landed on the rocks. 


a. Do these data provide convincing evidence that seagulls show a preference for where 
they land? 


b. Relative to the proportion of each ground type on the shore, which type of ground do the 
seagulls seem to prefer the most? The least? 

15. Mendel and the peas Gregor Mendel (1822-1884), an Austrian monk, is considered the 
father of genetics. Mendel studied the inheritance of various traits in pea plants. One such 
trait is whether the pea is smooth or wrinkled. Mendel predicted a ratio of 3 smooth peas 
for every 1 wrinkled pea. In one experiment, he observed 423 smooth and 133 wrinkled 
peas. Assume that the conditions for inference are met. 


a. Carry out a chi-square test for goodness of fit for the genetic model that Mendel 
predicted. 


b. In Chapter 9, Exercise 49, you tested Mendel’s prediction using a one-sample z test for a 
proportion. The hypotheses were HO: p=0.75/40 : P = 9.75 and Ha: p20.75 a ‘pF 0.75 
where p = true proportion of smooth peas. Calculate the z statistic and P-value for this 
test. How do these values compare to the values from part (a)? 

16. Spinning heads? When a fair coin is flipped, we all know that the probability the coin 
lands on heads is 0.50. However, what if a coin is spun? According to the article “Euro 
Coin Accused of Unfair Flipping” in the New Scientist, two Polish math professors and 
their students spun a Belgian euro coin 250 times. It landed heads 140 times. One of the 
professors concluded that the coin was minted asymmetrically. A representative from the 
Belgian mint indicated the result was just chance. Assume that the conditions for inference 
are met. 


a. Carry out a chi-square test for goodness of fit to test if heads and tails are equally likely 


when a euro coin is spun. 


In Chapter 9, Exercise 50, you analyzed these data with a one-sample z test for a 
proportion. The hypotheses were HO: p=0.5/40 : Pp = 9.5 and Ha: p#0.5/4a ‘pF 0.5 
where p = the true proportion of heads. Calculate the z statistic and P-value for this test. 
How do these values compare to the values from part (a)? 


17. Skittles® Statistics teacher Jason Molesky contacted Mars, Inc., to ask about the color 
distribution for Skittles candies. Here is an excerpt from the response he received: “The 
original flavor blend for the Skittles Bite Size Candies is lemon, green apple, orange, 
strawberry and grape. They were chosen as a result of consumer preference tests we 
conducted. The flavor blend is 20 percent of each flavor.” 


a. State appropriate hypotheses for a significance test of the company’s claim. 


Find the expected counts for a random sample of 60 candies. 


How large a yx” test statistic would you need to have significant evidence against the 
company’s claim at the a=0.050 = 0.05 level? At the a=0.01a = 0.01 level? 


Create a set of observed counts for a random sample of 60 candies that gives a P-value 
between 0.01 and 0.05. Show the calculation of your chi-square test statistic. 


18. Is your random number generator working? Use your calculator’s RandInt function to 
generate 200 digits from 0 to 9 and store them in a list. 


a. 


State appropriate hypotheses for a chi-square test for goodness of fit to determine 
whether your calculator’s random number generator gives each digit an equal chance of 
being generated. 


Carry out a test at the a=0.05e = 0.05 significance level. Hint: To obtain the observed 
counts, make a histogram of the list containing the 200 random digits, and use the trace 
feature to see how many of each digit were generated. You may have to adjust your 
window to go from —0.5~9-9 to 9.5 with an increment of 1. 


Assuming that a student’s calculator is working properly, what is the probability that the 
student will make a Type I error in part (b)? 


Suppose that 25 students in an AP® Statistics class independently do this exercise for 
homework and that all of their calculators are working properly. Find the probability that 
at least one of them makes a Type I error. 


Multiple Choice: Select the best answer for Exercises 19-22. 


Exercises 19-21 refer to the following setting. The manager of a high school cafeteria is 
planning to offer several new types of food for student lunches in the new school year. She 
wants to know if each type of food will be equally popular so she can start ordering 
supplies and making other plans. To find out, she selects a random sample of 100 students 
and asks them, “Which type of food do you prefer: Ramen, tacos, pizza, or hamburgers?” 
Here are her data: 


19. 


20. 


z1, 


Type of food Ramen Tacos Pizza Hamburgers 
Count 18 22 39 21 


An appropriate null hypothesis to test whether the food choices are equally popular is 


a. HO: p=254o > = 25 where pi = the mean number of students that prefer each type of 
food. 


b. HO: p=0.2540 : p = 9.25 where p = the proportion of all students who prefer ramen. 


c. HO: nR=nT=nP=nH=25/0 : "rx = nr = np = NH = 25 where nR”R is the number of 
students in the school who would choose ramen, and so on. 

d. HO: pR=pT=pP=pH=0.25/40 : Pr = pr = pp = Pu = 9.25 where pRPR is the proportion 
of students in the school who would choose ramen, and so on. 

e. HO: p\R=pT=p\P=p\H=0.25/20: Pr=Pr=Pp=Pu = 9.25 where pARPR is the 
proportion of students in the sample who chose ramen, and so on. 

The chi-square test statistic is 


a. (18-25)225+(22—25)225+(39-25)225+(21-25)225 


2 5 > °. 9 
(22—25)* (39—25)° (21—25)° 
+ — 4 


(18—25) 


25 25 25 25 


b. (25-18)218+(25-22)222+(25-39)239+(25-21)221 
(25-18)? (25-22)? ; (25-39)? ; (25-21)? 


18 22 39 21 


_—_—_ ee 


c. (18-25)25+(22-25)25+(39-25)25+(21-25)25 2° 2 oY 8S 
d. (18-25)2100+(22-25)2100+(39-25)2100+(21-25)2100 


(18—25)* (22-25)? (39—25)* (21—25)* 


100 100 100 100 


e. (0.18-0.25)20.25+(0.22-0.25)20.25+(0.39-0.25)20.25+(0.21-0.25)20.25 
(0.18—0.25)? (0.22—0.25)" (0.39—0.25)* (0.21—0.25)* 
02 - 0.25 0.25 0.25 
The P-value for a chi-square test for goodness of fit is 0.0129. Which of the following is the 
most appropriate conclusion at a significance level of 0.05? 


a. Because 0.0129 is less than a=0.05a = 0.05 reject HO0. There is convincing evidence 
that the food choices are equally popular. 


b. Because 0.0129 is less than a=0.050 = 0.05 reject HO/0. There is not convincing 
evidence that the food choices are equally popular. 


c. Because 0.0129 is less than a=0.05e = 0.05 reject HO0. There is convincing evidence 
that the food choices are not equally popular. 


d. Because 0.0129 is less than a=0.05e = 0.05 fail to reject HO0. There is not convincing 
evidence that the food choices are equally popular. 


e. Because 0.0129 is less than a=0.05e = 0.05 fail to reject HOM0. There is convincing 
evidence that the food choices are equally popular. 


22. Which of the following is false? 


a 


b. 


C. 


d. 


e. 


. A chi-square distribution with k degrees of freedom is more right-skewed than a chi- 
square distribution with k+1* + 1 degrees of freedom. 


A chi-square distribution never takes negative values. 
The degrees of freedom for a chi-square test are determined by the sample size. 
P(y2>10)P (x* > 10) i, greater when df=k+14f = & + 1 than when df=kdf = k. 


The area under a chi-square density curve is always equal to 1. 


Recycle and Review 


23. 


Video games (1.1) To determine if there is a relationship between age and playing video 
games, the Pew Research Center asked randomly selected adults for their age and if they 
“ever play video games on a computer, TV, game console, or portable device like a cell 


phone.”2 Here is a two-way table summarizing the results of the study: 
Age group 
18-29 30-49 50-64 65+ Total 
Play video Yes 3033 
aa No 3126 
Total 1316 2089 1635 1119 6159 


a. Construct a segmented bar graph to display the relationship between age group and 


response to the question about video games. 


b. Describe the association shown in the segmented bar graph in part (a). 


Exercises 24—26 refer to the following setting. Do students who read more books for 
pleasure tend to earn higher grades in English? The boxplots show data from a simple 
random sample of 79 students at a large high school. Students were classified as light 
readers if they read fewer than 3 books for pleasure per year. Otherwise, they were 
classified as heavy readers. Each student’s average English grade for the previous two 


marking periods was converted to a GPA scale, where A=4.0, A-=3.7, B+=3.3 


Heavy 


Type of reader 


24 26 28 30 32 34 36 38 4.0 
Average English grade 


24. Reading and grades (1.3) Write a few sentences comparing the distributions of average 
English grade for light and heavy readers. 


25. Reading and grades (10.2) Summary statistics for the two groups from Minitab are 


provided. 
Type of n Mean StDev SE Mean 
reader 
Heavy 47 3.640 0.324 0.047 
Light 32 3.356 0.380 0.067 


Explain why it is acceptable to use two-sample t procedures in this setting. 


Construct and interpret a 95% confidence interval for the difference in the mean 
English grade for light and heavy readers. 


Does the interval in part (b) provide convincing evidence that reading more causes a 
difference in students’ English grades? Justify your answer. 


26. Reading and grades (3.2) The scatterplot shows the number of books read and the 
English grade for all 79 students in the study. The least-squares regression line 
y\=3.42+0.024xY = 3.42 + 0.0242 has been added to the graph. 


a. 


b. 


Interpret the slope and y intercept. 


The student who reported reading 17 books for pleasure had an English GPA of 2.85. 
Calculate and interpret this student’s residual. 


For this linear model, r2=0.083r? = 0.083 Interpret this value. 


Average English grade 
NNN WY WY WE 


6 8 10 12 14 16 18 20 22 
Number of books read 


SECTION 11.2 Inference for Two-Way Tables 


LEARNING TARGETS By the end of the section, you should be able to: 


e State appropriate hypotheses and compute the expected counts and chi-square 
test statistic for a chi-square test based on data in a two-way table. 


State and check the Random, 10%, and Large Counts conditions for a chi- 
square test based on data in a two-way table. 


Calculate the degrees of freedom and P-value for a chi-square test based on 
data in a two-way table. 


Perform a chi-square test for homogeneity. 


Perform a chi-square test for independence. 


Choose the appropriate chi-square test in a given setting. 


The two-sample z procedures of Chapter 10 allow us to compare the proportions of successes 
in two populations or for two treatments. What if we want to compare more than two samples 
or groups? More generally, what if we want to compare the distributions of a single categorical 
variable across several populations or treatments? We rely on a new significance test, called a 
chi-square test for homogeneity. 


The test for homogeneity starts by presenting the data in a two-way table. However, two- 
way tables have other uses than comparing distributions of a single categorical variable. As we 
saw in Section 1.1, they can also be used to summarize relationships between two categorical 
variables. To determine if there is convincing evidence of an association between two 
categorical variables, we perform a chi-square test for independence. 


Tests for Homogeneity: Stating Hypotheses 


Does background music influence what customers buy? One experiment in a European 
restaurant compared three randomly assigned treatments: no music, French accordion music, 
and Italian string music. Under each condition, the researchers recorded the number of 
customers who ordered French, Italian, and other entrées.4 The null hypothesis in this example 
is 
HO0:40 : There is no difference in the true distributions of entrées ordered at this restaurant 
when no music, French accordion music, or Italian string music is played. 


It would also be correct to state the null hypothesis as Ho:4o : The distribution of a categorical 


variable is the same for each of several populations or treatments. We prefer the “no difference” 
wording because it’s more consistent with the language we used in the significance tests of 


Chapter 10. 


In general, the null hypothesis in a chi-square test for homogeneity says that there is no 
difference in the true distribution of a categorical variable in the populations of interest or for 
the treatments in an experiment. 


The alternative hypothesis says that there is a difference in the distributions but does not 
specify the nature of that difference. In the restaurant example, the alternative hypothesis is 


Ha:a : There is a difference in the true distributions of entrées ordered at this restaurant 
when no music, French accordion music, or Italian string music is played. 


og The alternative hypothesis does not state that each distribution is different from 


each of the others. Instead, the alternative hypothesis will be true even if just one of the true 
distributions is different from the others. Consequently, any difference among the three 
observed distributions of entrées ordered is evidence against the null hypothesis and for the 
alternative hypothesis. 


So how did the experiment turn out? The two-way table summarizes the data and Figure 
11.5 shows the conditional relative frequencies of entrée ordered for each of the three 
treatments. 


Type of background music 


. French 
Entrée ordered ; 
Italian 
Other 
Total 
Music = None Music = French Music = Italian 
50 50 50 
40 40 40 
5 30 5 30 5 30 
z z z 
& 20 2 2 2 20 
10 L 10 10 
0 - 0 : 0 : 
French Italian Other French Italian Other French Italian Other 
Entrée Entrée Entrée 


FIGURE 11.5 Relative frequency bar graphs comparing the distributions of entrées ordered for 
different music conditions. 


The type of entrée that customers order seems to differ considerably across the three music 
treatments. Orders of Italian entrées are very low (1.3%) when French music is playing, but are 
higher when Italian music (22.6%) or no music (13.1%) is playing. French entrées seem 
popular in this restaurant, as they are ordered often under all music conditions—but notably 
more often when French music is playing. For all three music treatments, the percent of Other 
entrées ordered was similar. 


Do the differences in these distributions provide convincing evidence that background 
music affects customer behavior at this restaurant? Or is it plausible that the background music 
has no effect on customer behavior and that these differences are due to the chance involved in 
the random assignment of treatments? To decide, we have to know how likely it is to get 
differences this big or bigger when the null hypothesis is true. In other words, we need a P- 
value! 


With only the methods we already know, we might start by comparing the proportions of 


French entrées ordered when no music (p\=30/84=0.357)\P = 30/84 = 0.357) 
(p = 39/75 = 0.52) 


and when 
French accordion music are played (p4=39/75=0.52) using a two-sample z 
test. We could similarly compare other pairs of proportions, ending up with many tests and 
many P-values. This is a bad idea. gw Performing multiple tests on the same data increases 
the probability that we make a Type I error in at least one of the tests. 

Because of the increased probability of a false positive, it’s cheating to pick out one large 
difference from the two-way table and then perform a significance test as if it were the only 
comparison we had in mind. Statisticians even have a name for this unethical practice: P- 
hacking. 

For example, a test comparing the proportions of French entrées ordered under the no music 
and French accordion music treatments shows that the difference is statistically significant 
(z=2.06, P=0.039) (2 = 2.06, P = 0.039) However, the proportions of Italian entrées ordered for 
the no music and Italian string music treatments do not differ significantly (z=1.61, P=0.107) 

(z = 1.61, P = 0.107) Reporting only the results of the first test wouldn’t be telling the whole 
story. 

The problem of how to do many comparisons at once without increasing the overall 
probability of a Type I error is common in statistics. Statistical methods for dealing with 
multiple comparisons usually have two steps: 


|. Perform an overall test to see if there is convincing evidence of any differences among the 
parameters that we want to compare. 


», When the overall test shows there is convincing evidence of a difference, perform a detailed 
follow-up analysis to decide which of the parameters differ and to estimate how large the 
differences are. 


When we want to compare the distribution of a categorical variable for several populations 
or treatments, the overall test uses the familiar chi-square test statistic. 


Tests for Homogeneity: Expected Counts and the Chi-Square Test 
Statistic 


A chi-square test for homogeneity begins with the hypotheses 


Hg: There is no difference in the distribution of a categorical variable for several 


populations or treatments. 


H,: There is a difference in the distribution of a categorical variable for several populations 


or treatments. 


To perform the test, we compare the observed counts in a two-way table with the counts we 
would expect if HO“0 were true. Calculating the expected counts isn’t difficult, but we 
calculate them slightly differently than in the chi-square test for goodness of fit. 

The null hypothesis in the restaurant experiment is that there’s no difference in the 
distributions of entrées ordered when no music, French accordion music, or Italian string music 
is played. To find the expected counts, we start by assuming that Ho is true. We can see from 
the two-way table that 99 of the 243 entrées ordered during the study were French. 


Type of background music 


French 
Italian 
Other 
Total 


Entrée ordered 


Ingram Publishing/Alamy 


If the specific type of music that’s playing has no effect on entrée orders, the proportion of 
French entrées ordered under each music condition should be 99/243=0.407499/243 = 0.4074 


Because there were 84 total entrées ordered when no music was playing, we would expect 
9 
84 eet 84 (0.4074) = 34.22 
84-99243=84(0.4074)=34.22 243 
of those entrées to be French, on average. The expected counts of French entrées ordered under 
the other two music conditions can be found in a similar way: 


French music: 75(0.4074)=30.56 French music: 75 (0.4074) = 30.56 
Italian music: 84(0.4074)=34,22!talian music: 84 (0.4074) = 34.22 


AP® EXAM TIP 


As with chi-square tests for goodness of fit, the expected counts should not be 
rounded to the nearest whole number. While an observed count of entrées ordered 
must be a whole number, an expected count need not be a whole number. The 
expected count gives the average number of entrées ordered if HO/40 is true and 
the random assignment process is repeated many times. 


We repeat the process to find the expected counts for the other two types of entrées. The 
overall proportion of Italian entrées ordered during the study was 31/243=0.1276 
31/243 = 0.1276 So the expected counts of Italian entrées ordered under each treatment are 
No music: 84(0.1276)=10.72French music: 75(0.1276)=9.57 Italian music: 84(0.1276)=10.72 
No music: 84(0.1276) = 10.72 
French music: 75 (0.1276) = 9.57 
Italian music: 84(0.1276) = 10.72 


The overall proportion of Other entrées ordered during the experiment was 113/243=0.465 
113/243 = 0.465 go the expected counts of Other entrées ordered for each treatment are 
No music: 84(0.465)=39.06 French music: 75(0.465)=34.88 Italian music: 84(0.465)=39.06 
No music: 84 (0.465) 39.06 
French music: 75 (0.465) = 34.88 
Italian music: 84 (0.465) = 39.06 


Il 


The following table summarizes the expected counts for all three treatments. Note that the 
values for no music and Italian music are the same because 84 total entrées were ordered under 
each condition, and we expect the distributions of entrée choice to be the same. 


Expected counts 
Type of background music 
None French Italian Total 


. French 34.22 30.56 34.22 99 
Entrée ordered 

Italian 10.72 10.72 31 

Other 39.06 34.88 39.06 113 


Total 84 15 84 243 


We can check our work by adding the expected counts to obtain the row and column totals, as 
in the table. These should be the same as those in the table of observed counts except for small 
roundoff errors, such as 75.01 rather than 75 for the total number of entrées ordered when 


French music was playing. 

Let’s take a look at the two-way table from the restaurant study one more time. In this 
context, we found the expected count of French entrées ordered when no music was playing as 
follows: 

99 


4 -——= 34.22 
84-99243=34,.22 243 


Observed counts 
Type of background music 


French 
Italian 
Other 
Total 


Entrée ordered 


We’ve marked in the table the three numbers used in this calculation. These values are the row 
total for French entrées ordered, the column total for entrées ordered when no music was 
playing, and the table total of entrées ordered during the experiment. We can rewrite the 


original calculation as 
84-99 = wii ~ 34.99 
84-99243=99-84243=34.22 243 243 


This suggests a more general formula for the expected count in any cell of a two-way table. 


CALCULATING EXPECTED COUNTS FOR A CHI-SQUARE TEST 
BASED ON DATA IN A TWO-WAY TABLE 


When H0/40 is true, the expected count in any cell of a two-way table is 


row total - column total 
expected count =———————____ 


expected count=row total - column totaltable total table total 


Just as we did with the chi-square test for goodness of fit, we compare the observed counts 
with the expected counts using the chi-square test statistic 
x2=); (Observed count — Expected count)2Expected count 
2 _ 2 (Observed count — Expected count)? 


Expected count 
This time, the sum is over all cells (not including the totals!) in the two-way table. 


For French entrées with no music, the observed count is 30 orders and the expected count is 
2 = ; 
34.22. The contribution to the x2’ test statistic for this cell is 


(Observed count — Expected count)2Expected count=(30-34.22)234.22=0.52 
(Observed count — Expected count)? J (30—34.22) a 
Expected count 7 34.22 - 


2 Sead 
The x2 test statistic is the sum of nine such terms: 


¥2=(30-34.22)234.22+(39-30.56)230.56+---+(35—39.06)239.06=0.52+2.33+---+0.42=18.28 


, _ (30-34.22)?  (39—-30.56)? (35—39.06)? 
x 34.22 30.56 = 39.06 


= 0.52 + 2.33 + --- + 0.42 = 18.28 


AP® EXAM TIP 


In the “Do” step, you aren't required to show every term in the chi-square test 


statistic. Writing the first few terms of the sum followed by “...” is considered as 
“showing work.” We suggest that you do this and then let your calculator tackle the 


computations. 


Here is an example to practice what you have learned so far. 


EXAMPLE |Would you vote for a female president? » 
Hypotheses, expected counts, and the chi-square test statistic 


Bloomberg/Getty Images 


PROBLEM: For a class project, Abby and Mia wanted to know if the gender of an 
interviewer could affect the responses to a survey question. The subjects in their experiment 
were 100 males from their school. Half of the males were randomly assigned to be asked, 
“Would you vote for a female president?” by a female interviewer. The other half of the 
males were asked the same question by a male interviewer.2 The table shows the results. 


a. State the appropriate null and alternative hypotheses. 


b. Show the calculation for the expected count in the Male/Yes cell. Then provide a 
complete table of expected counts. 


c. Calculate the value of the chi-square test statistic. 


Gender of interviewer 


Male Female Total 

Response to Yes 69 
question No 11 
Maybe 20 

Total 50 50 100 


SOLUTION: 


a. Holo: There is no difference in the true distributions of response to this question 


when asked by a male interviewer and when asked by a female interviewer ‘for subjects 


like these. 


Ha:Ha : There is a difference in the true distributions of response to this question when 


asked by a male interviewer and when asked by a female interviewer ‘for subjects 


like these. 


69-50 


b. The expected count ‘for the Male/Yes cell is 69° 50100 = 34.5 100 — 34.0 The rest of the 


expected counts are shown in the table: 


Gender of interviewer 


Total 
Response to Yes 69 
question No fe 
Maybe 20 
Total 100 


expected count=row total - column totaltable total 


row total - column total 
table total 


expected count = 


9 _ (30-345)?  (39-34.5)? 
x = 34.5 34.5 a 


C. 42 = (30 — 34.5)234.5+(39 — 34.5)234.54+--= 4.25 = 428 


Xx2=) (Observed count — Expected count)2Expected count 
(Observed count — Expected count)” 


Expected count 
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CHECK YOUR UNDERSTANDING 


i. 


y Separate random samples of children from the United Kingdom and the United States 
who completed a survey in a recent year were selected. For each student, we recorded the 
superpower he or she would most like to have: the ability to fly, ability to freeze time, 
invisibility, super strength, or telepathy (ability to read minds). Is there convincing evidence 
that the distributions of superpower preference are different for survey takers in the two 
countries? The data are summarized in the two-way table.® 


Country 

U.K. US. Total 

Fly 99 

Superpower Freeze time 96 
preference Invisibility 67 
Super strength 43 

Telepathy 110 

Total 200 Zs 415 


1. State the appropriate null and alternative hypotheses. 


2. Show the calculation for the expected count in the U.K./Fly cell. Then provide a complete 
table of expected counts. 


3. Calculate the value of the chi-square test statistic. 


Tests for Homogeneity: Conditions and P-values 


Like every other significance test, there are conditions that must be met to justify our 
calculations and conclusions. 


| CONDITIONS FOR PERFORMING A CHI-SQUARE TEST FOR | 


GOODNESS OF FIT 


e Random: The data come from a random sample from the population of interest. 


o 10%: When sampling without replacement, n<0.10 Nn < 0.10 N for each sample. 


e Large Counts: All expected counts are at least 5. 


RF Pictures/Getty Images 
We can confirm that the conditions are met in the restaurant experiment. 
e Random: The three treatments were assigned at random. 


e Large Counts: All expected counts are at least 5 (see table below). 


We don’t have to check the 10% condition because the researchers were not sampling without 
replacement from some population of interest. They performed an experiment using customers 
who happened to be in the restaurant at the time. 


Observed counts 
Type of background music 


None French Italian Total 
French 
Entrée ordered : =e 
Italian SL 
Other 113 
Total 243 


Expected counts 
Type of background music 
None French Italian Total 


. French 34.22 30.56 34.22 99 
Entrée ordered 

Italian 10.72 10.72 31 

Other 39.06 34.88 39.06 113 


Total 84 75 84 243 


As in the test for goodness of fit, you should think of the chi-square test statistic y2X° asa 
measure of how much the observed counts deviate from the expected counts. Once again, large 
values of 2x are evidence against HO“ and in favor of Ha‘. The P-value measures the 
strength of this evidence. When the conditions are met, P-values for a chi-square test for 
homogeneity come from a chi-square distribution with 


df=(number of rows—1)x(number of columns-1) 
df = (number ofrows — 1) x (number of columns — 1) 


For the restaurant experiment, y2=18.28X° = 18.28 Because there are 3 rows and 3 columns 
in the two-way table (not including the totals), 


df=(3-1)x(3-1)=44f = (3-1) x (8-1) =4 


We can find the P-value using Table C or technology. 


Pp 
df 0025 001 
4 16.42 18.47 


e Using Table C: Look at the df=4df = 4 row in Table C. The calculated value y2=18.28 
x’ = 18.28 Jies between the critical values 16.42 and 18.47. The corresponding P-value is 
between 0.001 and 0.0025. 


Chi-square distribution 
with df = 4 


x2 = 18.28 


e Using technology: The command yx” cdf(lower:18.28, upper:10000, df:4) gives 0.0011. 


Because the P-value of 0.0011<a=0.050.0011 < a = 0.05 we reject HO“0. There is 
convincing evidence that there is a difference in the true distributions of entrées ordered at this 
restaurant when no music, French accordion music, or Italian string music is played. 


EXAMPLE | Does the gender of an interviewer matter? (>) 
Conditions, P-value, and conclusion 


sturti/Getty Images 


PROBLEM: In the preceding example, you read about an experiment to determine if the 
gender of an interviewer affects responses to the question “Would you vote for a female 
president?” Here are tables showing the observed and expected counts: 


Observed counts 


Gender of interviewer 


Male Female Total 

Response to Yes 69 
question No 11 
Maybe 20 

Total 100 

Expected counts 
Gender of interviewer 

Male Female Total 

Response to Yes 69 
question No 11 
Maybe 20 

Total 50 50 100 


Earlier, we calculated y2=4.25X" = 4,25 
a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 
c. Interpret the P-value from the calculator. 


d. What conclusion would you draw? 


SOLUTION: 


a. Random: Treatments were randomly assigned. / 


Large Counts: All expected countszscounts > 5 (see table of expected counts). 4 
g oe sa 


We don’t check the 10% condition because Abby and Mia didn’t randomly select 


subjects from some population. 


5 df = (3 - 1)(2 - 1) = 24f = (3—1)(2-—1) =2 
Using Table C: The P-value is between 0.10 and 0.15. 


Using technology: x2 cdf(lower:4.25, upper:10000, df:2) = 0.119 
x” cdf(lower : 4.25, upper : 10000, df: 2) = 0.119 


Ce Assuming that the gender of the interviewer doesn’t affect responses to this question, 
there is a 0.119 probability of observing differences in the distributions of responses as 
large as or larger than those in this study by chance alone. 


d. Because the P-value of 0.119>% = 0.05,0-119 > a= 0.05, we fail to reject Hollo There is 
not convincing evidence of a difference in the true distributions of response to this 
question when asked by a male interviewer and when asked by a female interviewer ‘for 


subjects like these. 


Chi-square distribution 
with af =2 
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Calculating the expected counts and then the chi-square test statistic by hand is a bit time- 
consuming. As usual, technology saves time and gets the arithmetic right. 


29. Technology Corner | PERFORMING CHI-SQUARE TESTS FOR 
TWO-WAY TABLES (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


You can use the TI-83/84 to perform calculations for a chi-square test for homogeneity. 
We’ll use the data from the restaurant study to illustrate the process. 


1. Enter the observed counts in matrix [A]. 


e Press (MATRIX), arrow to EDIT, and choose A. 


NAMES MATH 
Ze 


e Enter the dimensions of the matrix: 3x33 x 3 


e Enter the observed counts from the two-way table in the same locations in the matrix. 


fAK1-= 3Q 


2. Press|STAT|, arrow to TESTS, and choose y2x” -Test. Adjust your settings as shown. 
Just y: 8 


ORMAL FLOAT AUTO REAL RADIAN CL fl 


Observed: [A] 
Expected: (B] 
Color: Us 


Calculate Draw 


Note: You do not have to enter the expected counts in matrix [B]. Once you have run the 
test, the expected counts will be stored in matrix [B]. 


3. Choose “Calculate” or “Draw” to carry out the test. If you choose “Calculate,” you should 
get the test statistic, P-value, and df shown here. If you specify “Draw,” the chi-square 
distribution with 4 degrees of freedom will be drawn, the area in the tail will be shaded, 
and the P-value will be displayed. 


ORMAL FLOAT AUTO REAL RADIAN CL ih] 


ix? —Test, 
%2=18.27921151 
P=, 0010882802 
df=4 


x2-Test 
%2=18.2792 p=.0011 


4. To see the expected counts, Press (MATRIX), arrow to EDIT, and choose [B]. 


= 


uUTO : ' N i 


MATRIXCBI] 3 x3 
34.222 30.556 34.222 
10.716 9.5679 10.716 
39.062 34.877 39.062 


AP® EXAM TIP 


You can use your calculator to carry out the mechanics of a significance test on the 
AP® Statistics exam—but there’s a risk involved. If you just give the calculator 
answer without showing work, and one or more of your entries is incorrect, you will 


likely get no credit for the “Do” step. We recommend writing out the first few terms 
of the chi-square calculation followed by “...”. This approach may help you earn 
partial credit if you enter a number incorrectly. Be sure to name the procedure x2x" 


test for homogeneity) and to report the test statistic (y2=18.279) (x = 18.279) 


degrees of freedom (df=4) (df = 4) and P-value (0.0011). 


CHECK YOUR UNDERSTANDING 


hy 


y In the preceding Check Your Understanding (page 732), we presented data about 
superpower preferences for random samples of children from the United Kingdom and the 
United States. Here are the data once again: 


Country 
U.K. U.S. Total 


os 


Fly 
Superpower’ Freeze time 96 
preference — |nvisibility Bi 
Super strength 43 
Telepathy 110 
Total 200 215 415 


Verify that the conditions for inference are met. 


Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 


. Interpret the P-value from the calculator. 


Bw N 


. What conclusion would you draw? 


Putting It All Together: The Chi-Square Test for Homogeneity 


In Section 11.1, we used a chi-square test for goodness of fit to test a hypothesized model for 
the distribution of a categorical variable. When we want to compare the distribution of a 
categorical variable in several populations or for several treatments, we use a chi-square test for 


homogeneity. 


This test is also known as a chi-square test for homogeneity of proportions. We prefer the simpler 


name. 


CHI-SQUARE TEST FOR HOMOGENEITY 


Suppose the conditions are met. To perform a test of 

HO:4o : There is no difference in the distribution of a categorical variable for several 
populations or treatments 

compute the chi-square test statistic 


x2=y. ( Observed count — Expected count)2Expected count 
‘ ‘ ( Observed count — Expected count)? 
=o Expected count 


where the sum is over all cells (not including totals) in the two-way table. The P-value is the 
area to the right of yx” under the chi-square density curve with degrees of freedom = 
(number of rows — 1)(number of columns — 1). 


Let’s look at an example of a chi-square test for homogeneity from start to finish. As usual, 
we follow the four-step process when performing a significance test. 


EXAMPLE | Speaking English (>) 
The chi-square test for homogeneity 


Scott Olson/Getty Images 


PROBLEM: The Pew Research Center conducts surveys about a variety of topics in many 
different countries. In one survey, it wanted to investigate how residents of different 
countries feel about the importance of speaking the national language. Separate random 
samples of residents of Australia, the United Kingdom, and the United States were asked 
many questions, including the following: “Some people say that the following things are 
important for being truly [survey country nationality]. Others say they are not important. 
How important do you think it is to be able to speak English?” The two-way table 
summarizes the responses to this question. 


Country 
Australia U.K. U.S. Total 
Very important 2569 
Somewhat 713 
Opinion about important 
speaking Not very 118 
English — important 
Not at all 63 
important 
Total 1000 1460 1003 3463 


Do these data provide convincing evidence at the a=0.05a = 0.05 level that the distributions 
of opinion about speaking English differ for residents of Australia, the U.K., and the U.S.? 


SOLUTION: 


STATE: Ho:lo : There is no difference in the true distributions of opinion about speaking 
English for residents of Australia, the U.K., and the ‘US. 


Use the four-step process! 


Ha:Ha : There is a difference in the true distributions of opinion about speaking English 
for residents of Australia, the U.K., and the ‘US. 


We'll use « = 0.05. = 0.05. 
PLAN: Chi-square test ‘for homogeneity. 
e Random: Independent random samples of residents from the three countries. / 


© 10%: 1000 is<10%10% : 1000 is < 10% of all Australian residents, 1460 is<10% 
1460 is < 10% of all UK. residents, and 1003 is<10%1003 is < 10% of all U.S. 
residents. / 


e Large Counts: All expected counts are =>5= 5 (see table below). 4 


Expected counts Country 


Australia U.K. U.S. Total 
Very 2569 
important 
ie Somewhat 713 
Opinion about | 
important 
speaking 
English ‘Not very ue 
important 
Not at all 63 
important 
Total 1000 1460 1003 3463 
DO: 


e Test statistic: 


y2 = (690 — 741.8)2741.8+(1177 — 1083.1)21083.1+:** = 68.57 


2 _ (690~741.8)? | (1177—1083.1)? __ gs 
xX" = sare + Sos OH = 88.57 
© P-valuedf = (4 - 1)(3 - 1) = 6P-value: df = (4—1) (3-1) =6 
Using Table C: P-value<o.ooosP — value < 0.0005 


Using technology: y2cdf(lower:68.57, upper:10000, df:6)~0 
x cdf (lower : 68.57, upper : 10000, df : 6) ~ 0 


CONCLUDE: Because the P-value of approximately O<g = 0.05,0 < a= 0.05, we reject 
Hoo There is convincing evidence that there is a difference in the true distributions of 
opinion about speaking English for residents of Australia, the U.K., and the ‘US. 


Because the observed counts differ from the expected counts, there is some evidence 


for Ha/ta 


Chi-square distribution 
with df =6 


If you want to know how the distributions differ, do a follow-up analysis. 


FOR PRACTICE, TRY EXERCISE 35 


AP® EXAM TIP 


Many students lose credit on the AP® Statistics exam because they don’t write 


down and label the expected counts in their response. It isn’t enough to claim that 
all the expected counts are at least 5. You must provide clear evidence. 


What if we want to compare several proportions? Many studies involve comparing the 
proportion of successes for each of several populations or treatments. The two-sample z test 
from Chapter 10 allows us to test the null hypothesis HO: p1=p2/40 : Pi = P2 where p1P1 and 
p2P2 are the true proportions of successes for the two populations or treatments. The chi-square 
test for homogeneity allows us to test HO: p1=p2=...=pko : Pi = P2 = --- = Pk This null 
hypothesis says that there is no difference in the proportions of successes for the k populations 
or treatments. The alternative hypothesis is Ha:/4« : At least two of the pi’sPi S$ are different. 
Many students incorrectly stateHaa as “all the proportions are different.” Think about it this 
way: the opposite of “all the proportions are equal” is “some of the proportions are not equal.” 


FOLLOW-UP ANALYSIS The chi-square test for homogeneity allows us to compare the 
distribution of a categorical variable for any number of populations or treatments. If the test 


allows us to reject the null hypothesis of no difference, we may want to do a follow-up analysis 
that examines the differences in detail. As with the chi-square test for goodness of fit, start by 
identifying the cells that contribute the most to the chi-square statistic. Then describe how the 
observed and expected counts differ in those categories, noting the direction of the difference. 


Our earlier restaurant study found significant differences among the distributions of entrées 
ordered under each of the three music conditions. We entered the two-way table for the study 
into Minitab software and requested a chi-square test. The output appears in Figure 11.6. 
Minitab repeats the two-way table of observed counts and puts the expected count for each cell 
below the observed count, followed by the nine individual components that contribute to the y2 


9 
Xx test statistic. 
Chi-Square Test: None, French, Italian 


Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


None French Italian Total 

French 39 30 99 
entrée M 30.56 34.22 
2.334 0.521 
Italian 1 19 


entrée ; 9257 10.72 
7.672 6.404 

Other 4 30 
entrée : F 39.06 
0.422 

84 


P-Value = 


FIGURE 11.6 Minitab output for the two-way table in the restaurant study. The output gives the 
observed counts, the expected counts, and the individual components of the chi-square test 
statistic. 


Looking at the output, we see that just two of the nine components contribute about 14 
(almost 77%) of the total y2=18.28X" = 18.28 Comparing the observed and expected counts in 
these two cells, we see that orders of Italian entrées are far below what we expect when French 
music is playing and far above what we expect when Italian music is playing. We are led to a 
specific conclusion: orders of Italian entrées are strongly affected by Italian and French music. 
More advanced methods provide tests and confidence intervals that make this follow-up 
analysis more complete. 


CHECK YOUR UNDERSTANDING 


i. 


y Canada has universal health care. The United States does not but often offers more 
elaborate treatment to patients who have access. How do the two systems compare in treating 
heart attacks? Researchers compared random samples of U.S. and Canadian heart attack 


patients. One key outcome was the patients’ own assessment of their quality of life relative 
to what it had been before the heart attack. Here are the data for the patients who survived a 
year: 


Quality of life Canada United States 
Much better 09 541 
Somewhat “1 498 
better 

About the same 96 779 
Somewhat 50 282 
worse 

Much worse 19 65 
Total 311 2165 


Is there a significant difference between the two distributions of quality-of-life ratings? 
Carry out an appropriate test at the a=0.010 = 0.01 level. 


Relationships Between Two Categorical Variables 


Two-way tables can summarize data from different types of studies. The restaurant experiment 
compared entrées ordered for three music treatments. The observational study about speaking 
English compared independent random samples from three different populations. In both cases, 
we are comparing the distribution of a categorical variable for several populations or 
treatments. We use the chi-square test for homogeneity to perform inference in such settings. 

Another common situation that leads to a two-way table is when a single random sample of 
individuals is chosen from a single population and then classified based on two categorical 
variables. In that case, our goal is to analyze the relationship between the variables. 


Francesco Carta fotografo/Getty Images 


Are people who are prone to sudden anger more likely to develop heart disease? An 
observational study followed a random sample of 8474 people with normal blood pressure for 
about four years.2 All the individuals were free of heart disease at the beginning of the study. 
Each person took the Spielberger Trait Anger Scale test, which measures how prone a person is 


to sudden anger. Researchers also recorded whether each individual developed coronary heart 
disease (CHD). This includes people who had heart attacks, as well as those who needed 
medical treatment for heart disease. Here is a two-way table that summarizes the data: 


Anger level 
Low Moderate High Total 
Yes 
CHD status le 
No 8284 
Total 3110 4731 633 8474 


The bar graph in Figure 11.7 shows the percent of people in each of the three anger 
categories who developed CHD. There is a clear trend: as the anger score increases, so does the 
percent who suffer heart disease. A much higher percent of people in the high anger category 
developed CHD (4.27%) than in the moderate (2.33%) and low (1.70%) anger categories. 
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FIGURE 11.7 Bar graph comparing the percents of people in each anger category who 
developed coronary heart disease (CHD). 


Do these data provide convincing evidence of an association between the variables in the 
larger population? Or is it plausible that there is no association between the variables in the 
population and that we observed an association in the sample by chance alone? To answer that 
question, we rely on a new significance test. 


Tests for Independence: Stating Hypotheses 


When we gather data from a single random sample and measure two categorical variables, we 
are often interested in whether the sample data provide convincing evidence that the variables 
have an association in the population. That is, does knowing the value of one variable help 
predict the value of the other variable for individuals in the population? To determine if 
evidence from the sample is convincing, we perform a chi-square test for independence. 


In this test, our null hypothesis is that there is no association between the two categorical 
variables in the population of interest. The alternative hypothesis is that there is an association 


between the variables. For the observational study of anger level and coronary heart disease, 
we want to test the hypotheses 


HO0:4o : There is no association between anger level and heart-disease status in the 
population of people with normal blood pressure. 


Ha:/4a * There is an association between anger level and heart-disease status in the 
population of people with normal blood pressure. 


We could substitute the word dependent in place of not independent in the alternative hypothesis. 


We'll avoid this practice, however, because saying that two variables are dependent sounds too 
much like saying that changes in one variable cause changes in the other. 


No association between two variables means that knowing the value of one variable does 
not help us predict the value of the other. That is, the variables are independent. An equivalent 
way to state the hypotheses is 


HO: : Anger and heart-disease status are independent in the population of people with 
normal blood pressure. 


Ha:/4a * Anger and heart-disease status are not independent in the population of people 
with normal blood pressure. 


Tests for Independence: Expected Counts 


As with the two previous types of chi-square tests, we begin by comparing the observed counts 
in a two-way table with the expected counts if HO/0 is true. In the anger study, the null 
hypothesis is that there is no association between anger level and heart-disease status in the 
population of interest. If we assume that HO/0 is true, then anger level and CHD status are 
independent. We can find the expected cell counts in the two-way table using the definition of 
independent events from Chapter 5: P(A|B)=P(A)” (A|B) = P(A) The chance process here is 
randomly selecting a person and recording his or her anger level and CHD status. 


Anger level 
Low Moderate High Total 
Yes 
CHD status mae 
No 8284 
Total 3110 4731 633 8474 


Let’s start by considering the events “Yes” and “Low anger.” We see from the two-way 
table that 190 of the 8474 people in the study had CHD. If we imagine choosing one of these 
people at random, P(Yes)=190/8474P (Yes) = 190/8474 t¢ the null hypothesis is true and anger 
level and CHD status are independent, knowing that the selected individual is low anger does 
not change the probability that this person develops CHD. That is to say, 


P(Yes|Low anger)=P(Yes)=190/8474=0.02242 
P (Yes|Low anger) = P (Yes) = 190/8474 = 0.02242 
Of the 3110 low-anger people in the study, we’d expect 
190 


110 -—— = 3110 (0.02242) = 69.73 
3110-1908474=3110(0.02242)=69.73 8474 


to get CHD. You can see that the general formula we developed earlier for a test for 
homogeneity applies in this situation also: 


expected count=row total - column totaltable total=190-31108474=69.73 
row total- column total 190-3110 | 69.73 


ted {= woe neesietatatee! 
—on table total 8474 


You can complete the table of expected counts by using the formula for each cell or by 
using the formula for some cells and subtracting to find the remaining cells. Here is the 
completed table of expected counts: 


Expected counts 
Anger level 
Low Moderate High Total 


Yes 69.73 106.08 14.19 190 
CHD status 
No 3040.27 4624.92 618.81 8284 


Total 3110 4731 633 8474 


Tests for Independence: Conditions and Calculations 


The 10% and Large Counts conditions for the chi-square test for independence are the same as 
for the test for homogeneity. There is a slight difference in the Random condition for the two 
tests: a test for independence uses data from a single random sample, but a test for homogeneity 
uses data from two or more independent random samples or from two or more groups in a 
randomized experiment. 


CONDITIONS FOR PERFORMING A CHI-SQUARE TEST FOR 
INDEPENDENCE 


e Random: The data come from a random sample from the population of interest. 


° 10%: When sampling without replacement, n<0.10 Nn < 0.10 N 


e Large Counts: All expected counts are at least 5. 


We can confirm that the conditions are met in the anger and heart disease study. 


e Random: The data came from a random sample of 8474 people with normal blood pressure. 


© 10%: It is reasonable to assume that 8474<10%8474 < 10% of all people with normal 
blood pressure. 


e Large Counts: All the expected counts are at least 5 (see table on previous page). 
2 see 
When the conditions are met, we use the familiar y2% test statistic to measure the strength 
of the association between the variables in the sample. P-values for this test come from a chi- 
square distribution with 


df=(number of rows—1)(number of columns-—1) 
df = (number of rows — 1) x (number of columns — 1) 


For the anger and heart disease study, 


e Test statistic: y2=(53-69.73)269.73+(110-106.08)2106.08+---=16.077 

9 _ (5369.73) (110—106.08)” _ a% 

x = "$0.73 106.08 +++ = 16.077 
e Using technology: With df=(2-1)(3-1)=24F = (2—1)(3—1) =2) p_value=0.00032 
P — value = 0.00032 

Because the P-value of 0.00032<a=0.050.00032 < a = 0.05 we reject HOM0. We have 
convincing evidence of an association between anger level and heart-disease status in the 
population of people with normal blood pressure. 

A follow-up analysis reveals that two cells contribute most of the chi-square test statistic: 
Low anger, Yes (4.014) and High anger, Yes (11.564). A much smaller number of low-anger 
people developed CHD than expected. And a much larger number of high- anger people got 
CHD than expected. 


Chi-square distribution 
with df =2 


0 2 4 6 8&8 1 12 14 16\18 2 
y? = 16.077 


Can we conclude that proneness to anger causes heart disease? No. The anger and heart- 
disease study is an observational study, not an experiment. It isn’t surprising that some other 
variables are confounded with anger level. For example, people prone to anger are more likely 
than others to be men who drink and smoke. We don’t know whether the increased rate of heart 
disease among those with higher anger levels in the study is due to their anger or perhaps to 
their drinking and smoking or maybe even to gender. 


Putting It All Together: The Chi-Square Test for Independence 


When we want to test for an association between two categorical variables in a population, we 
use a chi-square test for independence. Here are the key details. 


The chi-square test for independence is also known as the chi-square test for association. 


CHI-SQUARE TEST FOR INDEPENDENCE 


Suppose the conditions are met. To perform a test of 
HO0:4o : There is no association between two categorical variables in the population of 
interest 


compute the chi-square test statistic 
x2=)y. (Observed count — Expected count)2Expected count 
7 2 (Observed count — Expected count)? 
sie Expected count 


where the sum is over all cells in the two-way table (not including the totals). The P-value is 
the area to the right of 2x” under the chi-square density curve with degrees of freedom = 
(number of rows — 1)(number of columns — 1). 


We’re now ready to perform a complete test for independence. 


EXAMPLE | Snowmobiles in Yellowstone (>) 
Chi-square test for independence 


korolOK/Depositphotos 


PROBLEM: In Chapter 1, you read about a random sample of winter visitors to 
Yellowstone National Park. Each of the 1526 visitors in the sample was asked two questions: 


1. Do you belong to an environmental club (like the Sierra Club)? 


2. What is your experience with a snowmobile: own, rent, or never used? 
The two-way table summarizes the results. 


Environmental club status 


Not a member Member Total 

Snowmobile Never used 657 
experience — Renter 574 
Total 1221 305 1526 


Do the data provide convincing evidence of an association between environmental club 
status and type of snowmobile use in the population of winter visitors to Yellowstone 
National Park? 


SOLUTION: 


Use the four-step process! 


STATE: 


Ho:Ho : There is no association between environmental club status and type of snowmobile 
use in the population of winter visitors to Yellowstone. 


You could also say Ho:/40 : Environmental club status and type of snowmobile use 


are independent in the population of winter visitors to Yellowstone. 


Ha:Ha : There is an assoctation between environmental club status and type of snowmobile 
use in the population of winter visitors to Yellowstone. 


We'll use « = 0.05. = 0.05. 


If no significance level is provided, use a=0.05a = 0.05 


PLAN: Chi-square test ‘for independence. 


© Random: Random sample of 1526 winter visitors to Yellowstone. 4 


We know this is a test for independence and not homogeneity because there was 


only one random sample. 


© 10%: It is reasonable to assume that 1526<10%1526 < 10% of all winter visitors to 


Yellowstone. / 


e Large Counts: All the expected counts are at least 5 (see table below). 


Expected counts 
Environmental club status 
Not a member Member Total 
Snowmobile Never used 657 
experience Renter S74 
Owner 295 
ae, 1221 305 1526 


Because the observed counts differ from the expected counts, there is some evidence 


for Ho. 


DO: 
© Test statistic: y2 = (445 — 525.7)2525.7+(212 — 131.3)2131.3+--- = 116.6 
9 _ (445-525.7)?  (212—131.3)? 
525.7 131.3 
= 116.6 


© P-value: df = (3 - 1)(2 - 1) = 24f = (3—1)(2—1) =2 

Using Table C: P-value<o.ooosP — value < 0.0005 

Using technology: The calculator’s yor’ -Test gives x2 = 116.6X" = 116.6 and P-value = 
4.82X10-26P — value = 4.82 x 10°*° using df = pay = 25 

CONCLUDE: Because the P-value of approximately O<a = 0.050 < a= 0.05 we reject Ho 


Ho We have convincing evidence of an association between environmental club status and 
type of snowmobile use in the population of winter visitors to Yellowstone National Park. 
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x2-Test 
x2=116.5883 p=8 
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AP® EXAM TIP 


When the P-value is very small, the calculator will report it using scientific notation. 
Remember that P-values are probabilities and must be between 0 and 1. If your 
calculator reports the P-value with a number that appears to be greater than 1, look 
to the right, and you will see that the P-value is being expressed in scientific 
notation. If you claim that the P-value is 4.82, you will certainly lose credit. 


CHECK YOUR UNDERSTANDING 


‘. 


v A random sample of 1200 U.S. college students was asked, “What is your perception of 
your own body? Do you feel that you are overweight, underweight, or about right?” The 
two-way table summarizes the data on perceived body image by gender.2 


Gender 
Female Male Total 
About right 855 
Body image Overweight 235 
Underweight 110 
Total 760 440 1200 


Do these data provide convincing evidence at the a=0.010 = 0.01 level of an association 
between gender and perceived body image in the population of U.S. college students? 


Using Chi-Square Tests Wisely 


Both the chi-square test for homogeneity and the chi-square test for independence start with a 
two-way table of observed counts. They even calculate the test statistic, degrees of freedom, 
and P-value in the same way. The questions that these two tests answer are different, however. 
A chi-square test for homogeneity tests whether the distribution of a categorical variable is the 
same for each of several populations or treatments. The chi-square test for independence tests 
whether two categorical variables are associated in some population of interest. 


One way to help you distinguish these two tests is to consider the two sets of totals in the 
two-way table. In tests for homogeneity, one set of totals is known by the researchers before 
the data are collected. For example, in the experiment to determine if the gender of the 
interviewer affects the response to a question about a female president, Abby and Mia decided 
in advance to randomly assign 50 subjects to each treatment. 


Gender of interviewer 


Male Female Total 
Response to Yes 2 
question No , 
Maybe ? 

Total 100 


Likewise, in the observational study comparing opinions about speaking English, 
researchers knew in advance that they would survey 1000 people from Australia, 1460 from the 
U.K., and 1003 from the U.S. In both cases, only one set of totals was left to vary. This is 
consistent with the design of the study: select independent random samples (or randomly assign 
treatments) and compare the distribution of a single categorical variable. 


However, in a test for independence, neither set of totals is known in advance. In the 
observational study about snowmobile use in Yellowstone, the researchers didn’t know 
anything about either variable ahead of time—they only knew that they would survey 1526 
visitors. This is consistent with the design of the study: select one sample and record the values 
of two variables for each member. 


Environmental club status 


Not a member Member Total 
Never used ? 
Snowmobile Ranier 5 
experience 
Owner ? 
Total 1526 


Unfortunately, it is quite common to see questions about association when a test for 
homogeneity applies; when a test for independence applies, questions about differences 
between proportions or the distribution of a variable are quite common. Many people avoid the 
distinction altogether and pose questions about the “relationship” between two variables. 

Instead of focusing on the question asked, the best plan is to consider how the data were 
produced. If the data come from two or more independent random samples or treatment groups 
in a randomized experiment, then do a chi-square test for homogeneity. If the data come from a 
single random sample, with the individuals classified according to two categorical variables, 
use a chi-square test for independence. 


EXAMPLE | Scary movies and fear (>) 
Choosing the right type of chi-square test 


Alamy 


PROBLEM: Are men and women equally likely to suffer lingering fear from watching 
scary movies as children? Researchers asked a random sample of 117 college students to 
write narrative accounts of their exposure to scary movies before the age of 13. More than 
one-fourth of the students said that some of the fright symptoms are still present when they 
are awake. The following table breaks down these results by gender. 


Gender 
Male Female Total 
Fright Yes 36 
symptoms? No 81 
Total 38 79 117 


Assume that the conditions for performing inference are met. Minitab output for a chi-square 
test using these data is shown. 


Chi-Square Test: Male, Female 
Expected counts are printed below observed counts 
Chi-Square contributions are printed below expected counts 


Male Female Total 
Yes 7 29 36 
11.69 Beas AL, 
noose 0.906 
No eal 50 81 
Asie cial 54.69 
0.837 0.403 
Total Sie) 79 117 
Chi-Sq = 4.028, DF = 1, P-Value = 0.045 


a. Should a chi-square test for independence or a chi-square test for homogeneity be used in 
this setting? Explain your reasoning. 


b. State an appropriate pair of hypotheses for researchers to test in this setting. 


c. Which cell contributes most to the chi-square test statistic? In what way does this cell 
differ from what the null hypothesis suggests? 


d. Interpret the P-value. What conclusion would you draw at a=0.01?a = 0.01? 
SOLUTION: 


a, € hi-square test for independence. The data were produced using a single random 
sample of college students, who were then classified according to two variables: gender 
and whether or not they had lingering fright symptoms. The chi-square test ‘for 


homogeneity requires independent random samples from each population. 


In this setting, the researchers wouldn't know either set of totals until the data were 


collected. They would know only the overall total (117) in advance. Thus, a test for 
independence is the correct choice. 


b. Ho:o : There is no association between gender and whether or not college students 
have lingering fright symptoms. 
Ha:Ha : There is an assoctation between gender and whether or not college students 
have lingering fright symptoms. 

c. Men who admit to having lingering fright symptoms account ‘for the largest component 


of the chi-square test statistic (1.883). Far fewer men in the sample admitted to 
lingering fright symptoms (7) than we would expect if Hoo were true (11.69). 


d. ay there is no association between gender and whether or not college students have 
lingering fright symptoms, there is a 0.045 probability of obtaining an association as 
strong as or stronger than the one observed in the random sample of 117 students. 
Because the P-value of 0.045+« = 0,019-045 + a = 0.01 we fail to reject Hoo. We do not 
have convincing evidence that there is an association between gender and whether or not 


college students have lingering fright symptoms. 


FOR PRACTICE, TRY EXERCISE 49 


WHAT IF WE WANT TO COMPARE TWO PROPORTIONS? Shopping at secondhand 
stores is becoming more popular and has even attracted the attention of business schools. A 
study of customers’ attitudes toward secondhand stores interviewed separate random samples 
of shoppers at two secondhand stores of the same chain in different cities. The two-way table 
shows the breakdown of respondents by gender.12 


Store 
A B Total 


Gender Male 106 
Female goo 
Total 241 218 459 


Do the data provide convincing evidence of a difference in the distributions of gender for 
shoppers at these two stores? To answer this question, we could perform a chi-square test for 
homogeneity. Our hypotheses are 


HO:o : There is no difference in the distributions of gender for shoppers at these two 
stores. 


Ha:/4a : There is a difference in the distributions of gender for shoppers at these two stores. 


A difference in distribution of gender would mean that there is a difference in the true 
proportion of female shoppers at the two stores. So we could also use a two-sample z test from 
Section 10.1 to compare two proportions. The hypotheses for this test are 


Ho: pa — pp = 0 
HO: pA-pB=0Ha: pA-pB<0 Ma: Pa — Pp #9 


where pAP4 and pBP2# are the true proportions of female shoppers at Store A and Store B, 
respectively. 


The TI-84 screen shots show the results from a two-sample z test for pA—pBP4 ~ PB and 
from a chi-square test for homogeneity. (We verified that the Random, 10%, and Large Counts 
conditions were met before carrying out the calculations.) 
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P1i*P2 X2=15.33407007 
z=3.915874113 pP=9,0077244eE-5 
P=9.0114092eE-5 df=1 

i1=. 8423236515 


62=.6880733945 
b=. 7690631808 
ni=241 
n2=218 


Note that the P-values from the two tests are the same except for rounding errors. You can 
also check that the chi-square test statistic is the square of the two-sample z statistic: 
(3.915---)2=15,.334(3-915 --+)° = 15.334, 

As the preceding example suggests, the chi-square test for homogeneity based on a 2x2 
2 x 2 table is equivalent to the two-sample z test for p1—p2?1 ~ P2 with a two-sided alternative 
hypothesis. However, there are other settings where only one of the options is valid: 


e If the two-way table is larger than 2x22 x 2 the only option is a chi-square test for 
homogeneity. 


e If the table is 2x22 x 2 and the alternative hypothesis is one-sided, use a two-sample z test 
for a difference in proportions rather than a chi-square test. 


e If the table is 2x22 x 2 and you want to construct a confidence interval for a difference in 


proportions, the only option is a two-sample z interval. 
Because of the points made in the last two bullets, we recommend the Chapter 10 methods 
for comparing two proportions whenever you are given a choice. 


GROUPING QUANTITATIVE DATA INTO CATEGORIES As we mentioned in Chapter 
1, it is possible to convert a quantitative variable to a categorical variable by grouping together 
intervals of values. Here’s an example. Researchers surveyed independent random samples of 
shoppers at two secondhand stores of the same chain in different cities. The two-way table 
summarizes data on the incomes of the shoppers in the two samples. 


Store 

Total 
Under $10,000 132 
$10,000 to $19,999 115 

Income 

$20,000 to $24,999 119 
$25,000 to $34,999 41 
$35,000 or more 41 
Total 459 


Personal income is a quantitative variable. But by grouping the values of this variable, we 
create a categorical variable. We could use these data to carry out a chi-square test for 
homogeneity because the data came from independent random samples of shoppers at the two 
stores. Comparing the distributions of income for shoppers at the two stores would give more 
information than simply comparing their mean incomes. 


WHAT IF SOME OF THE EXPECTED CELL COUNTS ARE LESS THAN 5? Let’s 
look at a situation where this is the case. A sample survey asked a random sample of young 
adults, “Where do you live now? That is, where do you stay most often?” Here is a two-way 
table of all 2984 people in the sample (both men and women) classified by their age and by 
where they lived.44 Living arrangement is a categorical variable. Even though age is 
quantitative, the two-way table treats age as categorical by dividing the young adults into four 
categories. The table gives the observed counts for all 20 combinations of age and living 
arrangement. 


Age (years) 


19 20 21 22 Total 
Parents’ home 1357 
Another 162 
person’s 
— home 
Living arrangement 
Your own 1254 


place 


Group 58 60 49 25 192 
quarters 


Other 
Total 


Our null hypothesis is HO:0 : There is no association between age and living arrangement 
in the population of young adults. The following table shows the expected counts assuming HO 
Ho is true. We can see that two of the expected counts (circled in red) are less than 5. This 
violates the Large Counts condition. 


Age (years) 


19 20 Zi 22 Total 

Parents’ home| 245.57 | 348.35 | 364.26 | 398.82 | 1357 
Another 29.32 | 41.59 | 43.49 | 47.61 162 
person’s 

as home 

Living arrangement 
Your own 226.93| 321.90} 336.61] 368.55| 1254 
place 


Group 


34.75 | 49.29 51.54 56.43 192 
quarters 


Other | 8.44) | 4.88 19 


Total 540 766 801 877 | 2984 


To make all of the expected counts 5 or more, a clever strategy is to “collapse” the table by 
combining two or more rows or columns. In this case, it might make sense to combine the 
“Group quarters” and “Other” living arrangements. Doing so and then running a chi-square test 
in Minitab gives the following output. Notice that the Large Counts condition is now met. 


Chi-Square Test: 19, 20, 21, 22 
Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


19 20 21 22 Total 
Parent’s home 324 378 337 318 1357 
245.57 348.35 364.26 398 .82 
25.049 2.925 2.04 16.379 
Another home 37 47 40 38 162 
29.32 ALi 39 43.49 47.61 
2.014 0.705 0.279 1.94 
Own place 116 279 372 487 1254 
226.93 321.9 336.61 368.55 
94.226 Da 1i9 S.i2 38.068 
63 62 52 34 211 


Other 


38.18 54.16 56.64 62.01 


16.129 1.134 0.38 12.654 
Total 540 766 801 87/ 2984 
DF = Q, P-Value = 0.000 


Chi-Sq = 182.961, 


Section 11.2 Summary 


We use the chi-square test for homogeneity to compare the distribution of a single 
categorical variable for each of several populations or treatments. The null hypothesis is that 
there is no difference in the distribution of the categorical variable for each of the 
populations or treatments. 


The conditions for performing a chi-square test for homogeneity are: 


« Random: The data come from independent random samples or groups in a randomized 
experiment. 


© 10%: When sampling without replacement, n < 0.10 N for each sample. 
: Large Counts: All expected counts must be at least 5. 


We use the chi-square test for independence to test the association between two categorical 
variables. The null hypothesis is that there is no association between the two categorical 
variables in the population of interest. Another way to state the null hypothesis is that the 
two categorical variables are independent in the population of interest. 


The conditions for performing a chi-square test for independence are: 

= Random: The data come from a random sample from the population of interest. 
© 10%: When sampling without replacement, n < 0.10 N. 

: Large Counts: All expected counts must be at least 5. 

The expected count in any cell of a two-way table when Hg is true is 


expected count = row total - column total 
table total 


expected count=row total - column totaltable total 


The chi-square test statistic is 


Xx2=); (Observed count — Expected count)2Expected count 
(Observed count — Expected count)? 
} — , aaron 


Expected count 
where the sum is over all cells in the two-way table (not including the totals). 


Calculate the P-value by finding the area to the right of yx under the chi-square density 
curve with (number of rows—1)(number of columns-1) 


(number of rows — 1)(number of columns — 1) degrees of freedom. 


If the test finds a statistically significant result, consider doing a follow-up analysis that 


looks for the largest components of the chi-square test statistic and compares the observed 
and expected counts in the corresponding cells. 


11.2 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 
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Section 11.2 Exercises 


27. The color of candy Inspired by the example about how background music influences 
choice of entrée at a restaurant, a statistics student decided to investigate other ways to 
influence a person’s behavior. Using 60 volunteers, she randomly assigned 20 volunteers 
to get a “red” survey, 20 volunteers to get a “blue” survey, and 20 volunteers to get a 
control survey. The first three questions on each survey were the same, but the fourth and 
fifth questions were different. For example, the fourth question on the “red” survey was 
“When you think of the color red, what do you think about?” On the blue survey, the 
question replaced red with blue. On the control survey, the last two questions were not 
about color. As a reward, each volunteer was allowed to choose a chocolate candy in a red 
wrapper or a chocolate candy in a blue wrapper. Here are segmented bar graphs showing 
the results of the experiment. Describe what you see. 


100 
C| Red candy 
80 ie Blue candy 
60 
40 
20 
0 


Percent 


Red Blue Control 
survey survey survey 
Type of survey 


28. Python eggs How is the hatching of water python eggs influenced by the temperature of 
the snake’s nest? Researchers randomly assigned newly laid eggs to one of three water 
temperatures: hot, neutral, or cold. Hot duplicates the extra warmth provided by the 
mother python, and cold duplicates the absence of the mother./2 Here are segmented bar 
graphs showing the results of the experiment. Describe what you see. 


Hatched? 


fo No 
fy Yes 


Relative frequency 


Cold Neutral Hot 
Water temperature 


29. pg 731 |») More candy The two-way table shows the results of the experiment 


described in Exercise 27. 


C. 


Survey type 
Red Blue Control Total 
Color of Red 26 
candy Blue 34 
Total 20 20 20 60 


State the appropriate null and alternative hypotheses. 


Show the calculation for the expected count in the Red/Red cell. Then provide a 
complete table of expected counts. 


Calculate the value of the chi-square test statistic. 


30. More pythons The two-way table shows the results of the experiment described in 
Exercise 28. 


C. 


Water temperature 


Total 

Hatching Yes 129 
status No 58 
Total 27 56 104 187 


. State the appropriate null and alternative hypotheses. 


. Show the calculation for the expected count in the Cold/Yes cell. Then provide a 


complete table of expected counts. 


Calculate the value of the chi-square test statistic. 


31. pg 734 |») Last candy Refer to Exercises 27 and 29. 


a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2x” cdf command. 
c. Interpret the P-value from the calculator. 


d. What conclusion would you draw using a=0.01?a = 0.01? 


32. Last python Refer to Exercises 28 and 30. 


a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2x° cdf command. 
c. Interpret the P-value from the calculator. 


d. What conclusion would you draw using a=0.10?a = 0.10? 


33. Sorry, no chi-square How do U.S. residents who travel overseas for leisure differ from 


34. 


those who travel for business? The following is the breakdown by occupation.12 


Occupation Leisure travelers (%) Business travelers (%) 
Professional/technical 36 39 
Manager/executive 23 48 
Retired 14 3 
Student 7 3 
Other 20 v4 
Total 100 100 


Explain why we can’t use a chi-square test to learn whether these two distributions differ 
significantly. 


Going nuts The UR Nuts Company sells Deluxe and Premium nut mixes, both of which 
contain only cashews, Brazil nuts, almonds, and peanuts. The Premium nuts are much 
more expensive than the Deluxe nuts. A consumer group suspects that the two nut mixes 
are really the same. To find out, the group took separate random samples of 20 pounds of 
each nut mix and recorded the weights of each type of nut in the sample. Here are the 
dati 


Type of mix 
Type of nut Premium Deluxe 
Cashew 6 Ib 5 |b 
Brazil nut 3 Ib 4 |b 
Almond 5 Ib 6 Ib 
Peanut 6 Ib 5 Ib 


Explain why we can’t use a chi-square test to determine whether these two distributions 
differ significantly. 


35. 


36. 


of. 


pg 738 |») Gummy bears Courtney and Lexi wondered if the distribution of color was 


the same for name-brand gummy bears (Haribo Gold) and store-brand gummy bears 
(Great Value). To investigate, they randomly selected 6 bags of each type and counted the 
number of gummy bears of each color? Here are the data: 


Brand 

Name Store Total 

Red 349 

Green 157 

Color Yellow 135 
Orange 208 

White 146 

Total 373 622 995 


Do these data provide convincing evidence that the distributions of color differ for name- 
brand gummy bears and store-brand gummy bears? 

How are schools doing? The nonprofit group Public Agenda conducted telephone 
interviews with three randomly selected groups of parents of high school children. There 
were 202 black parents, 202 Hispanic parents, and 201 white parents. One question asked, 
“Are the high schools in your state doing an excellent, good, fair, or poor job, or don’t you 
know enough to say?” Here are the survey results:4© 


Parents’ race/ethnicity 


Total 

Excellent 68 

Opinion Good 205 
about high Fair 196 
schools Poor 72 
Don’t know 64 

Total 605 


Do these data provide convincing evidence that the distributions of opinion about high 
schools differ for the three populations of parents? 


How to quit smoking It’s hard for smokers to quit. Perhaps prescribing a drug to fight 
depression will work as well as the usual nicotine patch. Perhaps combining the patch and 
the drug will work better than either treatment alone. Here are data from a randomized, 
double-blind trial that compared four treatments./Z A “success” means that the subject did 
not smoke for a year following the start of the study. 


Group Treatment Subjects Successes 
1 Nicotine patch 244 40 


2 Drug 244 74 
2 Patch plus drug 245 87 
4 Placebo 160 25 


a. Summarize these data in a two-way table. 


b. Do the data provide convincing evidence of a difference in the effectiveness of the four 
treatments at the a=0.05a = 0.05 significance level? 


38. Preventing strokes Aspirin prevents blood from clotting and so helps prevent strokes. 
The Second European Stroke Prevention Study asked whether adding another anticlotting 
drug named dipyridamole would be more effective for patients who had already had a 
stroke. Here are the data on strokes during the two years of the study:48 


Group Treatment Number of patients Number who had a stroke 
i Placebo 1649 250 
2 Aspirin 1649 206 
rs) Dipyridamole 1654 211 
4 Both 1650 157 


a. Summarize these data in a two-way table. 


b. Do the data provide convincing evidence of a difference in the effectiveness of the four 
treatments at the a=0.05a = 0.05 significance level? 


39. How to quit smoking Refer to Exercise 37. Which treatment seems to be most effective? 
Least effective? Justify your choices. 


40. Preventing strokes Refer to Exercise 38. Which treatment seems to be most effective? 
Least effective? Justify your choices. 


41. Relaxing in the sauna Researchers followed a random sample of 2315 middle-aged men 
from eastern Finland for up to 30 years. They recorded how often each man went to a 
sauna and whether or not he suffered sudden cardiac death (SCD). The two-way table 
shows the data from the study.22 


Weekly sauna frequency 


1 or fewer 2-3 4 or more Total 
Yes 
SCD ae 
No 2125 
Total 601 1513 201 2315 


a. State appropriate hypotheses for performing a chi-square test for independence in this 
setting. 


b. Compute the expected counts assuming that HO/“0 is true. 


c. Calculate the chi-square test statistic, df, and P-value. 


d. What conclusion would you draw? 


42. Is astrology scientific? The General Social Survey (GSS) asked a random sample of 
adults their opinion about whether astrology is very scientific, sort of scientific, or not at 
all scientific. Here is a two-way table of counts for people in the sample who had three 


levels of higher education:22 
Degree held 
Associate’s | Bachelor’s Master’s Total 
a Not at all 539 
Opinion scientific 
about 
astrology Very or sort 148 
of scientific 
Total 234 321 132 687 


a. State appropriate hypotheses for performing a chi-square test for independence in this 
setting. 


b. Compute the expected counts assuming that HO/“0 is true. 
c. Calculate the chi-square test statistic, df, and P-value. 


d. What conclusion would you draw? 


43. pg 744 |») Finger length Is your index finger longer than your ring finger? Or is it the 


other way around? It isn’t the same for everyone. To investigate if there is a relationship 
between gender and relative finger length, we selected a random sample of 460 U.S. high 
school students who completed a survey. The two-way table shows the results. 


Gender 
Female Male Total 
; ; Index longer 158 
on mnger Same length 86 
Ring longer 216 
Total 227 233 460 


Do these data provide convincing evidence at the a=0.100 = 0.10 level of an association 
between gender and relative finger length in the population of students who completed the 
survey? 


44. Regulating guns The National Gun Policy Survey asked a random sample of adults, “Do 
you think there should be a law that would ban possession of handguns except for the 
police and other authorized persons?” Here are the responses, broken down by the 


45. 


46. 


47. 


respondent’s level of education:24 


Education 
Less than| HS grad | Some College | Postgrad 
Opinion HS college grad degree Total 
about Yes 486 
handgun 
ban No 715 
Total 116 213 463 233 176 1201 


Do these data provide convincing evidence at the a=0.05a = 0.05 level of an association 
between education level and opinion about a handgun ban in the adult population? 


Tuition bills A random sample of U.S. adults was recently asked, “Would you support or 
oppose major new spending by the federal government that would help undergraduates 
pay tuition at public colleges without needing loans?” The two-way table shows the 
responses, grouped by age.22 


Total 

Support 856 

Response Oppose 565 
Don’t know 88 

Total 1509 


Do these data provide convincing evidence of an association between age and opinion 
about loan-free tuition in the population of U.S. adults? 


Online banking A recent poll conducted by the Pew Research Center asked a random 
sample of 1846 Internet users if they do any of their banking online. The table summarizes 
their responses by age.22 Is there convincing evidence of an association between age and 
use of online banking for Internet users? 


Age 
18-29 30-49 50-64 65+ Total 
Online Yes 1088 
banking No 758 
Total 395 542 553 356 1846 


Which test? Determine which chi-square test is appropriate in each of the following 
settings. Explain your reasoning. 


a. With many babies being delivered by planned cesarean section, Mrs. McDonald’s 
Statistics class hypothesized that there would be fewer younger people born on the 
weekend. To investigate, they selected a random sample of people born before 1980 


and a separate random sample of people born after 1993. In addition to year of birth, 
they also recorded the day of the week on which each person was born. 


b. Are younger people more likely to be vegan/vegetarian? To investigate, the Pew 
Research Center asked a random sample of 1480 U.S. adults for their age and whether 
or not they are vegan/vegetarian. 


48. Which test? Determine which chi-square test is appropriate in each of the following 
settings. Explain your reasoning. 


a. Does chocolate help heart-attack victims live longer? Researchers in Sweden randomly 
selected 1169 people who had suffered heart attacks and asked them about their 
consumption of chocolate in the previous year. Then the researchers followed these 


people and recorded whether or not they had died within 8 years.24 


b. Random-digit-dialing telephone surveys used to exclude cell-phone numbers. If the 
opinions of people who have only cell phones differ from those of people who have 
landline service, the poll results may not represent the entire adult population. The Pew 
Research Center interviewed separate random samples of cell-only and landline 
telephone users who were less than 30 years old and asked them to describe their 


political party affiliation.22 


49. pg 747 |») Where do young adults live? A survey by the National Institutes of Health 


asked a random sample of young adults (aged 19 to 25 years), “Where do you live now? 
That is, where do you stay most often?” Here is the full two-way table (omitting a few 


who refused to answer and one who reported being homeless):2° 
Gender 
Female Male Total 
Parents’ home 1909 
Another 276 
Living location person's home 
Own place 2423 
Group quarters 246 
Total 2488 2366 4854 


a. Should we use a chi-square test for homogeneity or a chi-square test for independence 
in this setting? Justify your answer. 


b. State appropriate hypotheses for performing the type of test you chose in part (a). 


Here is Minitab output from a chi-square test. 


Chi-Square Test: Female, Male 
Expected counts are printed below observed counts 


Chi-Square contributions 


Parents’ home 


Another home 


Own place 


Group 


Total 
Chi-Sq = 11.038, 


c. Check that the conditions for carrying out the test are met. 


d. Interpret the P-value. What conclusion would you draw? 


are printed below expected counts 


Female 
923 
978.49 
3.147 
144 
141.47 
0.045 
1294 
1241.95 
2.181 
127 
126.09 
0.007 
2488 

DF = 3, 


Male 
986 
930.51 
3.309 
132 
134.53 
0.048 
1129 
1181.05 
2.294 
119 
119.91 
0.007 
2366 


P-Value = 


Total 
1909 


276 


2423 


246 


4854 
0.012 


50. Distance from home A study of first-year college students asked separate random 
samples of students from private and public universities the following question: “How 
many miles is this university from your permanent home?” Students had to choose from 
the following options: 5 or fewer, 6 to 10, 11 to 50, 51 to 100, 101 to 500, or more than 


500.22 Here is the two-way table summarizing the responses: 


5 or fewer 
6 to 10 

11 to 50 
51 to 100 
101 to 500 
Over 500 
Total 


Distance from 
home (miles) 


Type of university 


Public 


43,363 


Private 


1951 1028 
2688 1285 
10,971 bozr 


6765 2211 
15,177 6195 
9811 9486 


25,732 


Total 
2979 
3973 
16,498 

8976 
21,372 
15,297 
69,095 


a. Should we use a chi-square test for homogeneity or a chi-square test for independence 


in this setting? Justify your answer. 


b. State appropriate hypotheses for performing the type of test you chose in part (a). 


Here is Minitab output from a chi-square test. 


Chi-Square Test: Public, Private 


Expected counts are printed below observed counts 


Chi-Square contributions 
Public 
1951 
1869.6 
3.54 
2688 
2493.4 
15.19 
10971 
10354 
36.77 
6765 
5633.2 
227.4 
15177 
13413 
232 
5811 
9600.2 
1496 
43363 
S, 


5 or less 


6 to 10 


11 to 50 


51 to 100 


101 to 500 


Over 500 


Total 
Chi-Sq = 5398.7, 


DF 


c. Check that the conditions for carrying out the test are met. 


d. Interpret the P-value. What conclusion would you draw? 


Private 
1028 
1109.4 
5.97 
1285 
1479.6 
25.59 
5527 
6144.1 
61.98 
2211 
3342.8 
383.2 
6195 
7959.2 
391 
9486 
5696.8 
2520.4 
25132 
P-Value = 


are printed below expected counts 


Total 
2979 


3973 


16498 


8976 


21372 


15297 


69095 
0.0000 


51. Where do you live? Conduct a follow-up analysis for the test in Exercise 49. 


52. How far away do you live? Conduct a follow-up analysis for the test in Exercise 50. 


53. Treating ulcers Gastric freezing was once a recommended treatment for ulcers in the 
upper intestine. Use of gastric freezing stopped after experiments showed it had no effect. 
One randomized comparative experiment found that 28 of the 82 gastric-freezing patients 


improved, while 30 of the 78 patients in the placebo group improved.28 We can test the 
hypothesis of “no difference” in the effectiveness of the treatments in two ways: with a 


two-sample z test or with a chi-square test. 


a. State appropriate hypotheses for a chi-square test. 


b. Here is Minitab output for a chi-square test. Interpret the P-value. What conclusion 


would you draw? 


Chi-Square Test: Gastric freezing, Placebo 
Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


Gastric freezing 
28 

29.73 

0.1 


Improved 


Place 


28. 
0.1 


bo Total 
30 58 
27 


05 


Didn’t improve 54 48 102 


52.27 49.73 

0.057 0.06 
Total 82 78 160 
Chi-Sq = 0.322, DF = 1, P-Value = 0.570 


c. Here is Minitab output for a two-sample z test. Explain how these results are consistent 
with the test in part (a). 


Test for Two Proportions 


Sample X N Sample p 
iL. 28 82 0.341463 
2 30 78 0.384615 


Difference = p(1) - p(2) 

Estimate for difference: -0.0431520 
Test for difference = © (vs not = 0): 
Z = -0.57 P-Value = 0.570 


54. Opinions about the death penalty The General Social Survey (GSS) asked separate 
random samples of people with only a high school degree and people with a bachelor’s 
degree, “Do you favor or oppose the death penalty for persons convicted of murder?” Of 
the 1379 people with only a high school degree, 1010 favored the death penalty, while 319 
of the 504 people with a bachelor’s degree favored the death penalty. We can test the 
hypothesis of “no difference” in support for the death penalty among people in these 
educational categories in two ways: with a two-sample z test or with a chi-square test. 


a. State appropriate hypotheses for a chi-square test. 


b. Here is Minitab output for a chi-square test. Interpret the P-value. What conclusion 
would you draw? 


Chi-Square Test: HS, Bachelor 
Expected counts are printed below observed counts 
Chi-Square contributions are printed below expected counts 


HS Bachelor Total 
Favor 1010 319 1329 
973.28 355.72 
12385 3.790 
Oppose 369 185 554 
405.72 148.28 
3.323 9.092 
Total 1379 504 1883 
Chi-Sq = 17.590, DF = 1, P-Value = 0.000 


c. Here is Minitab output for a two-sample z test. Explain how these results are consistent 
with the test in part (a). 


Test for Two Proportions 


Sample X N Sample p 
1 1010 1379 0.732415 
2 319 504 0.632937 
Difference = p(1) - p(2) 

Estimate for difference: 0.0994783 

Test for difference = 0 (vs not = 0) 

Z = 4.19 P-Value = 0.000 


Multiple Choice Select the best answer for Exercises 55—60. 

Exercises 55—58 refer to the following setting. The National Longitudinal Study of Adolescent 
Health interviewed a random sample of 4877 teens (grades 7 to 12). One question asked, “What 
do you think are the chances you will be married in the next 10 years?” Here is a two-way table 
of the responses by gender:22 


Gender 
Female Male Total 
Almost no 222 
chance 
Some chance, 321 
Opinion about but probably not 
ae A 50-50 chance 959 
A good chance 1445 
Almost certain 1930 
Total 2625 2252 4877 


55. Which of the following is the appropriate null hypothesis for performing a chi-square test? 


a. Equal proportions of female and male teenagers are almost certain they will be married 
in 10 years. 


b. There is no difference between the distributions of female and male teenagers’ opinions 
about marriage in this sample. 


c. There is no difference between the distributions of female and male teenagers’ opinions 
about marriage in the population. 


d. There is no association between gender and opinion about marriage in the sample. 


e. There is no association between gender and opinion about marriage in the population. 


56. Which of the following is the expected count of females who respond “Almost certain”? 
a. 487.7 
b. 525 
c. 965 


d. 


es 


1038.8 
1174 


57. Which of the following is the correct number of degrees of freedom for the chi-square test 
using these data? 


58. 


59. 


60. 


a. 
b. 
c 
d. 


e. 


4 
8 

10 
20 
4876 


For these data, y2=69.8X° = 69.8 with a P-value of approximately 0. Assuming that the 
researchers used a significance level of 0.05, which of the following is true? 


a. 


b. 


A Type I error is possible. 
A Type II error is possible. 
Both a Type I and a Type IJ error are possible. 


There is no chance of making a Type I or Type II error because the P-value is 
approximately 0. 


There is no chance of making a Type I or Type II error because the calculations are 
correct. 


When analyzing survey results from a two-way table, the main distinction between a test 
for independence and a test for homogeneity is 


a. 
b. 
C, 
d. 


e. 


how the degrees of freedom are calculated. 
how the expected counts are calculated. 
the number of samples obtained. 

the number of rows in the two-way table. 


the number of columns in the two-way table. 


Cocaine addicts need cocaine to feel any pleasure, so perhaps giving them an 
antidepressant drug will help. A 3-year study with 72 chronic cocaine users compared an 
antidepressant drug called desipramine with lithium (a standard drug to treat cocaine 
addiction) and a placebo. One-third of the subjects were randomly assigned to receive 
each treatment. At the end of the study, researchers recorded whether or not the subjects 


relapsed.22 Which of the following conditions must be satisfied to perform the appropriate 
chi-square test using the data from this study? 


I. The population distribution is approximately Normal. 
II. The treatments were randomly assigned. 
III. The observed counts are all at least 5. 
a. I only 
b. Il only 
c. I only 
d. I and HI 


e. I, IJ, and III 


Recycle and Review 


For Exercises 61 and 62, you may find the inference summary chart inside the back cover 
helpful. 


61. 


62. 


Inference recap (8.1 to 11.2) In each of the following settings, state which inference 
procedure from Chapter 8, 9, 10, or 11 you would use. Be specific. For example, you 
might answer, “Two-sample z test for the difference between two proportions.” You do 


not have to carry out any procedures.2! 


a. What is the average voter turnout during an election? A random sample of 38 cities was 
asked to report the percent of registered voters who voted in the most recent election. 


b. Are blondes more likely to have a boyfriend than the rest of the single world? 
Independent random samples of 300 blondes and 300 nonblondes were asked whether 
they have a boyfriend. 


Inference recap (8.1 to 11.2) In each of the following settings, state which inference 
procedure from Chapter 8, 9, 10, or 11 you would use. Be specific. For example, you 
might answer, “Two-sample z test for the difference between two proportions.” You do 


not have to carry out any procedures.22 


a. Is there a relationship between attendance at religious services and alcohol 
consumption? A random sample of 1000 adults was asked whether they regularly 
attend religious services and whether they drink alcohol daily. 


b. Separate random samples of 75 college students and 75 high school students were 
asked how much time, on average, they spend watching television each week. We want 
to estimate the difference in the average amount of TV watched by high school and 
college students. 


Exercises 63 and 64 refer to the following setting. For their final project, a group of AP® 
Statistics students investigated the following question: “Will changing the rating scale on 


63. 


64. 


a survey affect how people answer the question?” To find out, the group took an SRS of 
50 students from an alphabetical roster of the school’s just over 1000 students. The first 22 
students chosen were asked to rate the cafeteria food on a scale of 1 (terrible) to 5 
(excellent). The remaining 28 students were asked to rate the cafeteria food on a scale of 0 
(terrible) to 4 (excellent). Here are the data: 


1 to 5 scale 
Rating ul 2 3 4 5 
Frequency 2 3 dT 13 3 
0 to 4scale 
Rating 0 dl 2 3 4 
Frequency 0 0 2 18 8 


Design and analysis (4.2, 11.2) 
a. Was this an observational study or an experiment? Justify your answer. 


b. Explain why it would not be appropriate to perform a chi-square test in this setting. 


Average ratings (1.3, 2.1, 10.2) The students decided to compare the average ratings of 
the cafeteria food on the two scales. 


a. Find the mean and standard deviation of the ratings for the students who were given the 
1-to-5 scale. 


b. For the students who were given the 0-to-4 scale, the ratings have a mean of 3.21 anda 
standard deviation of 0.568. Since the scales differ by one point, the group decided to 
add 1 to each of these ratings. What are the mean and standard deviation of the adjusted 
ratings? 


c. Would it be appropriate to compare the means from parts (a) and (b) using a two- 
sample t test? Justify your answer. 


M _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


Two Statistics students wanted to know if including additional information in a survey 
question would change the distribution of responses. To find out, they randomly selected 
30 teenagers and asked them one of the following two questions. Fifteen of the teenagers 
were randomly assigned to answer Question A, and the other 15 students were assigned to 
answer Question B. 


Question A: When choosing a college, how important is a good athletic program: very 
important, important, somewhat important, not that important, or not important at all? 


Question B: It’s sad that some people choose a college based on its athletic program. 
When choosing a college, how important is a good athletic program: very important, 
important, somewhat important, not that important, or not important at all? 


The table below summarizes the responses to both questions. For these data, the chi- 
square test statistic is y2=6.12% = 6.12. 


Question 
Total 
Very important 9 
Important i 
Somewhat fs) 
Importance of a important 
good athletic = \j6¢ that 3 


program 


important 
Not important at 1 5 6 


all 
Total 15 15 30 


a. State the hypotheses that the students are interested in testing. 


b. Describe a Type I error and a Type II error in the context of the hypotheses stated 
in part (a). 


c. For these data, explain why it would not be appropriate to use a chi-square 
distribution to calculate the P-value. 


d. To estimate the P-value, 100 trials of a simulation were conducted, assuming that 
the additional information didn’t have an effect on the response to the question. In 
each trial of the simulation, the value of the chi-square test statistic was calculated. 
These simulated chi-square test statistics are displayed in the dotplot shown here. 


seecececocoesooes 
Bggeeeeooesooe 


Simulated chi-square statistic 


Based on the results of the simulation, what conclusion would you make about the 
hypotheses stated in part (a)? 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tos6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 11 Review 


Section 11.1: Chi-Square Tests for Goodness of Fit 


In this section, you learned the details for performing a chi-square test for goodness of fit. The 
null hypothesis is that a single categorical variable follows a specified distribution in a 
population of interest. The alternative hypothesis is that the variable does not follow the 
specified distribution in the population of interest. 


The chi-square test statistic measures the difference between the observed distribution of a 
categorical variable and its hypothesized distribution. To calculate the chi-square test statistic, 
use the following formula that involves the observed and expected counts for each value of the 
categorical variable: 

x2=); (Observed count — Expected count)2Expected count 
(Observed count — Expected count)? 
1 eee 


Expected count 


To calculate the expected counts, multiply the total sample size by the hypothesized 
proportion for each category. Larger values of the chi-square test statistic provide more 
convincing evidence that the distribution of the categorical variable differs from the 
hypothesized distribution in the population of interest. 


When the Random, 10%, and Large Counts conditions are satisfied, we can accurately 
model the sampling distribution of the chi-square test statistic using a chi-square distribution 
(density curve). The Random condition says that the data are from a random sample from the 
population of interest. The 10% condition says that the sample size should be less than 10% of 
the population size when sampling without replacement. The Large Counts condition says that 
the expected counts for each category must be at least 5. In a test for goodness of fit, use a chi- 
square distribution with degrees of freedom=number of categories—1 
degrees of freedom = number of categories — 1 

When the results of a test for goodness of fit are significant, consider doing a follow-up 
analysis. Identify which categories of the variable had the largest contributions to the chi- 
square test statistic and whether the observed values in those categories were larger or smaller 
than expected. 


Section 11.2: Inference for Two-Way Tables 


In this section, you learned two different tests to analyze categorical data that are summarized 
in a two-way table. A test for homogeneity compares the distribution of a single categorical 
variable for two or more populations or treatments. A test for independence looks for an 
association between two categorical variables in a single population. 


In a chi-square test for homogeneity, the null hypothesis is that there is no difference in the 
true distribution of a categorical variable for two or more populations or treatments. The 


alternative hypothesis is that there is a difference in the distributions. The Random condition is 
that the data come from independent random samples or groups in a randomized experiment. 
The 10% condition applies for each sample when sampling without replacement, but not in 
experiments. Finally, the Large Counts condition remains the same—the expected counts must 
be at least 5 in each cell of the two-way table. 


To calculate the expected counts for a test for homogeneity, use the following formula: 


row total - column total 
expected count =————_____——_ 


expected count=row total - column totaltable total table total 
To calculate the P-value, compute the chi-square test statistic and use a chi-square distribution 
with (number of rows—1)(number of columns—1)(2umber ofrows — 1) (number of columns — 1) 


In a chi-square test for independence, the null hypothesis is that there is no association 
between two categorical variables in one population (or that the two variables are independent 
in the population). The alternative hypothesis is that there is an association between the two 
variables (or that the two variables are not independent). For this test, the Random condition 
says that the data must come from a single random sample. The 10% condition applies when 
sampling without replacement. The Large Counts condition is still the same—the expected 
counts must all be at least 5. The method for calculating expected counts, the chi-square test 
Statistic, the degrees of freedom, and the P-value are exactly the same in a test for 
independence and a test for homogeneity. 

As with tests for goodness of fit, when the results of a test for homogeneity or 
independence are significant, consider doing a follow-up analysis. Identify which cells in the 
two-way table had the largest contributions to the chi-square test statistic and whether the 
observed values in those cells were larger or smaller than expected. 


Goodness of fit Homogeneity Independence 
Number of 1 2 or more 1 
samples/treatments 
Number of variables 1 1 2 
Null hypothesis The stated There is no difference There is no 
distribution of a in the distribution of a association between 
categorical variable in categorical variable —_ two categorical 
the population of for several variables in the 
interest is correct. populations or population of interest. 
treatments. 


Random condition The data come from a The datacome from The datacome froma 
random sample from independent random random sample from 


the population of samples or groups in the population of 
interest. a randomized interest. 
experiment. 
10% condition When sampling without replacement, n < 0.10N for each sample. 


Large Counts All expected counts = 5 


condition 


Expected counts (samplesize) row total - column totaltable total 
(expectedproportion) row 4 ae total 
| ( expected ) — 

size proportion 
Formula for test X2=> 
statistic (Observed count — Expected count)2Expected count 


. (Observed count — Expected count)? 
Oe, pe 


Expected count 


Degrees of freedom (# categories — 1) (# rows — 1)(# columns — 1) 
TI-83/84 name x2 GOF—test x2-testX’ —test 
x- GOF—-test 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 
Learning Target Section Page(s) Exercise(s) 
State appropriate hypotheses and compute the 11.1 712 ceil 


expected counts and chi-square test statistic for a chi- 
square test for goodness of fit. 


State and check the Random, 10%, and Large 11.1 718 R11.1 
Counts conditions for performing a chi-square test for 
goodness of fit. 


Calculate the degrees of freedom and P-value for a 11.1 716 seh 
chi-square test for goodness of fit. 
Perform a chi-square test for goodness of fit. 11.1 718 Aii4 
Conduct a follow-up analysis when the results of a 11.1, Discussion Bile 
chi-square test are statistically significant. 11.2 on 720, 

739 
State appropriate hypotheses and compute the 11.2 131 R11.4, 
expected counts and chi-square test statistic for a chi- Aili 5S 
square test based on data in a two-way table. 
State and check the Random, 10%, and Large 11.2 734 Ril.2, 
Counts conditions for a chi-square test based on data R11.4, 
in a two-way table. sabe 
Calculate the degrees of freedom and P-value for a 11.2 734 R11.4, 
chi-square test based on data in a two-way table. Pol 
Perform a chi-square test for homogeneity. 11.2 738 R1ii.4 


Perform a chi-square test for independence. 11.2 7144 R11.5 


Choose the appropriate chi-square test in a given 
setting. 


11.2 


147 


Chapter 11 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 
chapter. 


R11.1 |») Testing a genetic model Biologists wish to cross pairs of tobacco plants having 


genetic makeup Gg, indicating that each plant has one dominant gene (G) and one 
recessive gene (g) for color. Each offspring plant will receive one gene for color from 
each parent. The Punnett square shows the possible combinations of genes received by 


the offspring. 
Parent 2 passes on: 
G g 
Parent 1 G GG Gg 
passes on: 
g Gg gg 


The Punnett square suggests that the expected ratio of green (GG) to yellow-green (Gg) 
to albino (gg) tobacco plants should be 1:2:1. In other words, the biologists predict that 
25% of the offspring will be green, 50% will be yellow-green, and 25% will be albino. 
To test their hypothesis about the distribution of offspring, the biologists mate 84 
randomly selected pairs of yellow-green parent plants. Of 84 offspring, 23 plants were 
green, 50 were yellow-green, and 11 were albino. Do the data provide convincing 
evidence at the a=0.01a@ = 0.01 level that the true distribution of offspring is different 
from what the biologists predict? 


R11.2 |») Sorry, no chi-square We would prefer to learn from teachers who know their 


subject. Perhaps even preschool children are affected by how knowledgeable they think 
teachers are. Assign 48 three- and four-year-olds at random to be taught the name of a 
new toy by either an adult who claims to know about the toy or an adult who claims not 
to know about it. Then ask the children to pick out a picture of the new toy from a set of 
pictures of other toys and say its name. The response variable is the count of right 
answers in four tries. Here are the data:22 


Count of correct answers 


Knowledge Knowledgeable 
ofteacher = |gnorant 


Total 


The researchers report that children who were taught by the teacher claiming to be 


knowledgeable did significantly better (y2=20.4, P<0.05)(x" = 20.4, P< 0.05) Explain 
why this result isn’t valid. 


R11.3 |») Fewer TVs? The United States Energy Information Administration periodically 


surveys a random sample of U.S. households to determine how they use energy.22 One 
of the variables they track is how many TVs are in a household (None, 1, 2, 3, 4, or 5 or 
more). The computer output compares the distribution of number of TVs for households 
in 2009 and 2015. 


2009 2015 All 

None 192 174 366 
252 114 
14.113 31.034 

1 3098 1680 4778 
3284 1494 
10.577 23.258 

2 4801 2190 6991 
4806 2185 
0.004 0.010 

3 3406 1495 4901 
3369 1532 
0.408 0.897 

4 1818 689 2507 
1723 784 
5.204 11.442 

2009 2015 All 

5 or more 1242 392 1634 
1123 511 
12.564 27.628 
All 14557 6620 21177 


Cell Contents: Count 
Expected count 
Contribution to Chi-square 
Chi-Square = 137.137, DF = 5, P-Value = 0.000 


a. Which chi-square test is appropriate to analyze these data? Explain your answer. 
b. Show how the numbers 252 and 14.113 were obtained for the 2009/None cell. 


c. Which 3 cells contribute most to the chi-square test statistic? How do the observed 
and expected counts compare for these cells? 


R11.4 |») Stress and heart attacks You read a newspaper article that describes a study of 


whether stress management can help reduce heart attacks. The 107 subjects all had 
reduced blood flow to the heart and so were at risk of a heart attack. They were assigned 
at random to three groups. The article goes on to say: 


One group took a four-month stress management program, another underwent a 


four-month exercise program, and the third received usual heart care from their 
personal physicians. In the next three years, only 3 of the 33 people in the stress 
management group suffered “cardiac events,” defined as a fatal or non-fatal heart 
attack or a surgical procedure such as a bypass or angioplasty. In the same period, 
7 of the 34 people in the exercise group and 12 out of the 40 patients in usual care 


suffered such events.24 


a. Use the information in the news article to make a two-way table that describes the 
study results. 
b. Compare the success rates of the three treatments in preventing cardiac events. 


c. Do the data provide convincing evidence at the a=0.05a = 0.05 level that the true 
success rates for patients like these are not the same for the three treatments? 


R11.5 |») Popular kids Who were the popular kids at your elementary school? Did they get 


good grades or have good looks? Were they good at sports? A study was performed in 
Michigan to examine the factors that determine social status for children in grades 4, 5, 
and 6. Researchers administered a questionnaire to a random sample of 478 students in 
these grades. One of the questions asked, “What would you most like to do at school: 
make good grades, be good at sports, or be popular?” The two-way table summarizes the 
students’ responses.2° Is there convincing evidence of an association between gender and 
goal for students in grades 4, 5, and 6? 


Gender 
Female Male Total 
Grades 247 
Goal Popular 141 
Sports 90 


Total 251 221 478 


Chapter 11 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


Exercises T11.1 and T11.2 refer to the following setting. Recent revenue shortfalls in a 
midwestern state led to a reduction in the state budget for higher education. To offset the 
reduction, the largest state university proposed a 25% tuition increase. It was determined that 
such an increase was needed simply to compensate for the lost support from the state. Separate 
random samples of 50 freshmen, 50 sophomores, 50 juniors, and 50 seniors from the university 
were asked whether they were strongly opposed to the increase, given that it was the minimum 
increase necessary to maintain the university’s budget at current levels. Here are the results: 


Year 
Total 
Strongly Yes 122 
opposed? No 78 
Total 50 50 50 50 200 


711.1 Which null hypothesis would be appropriate for performing a chi-square test? 


a. The closer students get to graduation, the less likely they are to be opposed to 
tuition increases. 


b. The mean number of students who are strongly opposed is the same for each of the 
4 years. 


c. The distribution of student opinion about the proposed tuition increase is the same 
for each of the 4 years at this university. 


d. Year in school and student opinion about the tuition increase are independent in the 
sample. 


e. There is an association between year in school and opinion about the tuition 
increase at this university. 


111.2 The conditions for carrying out the chi-square test in Exercise T11.1 are: 
I. Independent random samples from the populations of interest. 
II. All sample sizes are less than 10% of the populations of interest. 


III. All expected counts are at least 5. 


Which of the conditions is (are) satisfied in this case? 
a. I only 
b. IL only 


c. I and III only 
d. II and II only 


e. I, IZ, and III 


Exercises T11.3—T11.5 refer to the following setting. A random sample of traffic tickets 
given to motorists in a large city is examined. The tickets are classified according to the 
race or ethnicity of the driver. The results are summarized in the following table. 


Race/Ethnicity White Black Hispanic Other 
Number of tickets 69 52 18 9 


The proportion of this city’s population in each of the racial/ethnic categories listed is 


as follows. 
Race/Ethnicity White Black Hispanic Other 
Proportion 0.55 0.30 0.08 0.07 


We wish to test HO:0 * The racial/ethnic distribution of traffic tickets in the city is the 
same as the racial/ethnic distribution of the city’s population. 


T11.3 Assuming HO/“0 is true, what is the expected number of Hispanic drivers who would 
receive a ticket? 


a. 8 
b. 10.36 
c. 11 
d. 11.84 
e, 12 


T11.4 We compute the value of the y2x” test statistic to be 6.57. Assuming that the 
conditions for inference are met, which of the following is the correct P-value? 


a. Greater than 0.20 

b. Between 0.10 and 0.20 
c. Between 0.05 and 0.10 
d. Between 0.01 and 0.05 
e. Less than 0.01 


T11.5 The category that contributes the largest component to the y2x” test statistic is 
a. White, with 12.4 fewer tickets than expected. 
b. White, with 12.4 more tickets than expected. 


c. Hispanic, with 6.16 fewer tickets than expected. 


jae 


. Hispanic, with 6.16 more tickets than expected. 


e. Other, with 1.36 fewer tickets than expected. 


T11.6 Which of the following statements about chi-square distributions are true? 
I. For all chi-square distributions, P(y220)=1" (x? >0)=1 


II. A chi-square distribution with fewer than 10 degrees of freedom is roughly 
symmetric. 


III. The more degrees of freedom a chi-square distribution has, the larger the mean of the 
distribution. 


a. I only 
b. IL only 

c. I only 

d. Tand III 

e. I, I, and III 


Exercises T11.7—T11.10 refer to the following setting. All current-carrying wires 
produce electromagnetic (EM) radiation, including the electrical wiring running into, 
through, and out of our homes. High-frequency EM radiation is thought to be a cause 
of cancer. The lower frequencies associated with household current are generally 
assumed to be harmless. To investigate the relationship between current configuration 
and type of cancer, researchers visited the addresses of a random sample of children 
who had died of some form of cancer (leukemia, lymphoma, or some other type) and 
classified the wiring configuration outside the dwelling as either a high-current 
configuration (HCC) or a low-current configuration (LCC). Here are the data: 


Type of cancer 


Total 
HCC 
Wiring configuration a 
LCC 136 
Total 136 ol 48 215 


Computer software was used to analyze the data. The output included the value 
y2=0.435X = 0.435 


111.7 Which of the following is the appropriate degrees of freedom for the 2x” test? 
a 
b. 
C 


d. 


e. 
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111.8 Which of the following is the expected count of cases with lymphoma in homes with 


an HCC? 
a. 79°31215 215 


b. 10:21215 215 
c. 79-3110 10 


136-31 


d. 136°31215 215 
e. None of these 
T11.9 Which of the following may we conclude, based on the chi-square test results? 


a. There is convincing evidence of an association between wiring configuration and 
the chance that a child will develop some form of cancer. 


b. HCC either causes cancer directly or is a major contributing factor to the 
development of cancer in children. 


c. Leukemia is the most common type of cancer among children. 


d. There is not convincing evidence of an association between wiring configuration 
and the type of cancer that caused the deaths of children. 


e. There is convincing evidence that HCC does not cause cancer in children. 
111.10 A Type I error would occur if we found convincing evidence that 

a. HCC wiring caused cancer when it actually didn’t. 

b. HCC wiring didn’t cause cancer when it actually did. 


c. there is no association between the type of wiring and the form of cancer when there 
actually is an association. 


d. there is an association between the type of wiring and the form of cancer when there 
actually is no association. 


e. the type of wiring and the form of cancer have a positive correlation when they 
actually don’t. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


711.11 A large distributor of gasoline claims that 60% of all drivers stopping at their service 
stations choose regular unleaded gas and that premium and supreme are each selected 
20% of the time. To investigate this claim, researchers collected data from a random 
sample of drivers who put gas in their vehicles at the distributor’s service stations in a 
large city. The results were as follows. 


Gasoline selected 


Regular Premium Supreme 
261 51 88 


Carry out a test of the distributor’s claim at the 5% significance level. 


711.12 A study conducted in Charlotte, North Carolina, tested the effectiveness of three 
police responses to spouse abuse: (1) advise and possibly separate the couple, (2) issue 
a Citation to the offender, and (3) arrest the offender. Police officers were trained to 
recognize eligible cases. When presented with an eligible case, a police officer called 
the dispatcher, who would randomly assign one of the three available treatments to be 
administered. There were a total of 650 cases in the study. Each case was classified 


according to whether the abuser was arrested within 6 months of the original incident.22 


Police response 


Advise and Citation Arrest Total 
separate 
Subsequent jj, 543 
arrest? 
Yes 107 
Total 212 224 214 650 


a. Explain the purpose of the random assignment in the design of this study. 

b. State an appropriate pair of hypotheses for performing a chi-square test in this 
setting. 

c. Assume that all the conditions for performing the test in part (b) are met. The test 
yields y2=5.063X° = 5.063 and a P-value of 0.0796. Interpret this P-value. 

d. What conclusion should we draw from the study? 


T11.13 In the United States, there is a strong relationship between education and smoking: 
well-educated people are less likely to smoke. Does a similar relationship hold in 
France? To find out, researchers recorded the level of education and smoking status of a 
random sample of 459 French men aged 20 to 60 years.28 The two-way table displays 


the data. 
Education 
Primary Secondary | University Total 
school school 

146 

Smoking 125 
status? 

104 

84 

459 


Is there convincing evidence of an association between smoking status and educational 


level among French men aged 20 to 60 years? 


More About Regression 
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Introduction 


Section 12.1 Inference for Linear Regression 


INTRODUCTION 


When a scatterplot shows a linear relationship between a quantitative explanatory variable xx 
and a quantitative response variable y,¥» we can use the least-squares line calculated from the 
data to predict y¥ for a given value of x.”. If the data are a random sample from a larger 
population, we use statistical inference to answer questions like these: 


e Is there really a linear relationship between x# and y¥Y in the population, or is it plausible that 
the pattern we see in the scatterplot happened by chance alone? 


e Inthe population, how much will the predicted value of y¥ change for each increase of 1 unit 
in x?2? What’s the margin of error for this estimate? 


If the data come from a randomized experiment, the values of the explanatory variable 
correspond to the levels of some factor that is being manipulated by the researchers. For 
instance, malaria researchers might want to investigate how temperature affects the life span of 
mosquitoes. They could set up several tanks at each of several different temperatures and then 
randomly assign hundreds of mosquitoes to each of the tanks. The response variable of interest 
is the time (in days) from hatching to death. Suppose that a scatterplot of average life span 
versus temperature has a linear form. We use statistical inference to decide if these data provide 
convincing evidence that changes in temperature cause changes in life span. 


It is conventional to refer to a scatterplot of the points (x, yi? y) as a graph of y¥ versus x.2. 


So a scatterplot of life span versus temperature uses life span as the response variable and 
temperature as the explanatory variable. 


In Section 12.1, we will show you how to estimate and test claims about the slope of the 
population (true) regression line. Before you perform inference about a slope, you should 
understand the sampling distribution of the sample slope. The following activity gets you 
started. 


ACTIVITY Sampling from Old Faithful 


In Chapter 7, you learned about the sampling distribution of a sample proportion p/? and the 
sampling distribution of asample mean x .@- In this activity, you will explore the sampling 
distribution of the sample slope b121 when calculating least-squares regression lines from 
random samples. 


Benny Marty/Shutterstock 


Old Faithful geyser is one of the most popular attractions in Yellowstone National Park. 
As you saw in Chapter 3, it is possible to use a least-squares regression line to predict y=¥ ~ 
interval of time (in minutes) until the next eruption from x= = (in minutes) of an eruption. 
In one particular month, Old Faithful erupted 263 times. Your teacher has printed out this 
population on 263 cards. Each card gives the duration of a particular eruption on one side 
and the interval to the next eruption on the other side. 


1. Form teams of 2 or 3 students. 


2. Have one student from each team select a random sample of 15 cards from the population 
and return to the team. This student should read aloud the duration (x)(@ ) and interval (x) 
(7) values for each eruption while the remaining team members enter these values into 
their calculators (or other technology). Replace the 15 cards in the population. 


3. After each team has recorded the values for its 15 randomly selected eruptions, the team 
should calculate the sample least-squares regression line for its data. Record the value of 
the sample slope b1.21- 


4. Your teacher will prepare an axis for a dotplot to display the distribution of the sample 
slope. Add the value you calculated in Step 3 to this dotplot, using the symbol “b1°1” 
instead of a dot. Repeat Steps 2—4 as necessary to get at least 20 sample slopes. 


5. Asa class, describe this simulated sampling distribution. Remember to discuss shape, 
center, and variability. 


6. What do you think would happen to the simulated sampling distribution if the sample size 
was increased from 15 to 50? Discuss as a class. 


Sometimes a scatterplot reveals that the relationship between two quantitative variables has 
a curved form. One strategy is to transform one or both variables so that the graph shows a 
linear pattern. Then we can use least-squares regression to fit a linear model to the data. Section 
12.2 examines methods of transforming data to achieve linearity. 


SECTION 12.1 Inference for Linear Regression 


LEARNING TARGETS By the end of the section, you should be able to: 


e Check the conditions for performing inference about the slope B11 of the 
population (true) regression line. 


Interpret the values of bO, b1, s,0, 1, 8, and SEb1°% in context, and determine 


these values from computer output. 


Construct and interpret a confidence interval for the slope B1°: of the population 
(true) regression line. 


Perform a significance test about the slope B1°1 of the population (true) 
regression line. 


Figure 12.1 shows the relationship between x=durationt = duration (in minutes) of an 
eruption and y=interval of timeY = interval of time (jn minutes) until the next eruption for all 
263 eruptions of the Old Faithful geyser during a particular month. The least-squares regression 
line is also shown. 
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FIGURE 12.1 Scatterplot of the duration and interval between eruptions of Old Faithful for all 
263 eruptions in a single month. The population least-squares line is shown in blue. 


1.0 


Because the scatterplot includes all the eruptions in a particular month, the least-squares 
regression line is called a population regression line (or true regression line). In most cases, 
we don’t have data from the entire population, so we use the sample regression line (or 
estimated regression line) to estimate the population regression line. 


DEFINITION Population regression line, Sample regression line 


A regression line calculated from every value in the population is called a 


population regression line (true regression line). The equation of a population 
regression line is uU=BO+B1x4 — 59 + Pi® where 


e wy» is the mean y¥-value for a given value of x.z. 
e B00 is the population yY intercept. 
e B11 is the population slope. 


A regression line calculated from a sample is called a sample regression line 
(estimated regression line). The equation of a sample regression line is 
y*=b0+b1x¥ = 4 + 612 where 


e yY is the estimated mean y¥-value for a given value of x.#. 
e b0% is the sample yY intercept. 


e 6151 is the sample slope. 


Note that the symbols B0P0 and p11 here refer to the intercept and slope of the population 


regression line. They are in no way related to the probability of a Type Il error, which is 


sometimes designated by the symbol pe (the Greek letter “beta”). 


In Chapter 3, we interpreted y/¥ as the predicted value of y¥ for a given value of x.«- That 
interpretation is still valid. What if we want to use the sample regression line to make an 
inference about the population regression line p=B0+B1x?4 — Bo + 912? Now we can think of 
y/¥ as being an estimate for py“ the mean value of y¥Y for all individuals in the population 
with the given value of x.2. 

How does the slope of the sample regression line b1°1 relate to the slope of the population 
regression line 8121? To find out, we’ll learn about the sampling distribution of the sample 


slope. 


Sampling Distribution of b1 


Confidence intervals and significance tests about the slope of the population regression line are 
based on the sampling distribution of b1%1 the slope of the sample regression line. Figure 12.2 
shows the results of taking three different SRSs of 15 Old Faithful eruptions from the 
population described earlier. Each graph displays the selected points and the least-squares 


regression line for that sample (in green). The population regression line (uy=33.35+13.29x) 


(Hy = 33.35 + 13.292) i¢ also shown (in blue). 
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FIGURE 12.2 Scatterplots and least-squares regression lines (in green) for three different SRSs 
of 15 Old Faithful eruptions, along with the population regression line (in blue). 


Notice that the slopes of the sample regression lines (b1=10.0, b1=12.5, and b1=15.7) 

(6) = 10.0, b) = 12.5, and b; = 15.7) vary quite a bit from the slope of the population 
regression line, 81=13.29°1 = 13-29 The pattern of variation in the sample slope b1% is 
described by its sampling distribution. 

To get a better picture of this variation, we used technology to simulate choosing 1000 
SRSs of n=15n = 15 points from the Old Faithful data, each time calculating the sample 
regression line y\=b0+b1x.¥ = 40 + 612.Figure 12.3 displays the values of the slope b1%1 for 
the 1000 sample regression lines. We have added a vertical line (in blue) at 13.29 
corresponding to the slope of the population regression line B1.°1- 
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FIGURE 12.3 Dotplot of the sample slope b151 of the least-squares regression line in 1000 
simulated SRSs of n=15n = 15 eruptions. The population slope (13.29) is marked with a blue 
vertical line. 


Let’s describe this simulated sampling distribution of b1.1- 


Shape: We can see that the distribution of b1%1-values is roughly symmetric and single 
peaked. Figure 12.4(a) is a Normal probability plot of these sample regression line slopes. 
The strong linear pattern in the graph tells us that the simulated sampling distribution of b11 
is close to Normal. 


Center: The mean of the 1000 b1¥1-values is 13.34. This value is quite close to the slope of 
the population (true) regression line, 13.29. 


Variability: The standard deviation of the 1000 b11-values is 1.40. We will soon see that 
the standard deviation of the sampling distribution of b1® is actually 1.42. 
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FIGURE 12.4 (a) Normal probability plot and (b) histogram of the 1000 sample regression line 
slopes from Figure 12.3. The blue density curve in Figure 12.4(b) is for a Normal distribution 
with mean 13.29 and standard deviation 1.42. 


Figure 12.4(b) is a histogram of the b11-values from the 1000 simulated SRSs. We have 
superimposed the density curve for a Normal distribution with mean 13.29 and standard 
deviation 1.42. This curve models the approximate sampling distribution of the slope quite 
well. 

Let’s do a quick recap. For all 263 eruptions of Old Faithful in a single month, the 
population regression line is py=33.35+13.29x.4y ~ 33.35 + 13.292. wre use the symbols 
BO=33.3590 = 33.35 and B1=13.29%1 = 13-29 to represent the yY intercept and slope 
parameters. The standard deviation of the residuals for this line is the parameter o=6.47. 

o = 6.47. 

Figure 12.4(b) shows the approximate sampling distribution of the slope b11 of the sample 
regression line for samples of 15 eruptions. If we take all possible SRSs of size n=15n = 15 
from this population, we get the sampling distribution of b1.%1- Can you guess its shape, center, 
and variability? 


Shape: Approximately Normal 


Center: b1=$1=13.29" ~ By = 13.29 (b151 is an unbiased estimator of B1.°1-) 


re. fe 


a ~ —. 42, 
Variability: ob1=coxn=6.471.1815=1.42, "asym 1.181 where ox?7= is the 
standard deviation of duration for the 263 eruptions. 


We interpret ob17" just like any other standard deviation: the slopes of the sample 
regression lines typically differ from the slope of the population regression line by about 1.42. 
Here’s a summary of the important facts about the sampling distribution of b1.1- 


SAMPLING DISTRIBUTION OF A SLOPE 


Choose an SRS of nt observations (x, y)(@ > Y) froma population of size NV with least- 
squares regression line 


wy=B0+B1xy — Bo | By r 
Let b1°1 be the slope of the sample regression line. Then: 


e¢ The mean of the sampling distribution of b11 is pb1=B1.: = 1: 
e The standard deviation of the sampling distribution of b1°1 is 


Co 
Oh, = 


ob1l=ooxn oryn 


as long as the 10% condition is satisfied: n<0.10 N.n < 0.10 N. 


¢ The sampling distribution of b1%1 will be approximately Normal if the values of the 
response variable y¥ follow a Normal distribution for each value of the explanatory 
variable xx (the Normal condition). 


We’ll say more about the Normal condition in a moment. 


Think About It 


WHAT’S WITH THAT FORMULA FOR ob1?7: * Three factors affect the standard 
deviation of the sampling distribution of b1:1 : 


1. 0,%> the standard deviation of the residuals for the population regression line. Because o7 
is in the numerator of the formula, when o@ is larger, so is ob1.7%* When the points vary 
more from the population (true) regression line, we should expect more variability in the 
slopes b151 of sample regression lines from repeated random sampling or random 
assignment. 


2. ox,7*» the standard deviation of the explanatory variable. Because ox%= is in the 
denominator of the formula, when ox is larger, ob1°": is smaller. More variability in 
the values of the explanatory variable leads to a more precise estimate of the slope of the 
true regression line. 


3. n,’: the sample size. As with every other formula for the standard deviation of a statistic, 
the variability of the statistic gets smaller as the sample size increases. A larger sample 
size will lead to a more precise estimate of the true slope. 


Conditions for Regression Inference 


We can fit a least-squares line to any data relating two quantitative variables, but the results are 
useful only if the scatterplot shows a linear pattern. Inference about regression involves more 
detailed conditions. Figure 12.5 shows the regression model in picture form when the 
conditions are met. 


For any fixed x, the responses y 
follow a Normal distribution 
with standard deviation o. 


FIGURE 12.5 The regression model when the conditions for inference are met. The line is the 
population (true) regression line, which shows how the mean response py” changes as the 
explanatory variable x changes. For any fixed value of x,“» the observed response y¥ varies 
according to a Normal distribution having mean yy“ and standard deviation o.¢- 


Shape: For each possible value of the explanatory variable x,”» the values of the response 
variable y¥ follow a Normal distribution. 


Center: For each possible value of the explanatory variable x,”» the mean value of the 
response variable pry“ falls on the population (true) regression line py=B0+B1x. 

Hy = 80 + P12. Because the regression line goes through the mean value of y¥ for each value 
of x,” we can interpret the slope as the change in the average value of y¥ for each 1-unit 


increase in x.2.- 


Variability: For each possible value of the explanatory variable x,” the values of the 
response variable y¥ have the same standard deviation o.¢- 


Consider the population of all eruptions of the Old Faithful geyser in a given year. For each 
eruption, let x be the duration (in minutes) and y¥Y be the interval of time (in minutes) until the 
next eruption. Suppose that the conditions for regression inference are met, the population 


= 34+ 182, and the variability around the line is given by 


regression line is py=34+13x,Y 
o=6.0 = 6. 
Let’s focus on the eruptions that lasted x=2% = 2 minutes. For this “subpopulation”: 


e The average amount of time until the next eruption is py=34+13(2)=60 minutes. 
jly = 34+ 13 (2) = 60 minutes. 


e The amounts of time until the next eruption follow a Normal distribution with mean 60 
minutes and standard deviation 6 minutes. 


e For about 95% of these eruptions, the amount of time y¥ until the next eruption is between 
60-2(6)=48 minutes®0 — 2(6) = 48 minutes ang 60+2(6)=72 minutes.99 + 2(6) = 72 minutes. 


That is, if the previous eruption lasted 2 minutes, 95% of the time the next eruption will 
occur in 48 to 72 minutes. 


— My =34 + 13x 
Interval 
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1.5 2 25 
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Here are the conditions for performing inference about the linear regression model. The 
acronym LINER should help you remember them! 


CONDITIONS FOR REGRESSION INFERENCE 


Suppose we have n?’ observations on a quantitative explanatory variable xt anda 
quantitative response variable y.¥: Our goal is to study or predict the behavior of y¥ for 
given values of x.@. 


e Linear: The actual relationship between x and y¥ is linear. For any fixed value of x,”> 


the mean response py” falls on the population (true) regression line py=B0+B1x. 
fy = Bo + Biz. 


Independent: Individual observations are independent of each other. When sampling 
without replacement, check the 10% condition. 


Normal: For any fixed value of x,” the response y¥ varies according to a Normal 
distribution. 


Equal SD: The standard deviation of y¥ (call it o7) is the same for all values of x.®. 


Random: The data come from a random sample from the population of interest or a 
randomized experiment. 


Although the conditions for regression inference are a bit complicated, it is not hard to 
check for major violations. Most of the conditions involve the population (true) regression line 
and the deviations of responses from this line. We usually can’t observe the population line, but 
the sample regression line estimates it. The residuals from the sample regression line estimate 
the deviations from the population line. We can check several of the conditions for regression 
inference by looking at graphs of the residuals. Start by making a residual plot and a histogram, 
dotplot, stemplot, boxplot, or Normal probability plot of the residuals. 


Here’s a summary of how to check the conditions one by one. 


e Linear: Examine the scatterplot to see if the overall pattern is roughly linear. Make sure 
there are no leftover curved patterns in the residual plot. 


Good: Scatterplot has a linear form. Bad: Residual plot shows a curved pattern. 
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e Independent: Knowing the value of the response variable for one individual shouldn’t help 
predict the value of the response variable for other individuals. If sampling is done without 
replacement, remember to check that the sample size is less than 10% of the population size 
(10% condition). There are also other issues that can lead to a lack of independence. One 
example is measuring the same variable over time, yielding what is known as time-series 
data. Knowing that a young girl’s height at age 6 is 48 inches would definitely give you 
additional information about her height at age 7. We will avoid doing inference for time- 
series data in this course. 


e Normal: Make a histogram, dotplot, stemplot, boxplot, or Normal probability plot of the 
residuals and check for strong skewness or outliers. Ideally, we would check the Normality 
of the residuals at each possible value of x.2- Because we rarely have enough observations at 
each x£-value, however, we make one graph of all the residuals to check for Normality. 


e Equal SD: Look at the scatter of the residuals above and below the “residual=0” 
“residual = 0” line in the residual plot. The variability of the residuals in the vertical 
direction should be roughly the same from the smallest to the largest x2-value. 


Good: Residuals have roughly equal variability 
at all x-values in the data set. 


Bad: The response variable y has greater variability for 
larger values of the explanatory variable x. 
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e Random: See if the data came from a random sample from the population of interest or a 
randomized experiment. If not, we can’t make inferences about a larger population or about 
cause and effect. 


Let’s look at an example that illustrates the process of checking conditions. 


EXAMPLE |The helicopter experiment (>) 
Checking conditions 


Fotosearch 


PROBLEM: Mrs. Barrett’s class did a fun experiment using paper helicopters. After making 
70 helicopters using the same template, students randomly assigned 14 helicopters to each of 
five drop heights: 152 centimeters (cm), 203 cm, 254 cm, 307 cm, and 442 cm. Teams of 
students released the 70 helicopters in a random order and measured the flight times in 
seconds. The class used computer software to carry out a least-squares regression analysis 
for these data. Here are a scatterplot, residual plot, and histogram of the residuals. 
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Check whether the conditions for performing inference about the regression model are met. 


SOLUTION: 


Use the LINER acronym! 


e Linear: The scatterplot shows a clear linear form and there is no leftover curved pattern 
in the residual plot. Vv 


Although there are “stacks” in the residual plot, the residuals are centered on the 


horizontal line at O for each drop height used in the experiment. 


® Independent: Because the helicopters were released in a random order and no helicopter 
was used twice, knowing the result of one observation should not help Us predict the value 
of another observation. V 


e Normal: There is no strong skewness or outliers in the histogram of the residuals. V 


Note that we do not have to check the 10% condition here because there was no 


random sampling. 


e Equal SD: The residual plot shows a similar amount of scatter about the residual=o 
residual = 0 [ine for each drop height. However, flight times seem to vary a little more 


for the helicopters that were dropped. from a height of 307 cm. V 
© Random: The helicopters were randomly assigned to the five possible drop heights. v 


FOR PRACTICE, TRY EXERCISE 5 


You will always see some irregularity when you look for Normality and equal standard 
deviation in the residuals, especially when you have few observations. og Don’t overreact to 


minor issues in the graphs when checking the Normal and Equal SD conditions. 


Estimating the Parameters 


When the conditions are met, we can do inference about the regression model py=B0+B1x. 
Hy = 80 + P12. The first step is to estimate the unknown parameters. If we calculate the sample 
regression line y\=b0+b1x,¥ = 4 + 512, the sample slope b1%1 is an unbiased estimator of the 
true slope B11 and the sample y¥Y intercept b020 is an unbiased estimator of the true y¥ 
intercept B0.°0. The remaining parameter is the standard deviation o,%> which describes the 


variability of the response y¥ about the population (true) regression line. 


AP® EXAM TIP 


We use the same notation as the AP® Statistics exam formula sheet for the 


equation of the sample regression line y*=b0+b1x.¥ = % + 512. However, your 
graphing calculator probably uses the notation y*=at+bx.Y = @ + 6x. Just remember: 
The slope is always the coefficient of x,2» no matter what symbol is used. 


The least-squares regression line computed from the sample data estimates the population 
(true) regression line. So the residuals estimate how much y¥ varies about the population line. 
Because o@ is the standard deviation of responses about the population (true) regression line, 
we estimate it with the standard deviation of the residuals s:s : 


See a ~ \2 

| S residuals s (y; ~%;)? 

s& = | CO SS 
s=)'residuals2n—2=)'(yi-yi)2n-2 \ n—2 n—2 


Recall from Chapter 3 that ss describes the size of a “typical” prediction error. Because s$ is 
estimated from data, it is sometimes called the regression standard error or the root mean 
squared error. 

It is possible to do inference about any of the three parameters in the regression model: £0, 
81°0, 1 or 0.0. However, the slope 81°: of the population (true) regression line is usually the 
most important parameter in a regression problem. So we’ll restrict our attention to inference 
about the slope. 


When the conditions are met, the sampling distribution of the slope b121 is approximately 


Normal with mean pb1=B1"" ~ 9 and standard deviation 
Co 

On = i 

ob1=ooxn ey" 


In practice, we don’t know o@ for the true regression line. So we estimate it with the standard 
deviation of the residuals, s.s. We also don’t know the standard deviation ox?« for the 
population of x%-values. (For reasons beyond the scope of this text, we replace the 
denominator with sxn-1.%+ vn—1.) So we estimate the variability of the sampling distribution 
of bb with the standard error of the slope 
SE, a 
SEb1=ssxn-1 szyn—1 
This standard error has the same interpretation as the standard error of x @ or the standard 
error of p\.P: It measures how far our estimate typically varies from the truth. In this case, it 
measures how far the sample slope typically varies from the population (true) slope if we repeat 
the data production process many times. 
Although we give the formula for the standard error of b11 you should rarely have to 


calculate it by hand. Computer output gives the standard error SEb1> Eh: immediately to the 
right of the sample slope b1.°1- 


AP® EXAM TIP 


The AP® Statistics exam formula sheet gives the formula for the standard error of 
the slope as 


2 
sb1=)\(yi-y/i)2n—-2) (xi-x )2 pa i—®) 


The numerator is just a fancy way of writing the standard deviation of the residuals 
s. Can you show that the denominator of this formula is the same as ours? 


EXAMPLE |The helicopter experiment, part 2 (») 
Estimating the parameters 


istockphoto 


PROBLEM: Earlier, Mrs. Barrett’s class used computer software to perform a least-squares 
regression analysis of their helicopter data. Recall that the data came from dropping 70 paper 
helicopters from various heights (in inches) and measuring the flight times (in seconds). 
Some output from this regression analysis is shown here. We checked conditions for 
performing inference earlier. 


Regression Analysis: Flight time versus Drop height 


Predictor Coef SE Coef T P 
Constant -0.03761 0.05838 -0.64 0.522 
Drop height (cm)  ©.0057244 0©.0002018 28.37 0.000 
S = 0.168181 R-Sq = 92.2% R-Sq(adj) = 92.1% 
a. What is the estimate for B020 ? Interpret this value. 
b. What is the estimate for B12P1 ? Interpret this value. 


c. What is the estimate for 62? Interpret this value. 


d. Give the standard error of the slope SEb15F 2: Interpret this value. 
SOLUTION: 


a. bo=-0.03761;00 = —0.03761; if a helicopter 1s dropped. from o cm, tt will take -0.03761 


0.03761 second to land, on average. 


A negative flight time is clearly unrealistic and is likely due to extrapolating to x=0 


x = 0 from the data in the experiment. 


b. 61=0.0057244;01 = 0.0057244; for each increase of 1cmin drop height, the average 
flight time increases by 0.0057244 second. 


C. s=0.168181;5 = 0.168181; the actual flight times typically vary by about 0.168181 
second from the times predicted with the least-squares regression line using x=drop 
height.” = drop height. 


Remember that the slope and y¥ intercept describe average flight times, not the 


actual flight times. You could also describe the slope as the increase in predicted 
flight time for each increase of 1 cm in drop height. 


d. SEb1=0.0002018:> Ey, = 0.0002018; if we repeated the random assignment many times, 


the slope of the sample regression line would typically vary by about 0.0002018 from 
the slope of the true regression line for predicting flight time from drop height. 


The standard error of the slope is just to the right of the slope, in the column called 


Oo COC: 


FOR PRACTICE, TRY EXERCISE 7 


When we compute the least-squares regression line based on a random sample of data, we 
can think about doing inference for the population regression line. When our least-squares 
regression line is based on data from a randomized experiment, as in the helicopter example, 
the resulting inference is about the true regression line relating the explanatory and response 
variables. From now on, we’|I use the term population regression line in sampling situations 
and the term true regression line when describing experiments. 


Constructing a Confidence Interval for the Slope 


In a regression setting, we often want to estimate the slope B11 of the population (true) 


regression line. The slope b1°1 of the sample regression line is our point estimate for 61.71 “A 
confidence interval is more useful than the point estimate because it gives a set of plausible 
values for B1.A1- 

The confidence interval for 81°: has the familiar form 

Statistic+(critical value)-(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 

We use the statistic b1%1 as our point estimate and SEb1°: to estimate the standard 
deviation of the statistic. Because we don’t know the true standard deviation ob1,7""* we must 
use a t*t* critical value rather than a z*2° critical value. Get the t*t* critical value from a tt 
distribution with n-2” — 2 degrees of freedom. (The explanation of why df=n-2df = n — 2 js 
beyond the scope of this book.) Here is the formula: 


b1+t*SEb1"1 = & SEs, 


We call this a té interval for the slope. Here are the details. 


t¢ INTERVAL FOR THE SLOPE 


When the conditions are met, a C% confidence interval for the unknown slope 81/1 of the 
population (true) regression line is 


b1+t«SEb1"1 + @ SE», 


where t*¢* is the critical value for the té distribution with n-2” — 2 degrees of freedom and 
C% of the area between -t*~?” and t*.t*. 


o@ The values of té given in the computer regression output are not the critical values 
for a confidence interval. They come from carrying out a significance test about the y¥ 
intercept or slope of the population (true) regression line. We’II discuss tests in more detail 
shortly. 

You can find a confidence interval for the y¥ intercept B00 of the population (true) 
regression line in the same way, using b0 and SEb0°!% from the “Constant” row of the 
computer output. However, we are usually interested only in the point estimate for B00 that’s 
provided in the output. 

Here is an example using a familiar context that illustrates the four-step process for 
calculating and interpreting a confidence interval for the slope. 


EXAMPLE | How much is that truck worth? (>) f. 


4 STEP 


Confidence interval for a slope 


Oleksiy Maksymenko/imageBROKER/Newscom 


PROBLEM: Everyone knows that cars and trucks lose value the more they are driven. Can 
we predict the price of a used Ford F-150 SuperCrew 4x44 x 4 if we know how many miles 
it has on the odometer? A random sample of 16 used Ford F-150 SuperCrew 4x4s4 4s was 
selected from among those listed for sale on autotrader.com. The number of miles driven and 
price (in dollars) were recorded for each of the trucks. Here are the data: 


Miles driven 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 


Price ($) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price ($) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


Here is some computer output from a least-squares regression analysis of these data. 
Construct and interpret a 90% confidence interval for the slope of the population regression 


line. 


Price ($) 


10,000 + 


5000 + 


Residual ($) 


—5000— * 


T 
80,000 
Miles driven 


T T T T T T T 
120,000 160,000 0 40,000 80,000 120,000 160,000 


Miles driven 


Frequency 
tN 
ut 


— 
ok; 


-8000 ~4000 0 4000 8000 
Residual ($) 


Regression Analysis: Price ($) versus 
Miles driven 


Predictor Coef SE Coeff T P 
Constant 38257 2446 15.640.000 
Miles -0.16292 0.03096-5.260.000 
driven 

Se R-Sq =  R-Sq(adj) = 64.0% 
5740.13 66.4% 


SOLUTION: 


Follow the four-step process! 


STATE: 90% CI for B1=the slope of the population regression 

8, = the slope of the population regression [j¢ relating y=price = price tg x=mifesa = miles 
driven for used Ford F-150 SuperCrew 4x4s4 x 4s fisted for sale on autotrader.com. 
PLAN: tt interval for the slope. 


Remember to use the acronym LINER when checking the conditions. 


Linear: The scatterplot shows a clear linear pattern. Also, the residual plot shows no 
leftover curved patterns. Vv 


Independent: Assume that 16<10%16 < 10% of all used Ford F-150 SuperCrew 4x4s4 x 4s. 
J 


Normal: There is no strong skewness or outliers in the histogram of residuals. V 


Equal SD: The scatter of ‘points around the residual=oresidual = 0 fine appears to be about 
the same at all x-values. V 


Random: Random sample of 16 used Ford F-150 SuperCrew 4x4s.4 x 4s. V 


b1+t*SEb151 +t SEs, 


DO: With 
df=16-2=14,t*=1.761.—0.16292+1.761(0.03096)=—0.16292+0.05452=(-0.21744,—0.10840) 
df = 16—2=14,t* = 1.761. — 0.16292 + 1.761 (0.03096) 

= —0.16292 + 0.05452 
= (—0.21744, —0.10840) 


Refer to Technology Corner 30 for details on how to do calculations for a t interval for 


slope. The calculator gives (-0.2173,-0.1084)(—9-2173, — 0.1084) Using df=14.df = 14. 


CONCLUDE: We are 90% confident that the interval from -0.2174—9-2174 to -0.1084 

0.1084 captures the slope of the population regression line relating y=priced = Price to 
x=milest = miles driven for used Ford F-150 SuperCrew 4x4s4 x 4s fisted for sale on 
autotrader.com. 


This means we are 90% confident that the average price of a used Ford F-150 goes 


down between $0.1084 and $0.2174 per mile. 


FOR PRACTICE, TRY EXERCISE 11 


The change in average price of a used Ford F-150 is quite small for a 1-mile increase in miles 
driven. What if miles driven increased by 1000 miles? We can just multiply both endpoints of 
the confidence interval in the example by 1000 to get a 90% confidence interval for the 
corresponding change in average price. The resulting interval is (-217.44,-108.40). 

(—217.44, —108.40). That is, we are 90% confident that the average price of a used Ford F-150 
will decrease between $108.40 and $217.44 for every additional 1000 miles driven. 


So far, we have used computer regression output when performing inference about the 
slope of a population (true) regression line. The TI-84 and other calculators can do the 
calculations for inference when the sample data are provided. 


30. TECHNOLOGY CORNER | CONSTRUCTING A CONFIDENCE 
INTERVAL FOR SLOPE 


TI-Nspire and other technology instructions are on the book’s website at 


highschool. bfwpub.com/tps6e. 

Let’s use the data from the preceding example to construct a confidence interval for the 
slope of a population (true) regression line on the TI-84. Note: The TI-83 and older 
operating systems for the TI-84 do not include this option. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


Xlist:Li 
Ylist:Le2 
Frea:1 
C-Level: .9 
RegEQ: 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


y=atbx 

(-.2173, -.1084) 
b= ~-. 1628114837 
df=14 
s=5737.55499 
a=38254. 8639 
r2=, 664549225 
r=-, 8151988868 


e Enter the xz-values (miles driven) into L1 and the y¥-values (price) into L2. 
e Press |STAT|, then choose TESTS and LinRegTInt.... 
e Inthe LinRegTInt screen, adjust the inputs as shown. Then highlight “Calculate” and 


press [ENTER| 


e The linear regression té interval results are shown here. Note that ss is the standard 
deviation of the residuals, not the standard error of the slope. 


AP® EXAM TIP 


When you see a list of data values on an exam question, wait a moment before 


typing the data into your calculator. Read the question through first. Often, 
information is provided that makes it unnecessary for you to enter the data at all. 
This can save you valuable time on the AP® Statistics exam. 


CHECK YOUR UNDERSTANDING 


fy 


”, Many people believe that students learn better if they sit closer to the front of the 
classroom. Does sitting closer cause higher achievement, or do better students simply choose 
to sit in the front? To investigate, a statistics teacher randomly assigned students to seat 
locations in the classroom for a particular chapter and recorded the test score for each student 
at the end of the chapter. Here are the data, along with output from a regression analysis. 


Construct and interpret a 95% confidence interval for the slope of the true regression line. 


Row 1 1 1 1 2 2 2 2 
Score 76 a 94 99 83 85 74 19 
Row 3 3 6 6) 4 4 4 4 
Score 90 88 68 78 94 72 101 70 
Row 4 5 5 5 5 5 6 6 
Score | 79 76 65 90 67 96 88 19 
Row 6 6 tk f 7 i 

Score 90 83 19 76 77 63 


Test score 


Residual 


1 2 3 4 5 6 7 
Row number 


1 2 3 4 $ 6 7 
Row number 


-20 -15 -10 -5 0 5 10 15 20 
Residual 
Term Coef SE Coef T-Value P-Value 
Constant 85.71 4.24 20.22 0.000 
Row lesley 0.947 ale AS 0.248 
S = 10.0673 R-sq = 4.73% R-sq(adj) = 
1.33% 


Performing a Significance Test for the Slope 


When the conditions for inference are met, we can use the slope b1%1 of the sample regression 
line to construct a confidence interval for the slope 81°! of the population (true) regression 
line. We can also perform a significance test to determine whether a specified value of B1°1 is 
plausible. The null hypothesis has the general form H0:81=hypothesized slope. 

Ho : §; = hypothesized slope. To do a test, calculate the standardized test statistic: 


standardized test statistic=statistic - parameterstandard deviation of 


Statistict=b1 — hypothesized slopeSEb1 


; a statistic — parameter 
standardized test statistic = ————_________—__ 
standard deviation of statistic 


b; — hypothesized slope 
SE 
by 


To find the PP-value, use a té distribution with n-2” — 2 degrees of freedom. Here are the 
details for the t test for the slope. 


té TEST FOR THE SLOPE 


Suppose the conditions are met. To perform a test of the hypothesis H0:61=hypothesized 
slope,Ho : 41 = hypothesized slope, compute the standardized test statistic 


b, —hypothesized slope 


SE, 
t=b1—-hypothesized slopeSEb1 1 
Find the PP-value by calculating the probability of getting a tt statistic this large or larger in 


the direction specified by the alternative hypothesis Ha/#e in a tt distribution with df=n-2. 
df = n — 2. 


If sample data suggest a linear relationship between two variables, how can we determine 
whether this happened just by chance or whether there is actually a linear association between 
x@ and yY in the population? By performing a test of H0:81=0.40 : i) =0. a regression line 
with slope 0 is horizontal. That is, the mean of y¥ does not change at all when x# changes. So 
HO: B1=00 : 91 = 9 says that there is no linear association between x and y¥ in the 
population. Put another way, HO/0 says that linear regression ofyYonx® is no better for 
predictingyYthan the mean of the response variabley .¥: 


Does a null hypothesis of “no association” sound familiar? It should. In Chapter 11, you learned 


about chi-square tests for independence, which analyze the relationship between two categorical 
variables. The t test for slope looks for an association between two quantitative variables. 


Regression output from statistical software usually gives the value of té for a test of 
HO0:81=010 : 8, = 0. Tf you want to test a null hypothesis other than HO:61=0,/40 : 8, = 0, get 
the slope and standard error from the output and use the formula to calculate the té statistic. 

Computer software also reports the PP-value for a two-sided test of H0:81=0./20 : 8, = 0. 
For a one-sided test in the proper direction, just divide the PP-value in the output by 2. The 
following example shows what we mean. 


EXAMPLE | Crying and IQ (>) f. 


4 STEP 
Significance test for B 


4 
H. Armstrong Roberts/ClassicStock 
/Archive Photos/Getty Images 


PROBLEM: Infants who cry easily may be more easily stimulated than others. This may be 
a sign of higher IQ. Child development researchers explored the relationship between the 
crying of infants 4 to 10 days old and their later IQ test scores. A snap of a rubber band on 


the sole of the foot caused the infants to cry. The researchers recorded the crying and 
measured its intensity by the number of peaks in the most active 20 seconds. They later 


measured the children’s IQ at age three years using the Stanford—Binet IQ test. The table 


contains data from a random sample of 38 infants.+ 


Crycount IQ Crycount IQ 
20 90 17 94 


Crycount IQ 
10 87 
12 97 

9 103 
16 106 
18 109 
15 114 
12 119 
20 132 
16 136 
33 159 


Crycount 


IQ 

94 
103 
106 
109 
113 
119 
124 
135 
157 


Here is computer output from a least-squares regression analysis of these data. Do these data 
provide convincing evidence at the a=0.05a@ = 0.05 level of a positive linear relationship 
between count of crying peaks and IQ in the population of infants? 


Residual 


ae ae ee ee es he Se Om 


1Q at age three years 
Es 


100 
90 
80 
bi es T T T_T T 
0 5 10 15 20 25 30 35 40 10 15 20 25 30 35 —40 —20 0 20 40 60 
Count of crying peaks Count of crying peaks Residual 
Regression Analysis: IQ versus Crycount 
Predictor Coef SE T P 
Coef 
Constant 91.268 8.934 10822 0.000 
Crycount 1.4929 0.4870 3.07 0.004 
S = 17.50 R-Sq = 20.7% R-Sq(adj) = 18.5% 
SOLUTION: 


Follow the four-step process! 


STATE: We want to test 
Ho : 8; =0 
Ho:81 = oHa:1>0 He : fi > 0 
where 81=the slope? = the slope of the regression line relating y=1Q Score tox=count 
y = IQ Score to x = count of crying peaks in the population of infants. Use a=0.05.¢ = 0.05. 


There is some evidence for Ha/Za because b1=1.4929>0.01 = 1.4929 >0. 


PLAN: tt test ‘for the slope. 


Linear: The scatterplot shows a linear relationship between crying peaks and 1Q, and there 
are no leftover curved patterns in the residual plot. v 


The té statistic and PP-value are found in the row for “Crycount” under the 
corresponding headings. Because the P-value is for a two-sided test and there is 


evidence for HaHa; we cut the PP-value in half for the one-sided test. 


Independent: 38 is less than 10% of all infants. Vv 
Normal: The histogram of residuals does not show strong skewness or clear outliers. V 


Equal SD: The residual plot shows a fairly equal amount of scatter around the horizontal 
line at o for all xx -values. V 


Random: Random sample of 38 infants. Vv 
DO: 


® t=3.07t = 3.07 


. P-value=0.004/2=0.002" Value = 0.004/2 = 0.002 


CONCLUDE: Because the PP-value of 0.002<a=0.05,0-002 <a = 0.05, we reject Hoo: 
There is convincing evidence of a positive linear relationship between the count of crying 


peaks and 1Q score in the population of infants. 


t distribution 
with df = 36 


P-value = 0,002 


Refer to Technology Corner 31 for details on how to do calculations for a té test for 


slope. The calculator gives t=3.065t = 3.065 and P-value=0.002 using df=36. 
P-value = 0.002 using df = 36. 


FOR PRACTICE, TRY EXERCISE 15 


Based on the results of the crying and IQ study, should we ask doctors and parents to make 
infants cry more so that they’1l be smarter later in life? Hardly. This observational study gives 
Statistically significant evidence of a positive linear relationship between the two variables. 
However, we can’t conclude that more intense crying as an infant causes an increase in IQ. 
Maybe infants who cry more are more alert to begin with and tend to score higher on 
intelligence tests. 


31. TECHNOLOGY CORNER | PERFORMING A SIGNIFICANCE 
TEST FOR SLOPE 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tpsé6e. 
Let’s use the data from the crying and IQ study to perform a significance test for the slope of 
the population regression line on the TI-83/84. 


e Enter the xz-values (crying count) into L1 and the y¥-values (IQ score) into L2. 


e Press |STAT|, then choose TESTS and LinRegTTest.... 


e Inthe LinRegTTest screen, adjust the inputs as shown. Then highlight “Calculate” and 
press [ENTER|. Note: Remember that the TI-83/84 uses B? for the slope, not B1.°1- 


ORMAL FLOAT AUTO REAL RADIAN CL fl 


Xlist:Li 
Ylist:L2 
Frea:l1 

B & p:*0 <0 
RegEQ: 
Calculate 


NORMAL FLOAT RUTO REAL RADIAN CL fl 


LinkResT Test 

y=atbx 

B>@ and p> 

t=3. 065489379 

p=, 0020526501 
df=36 

a=91. 26829865 
b=1.492896598 
$s=17.49872122 


e The linear regression té test results take two screens to present. We show only the first 
screen. 


Think About It 


WHAT'S WITH THAT p>0? > 9 IN THE LinRegTTest SCREEN? The slope b1°1 of 
the least-squares regression line is closely related to the correlation rr between the 


explanatory and response variables x® and y¥Y (recall that bi=rsysx"! a "a ), In the same 
way, the slope B19 of the population regression line is closely related to the correlation p? 
(the lowercase Greek letter rho) between x® and y¥ in the population. In particular, the slope 
is 0 when the correlation is 0. 


Testing the null hypothesis HOB=0/405 = 9 is exactly the same as testing that there is no 
correlation between x and y¥ in the population from which we drew our data. You can use 
the test for zero slope to test the hypothesis HO:p=0/4 0: P= 9 of zero correlation between any 
two quantitative variables. That’s a useful trick. Because correlation also makes sense when 
there is no explanatory— response distinction, it is handy to be able to test correlation without 
doing regression. 


CHECK YOUR UNDERSTANDING 


fs 


V The preceding Check Your Understanding (page 781) described some results from an 
experiment about seat locations and test grades. Here again is the output from a least-squares 
regression analysis for these data: 


Term Coef SE Coef T-Value P-Value 
Constant eis), (il) 2h 20.22 0.000 
Row =leiily 02947 SAAS 0.248 
S = R-sq = R-sq(adj) = 
10.0673 4.73% i 33% 


1. Do these data provide convincing evidence at the a=0.05e = 0.05 significance level of 
a negative linear relationship between row and test score for students like those in the 
experiment? Assume that the conditions for regression inference are met. 

2. Describe a Type I error and a Type IJ error in this context. Which error is possible 
based on your conclusion? Explain your reasoning. 


Section 12.1 Summary 


When an association between two quantitative variables is linear, use a least-squares 
regression line to model the relationship between the explanatory variable xz and the 
response variable y.¥- 

Inference in this setting uses the sample regression line y\=b0+b1x¥ = % + 512 to estimate 
or test a claim about the population (true) regression line py=R0+B1x.4¥ ~ Bo + Piz. 

The conditions for regression inference are 


: Linear: The actual relationship between x# and yy¥ is linear. For any fixed value of x,”> 


the mean response py” falls on the population (true) regression line py=B0+PB1x. 
py = Bo { By 


« Independent: Individual observations are independent of each other. When sampling 
without replacement, check the 10% condition. 


= Normal: For any fixed value of x,» the response y¥ varies according to a Normal 
distribution. 


: Equal SD: The standard deviation of y¥ (call it 07) is the same for all values of x.2. 


= Random: The data come from a random sample from the population of interest or a 
randomized experiment. 


When the conditions for inference are met, the sampling distribution of the sample slope b1 
bi is approximately Normal with mean pb1=B1/": = 1 and standard deviation ob1=ooxn. 


Th, = —e a 
Orn 
e The slope b1°1 and intercept b0% of the sample regression line estimate the slope B11 and 
intercept B0°0 of the population (true) regression line. Use the standard deviation of the 


8 


SE,, = — 
residuals s$ to estimate o@ and the standard error of the slope SEb1=ssxn-1 ° sen 
to estimate ob1.7%" 

° Confidence intervals and significance tests for the slope 81°! of the population regression 
line are based on a t¢ distribution with n-2” — 2 degrees of freedom. 
¢ The ttinterval for the slope 81°: is b1+t#SEb1,01 = # SEs, . 
¢ To test the null hypothesis HO: 81=hypothesized slope,/40:51 = hypothesized slope, carry out 
a tétest for the slope. This test uses the standardized test statistic 
b,; —hypothesized slope 
- SE; 
t=b1-hypothesized slopeSEb1 1 


The most common null hypothesis is H0:81=0,/40:1 = 9, which says that there is no linear 
relationship between x@ and y¥ in the population. 


12.1 Technology Corners | @ 


TI-Nspire and other technology instructions are on the book’s website at 
highschool. bfwpub.com/tps6e. 


30. Constructing a confidence interval for slope Page 781 
31. Performing a significance test for slope Page 785 


Section 12.1 Exercises 


1. Predicting height Using the health records of every student at a high school, the school 
nurse created a scatterplot relating y=height¥ = height (jn centimeters) to x=age® = 48° (in 
years). After verifying that the conditions for the regression model were met, the nurse 
calculated the equation of the population regression line to be py=105+4.2x 
Hy = 205 +22 With o=7o'= 7 cm. 


a. According to the population regression line, what is the average height of 15-year-old 
students at this high school? 


b. About what percent of 15-year-old students at this school are taller than 180 cm? 


c. If the nurse used a random sample of 50 students from the school to calculate the 
regression line instead of using all the students, would the slope of the sample regression 
line be exactly 4.2? Explain your answer. 


2. Predicting high temperatures Using the daily high and low temperature readings at 
Chicago’s O’Hare International Airport for an entire year, a meteorologist made a 
scatterplot relating y=highY = high temperature to x=lowz = low temperature, both in 
degrees Fahrenheit.? After verifying that the conditions for the regression model were met, 
the meteorologist calculated the equation of the population regression line to be 
Uy=16.6+1.02xMéy = 16.6 + 1.02% vith 6-6 64°F. = 6.64°F. 


a. According to the population regression line, what is the average high temperature on 
days when the low temperature is 40°F 240° F? 


b. About what percent of days with a low temperature of 40°F40 F have a high 
temperature greater than 70°F?70 °F? 


c. If the meteorologist used a random sample of 10 days to calculate the regression line 
instead of using all the days in the year, would the slope of the sample regression line be 
exactly 1.02? Explain your answer. 

3. Oil and residuals Researchers examined data on the depth of small defects in the Trans- 
Alaska Oil Pipeline. The researchers compared the results of measurements on 100 defects 
made in the field with measurements of the same defects made in the laboratory.2 The 
figure shows a residual plot for the least-squares regression line based on these data. 
Explain why the conditions for performing inference about the slope B1°1 of the 
population regression line are not met. 
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4. SAT Math scores Is there a relationship between the percent of high school graduates in 
each state who took the SAT and the state’s mean SAT Math score? Here is a residual plot 
from a linear regression analysis that used data from all 50 states in a recent year. Explain 
why the conditions for performing inference about the slope 81°! of the population 
regression line are not met. 


Residual 


0 10 20 30 40 50 60 70 80 90 
Percent of high school graduates taking the SAT 


5. po #75 2) Beer and BAC How well does the number of beers a person drinks predict 


his or her blood alcohol content (BAC)? Sixteen volunteers aged 21 or older with an initial 
BAC of 0 took part in a study to find out. Each volunteer drank a randomly assigned 
number of cans of beer. Thirty minutes later, a police officer measured their BAC. A least- 
squares regression analysis was performed on the data using x=number’ = number of 
beers and y=BAC.Y > BAC. Here is a residual plot and a histogram of the residuals. Check 
whether the conditions for performing inference about the regression model are met. 


Residual 


0 1 7) 3 4 5 6 , 8 9 
Number of beers 


Frequency 


—0.03 -0.02 -0.01 0.00 0.01 0.02 0.03 0.04 
Residual 


6. Prey attracts predators Here is one way in which nature regulates the size of animal 


populations: high population density attracts predators, which remove a higher proportion 
of the population than when the density of the prey is low. One study looked at kelp perch 
and their common predator, the kelp bass. On each of four occasions, the researcher set up 
four large circular pens on sandy ocean bottoms off the coast of southern California. He 
randomly assigned young perch to 1 of 4 pens so that one pen had 10 perch, one pen had 20 
perch, one pen had 40 perch, and the final pen had 60 perch. Then he dropped the nets 
protecting the pens, allowing bass to swarm in, and counted the number of perch killed 
after two hours.4 A regression analysis was performed on the 16 data points using 
x=numberz = number of perch in pen and y=proportionY = Proportion of perch killed. 
Here is a residual plot and a histogram of the residuals. Check whether the conditions for 
performing inference about the regression model are met. 


Residual 
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Number of perch 


Frequency 
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7. PQ 777 |) Beer and BAC Refer to Exercise 5. Here is computer output from the least- 


squares regression analysis of the beer and blood alcohol data. 


Dependent variable is: BAC 

No Selector 

R squared = 80.0% R squared (adjusted) = 78.6% 

S = 0.0204 with 16 - 2 = 14 degrees of freedom 
Variable Coefficient s.e. of Coeff t-ratio prob 
Constant -0.012701 0.0126 -1.00 0.3320 


10. 


11. 


Beers 0.017964 0.0024 7.84 <0.0001 


a. What is the estimate for 8020? Interpret this value. 
b. What is the estimate for 8171’ Interpret this value. 
c. What is the estimate for 62? Interpret this value. 


d. Give the standard error of the slope SEb1 52h - Interpret this value. 
Prey attracts predators Refer to Exercise 6. Here is computer output from the least- 
squares regression analysis of the perch data. 


Predictor Coef Stdev. t-ration p 
Constant 0.12049 0.09269 1.30 0.215 
Perch 0.0085690 . 002456 3.49 0.004 
S = R-Sq = 46.5% R-Sq(adj) = 
0.1886 42.7% 


. What is the estimate for 80?°0° Interpret this value. 


feb) 


. What is the estimate for 8121" Interpret this value. 


oO 


c. What is the estimate for 620? Interpret this value. 


SEs, - 


d. Give the standard error of the slope SEb Interpret this value. 


Beer and BAC Refer to Exercises 5 and 7. 


a. Find the critical value for a 99% confidence interval for the slope of the true regression 
line. Then calculate the confidence interval. 


b. Interpret the interval from part (a). 


c. Explain the meaning of “99% confident” in this context. 
Prey attracts predators Refer to Exercises 6 and 8. 


a. Find the critical value for a 90% confidence interval for the slope of the true regression 
line. Then calculate the confidence interval. 


b. Interpret the interval from part (a). 


c. Explain the meaning of “90% confident” in this context. 
pq 779 |») Less mess? Kerry and Danielle wanted to investigate if tapping on a can of 


soda would reduce the amount of soda expelled after the can has been shaken. For their 
experiment, they vigorously shook 40 cans of soda and randomly assigned each can to be 
tapped for 0 seconds, 4 seconds, 8 seconds, or 12 seconds. After opening the cans and 
waiting for the fizzing to stop, they measured the amount expelled (in milliliters) by 
subtracting the amount remaining from the original amount in the can.2 Here are their data: 


Amount expelled (mL) 
(0 sec) (4 sec) (8 sec) (12 sec) 
110 95 88 80 
100 105 84 15 


105 105 87 80 


105 105 85 75 
105 95 79 70 
110 90 100 65 
107 88 85 rae 
105 95 85 77 
104 94 80 76 
106 96 80 75 


Here is some computer output from a least-squares regression analysis of these data. 
Construct and interpret a 95% confidence interval for the slope of the true regression line. 


z 
3 
g 
5 
} 
& 
< 
Tapping time (sec) 
15 ° 
E 
E 
= 
0 Z 4 6 8 10 12 
Tapping time (sec) 
5 
5 
3 
5 
fae 
-15 -10 -5 0 5 10 15 20 
Residual (mL) 
Predictor Coef SE Coef i P 
Constant 106.36 1.3238 80.345 0.000 
Tapping time -2.6350 0.1769 -14.895 0.000 


S = 5.00347 R-Sq = 85.4% R-Sq(adj) = 85.0% 


12. More mess? When Mentos are dropped into a newly opened bottle of Diet Coke, carbon 
dioxide is released from the Diet Coke very rapidly, causing the Diet Coke to be expelled 
from the bottle. To see if using more Mentos causes more Diet Coke to be expelled, 
Brittany and Allie used twenty-four 2-cup bottles of Diet Coke and randomly assigned each 
bottle to receive either 2, 3, 4, or 5 Mentos. After waiting for the fizzing to stop, they 
measured the amount expelled (in cups) by subtracting the amount remaining from the 
original amount in the bottle.® Here are their data: 


Amount expelled (cups) 


(2 Mentos) (3 Mentos) (4 Mentos) (5 Mentos) 
1.125 1.1875 1.25 1.25 
125 1-125 1.3125 1.4375 
1.0625 1.25 1.25 1.3125 
125 1.1875 1.375 1.3125 
1.125 1.3125 1.3125 1.375 
1.0625 1.1875 1.25 1.4375 


Here is computer output from a least-squares regression analysis of these data. Construct 
and interpret a 95% confidence interval for the slope of the true regression line. 
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2 
-0.10 0.00 0.10 
Residual (cups) 

Predictor Coef SE Coef T P 
Constant 1.0021 0.0451 22.215 0.000 
Mentos 0.0708 0.0123 5.770 0.000 


S = 0.06724 R-Sq = 60.2% R-Sq(adj) = 58.4% 


13. Beavers and beetles Do beavers benefit beetles? Researchers laid out 23 circular plots, 
each 4 meters in diameter, at random in an area where beavers were cutting down 
cottonwood trees. In each plot, they counted the number of stumps from trees cut by 
beavers and the number of clusters of beetle larvae. Ecologists think that the new sprouts 


from stumps are more tender than other cottonwood growth so that beetles prefer them. If 
so, More stumps should produce more beetle larvae. 

Here is computer output for a regression analysis of these data. Construct and interpret 
a 99% confidence interval for the slope of the population regression line. Assume that the 


conditions for performing inference are met. 


Predictor Coef SE Coef 1 P 
Constant -1.286 2.8523 -0.45 0.657 
Stumps 11.894 1.136 10.47 0.000 
S = 6.41939 R-Sq = 83.9% R-Sq(adj ) 83.1% 


14. Ideal proportions The students in Mr. Shenk’s class measured the arm spans and heights 
(in inches) of a random sample of 18 students from their large high school. Here is 
computer output from a least-squares regression analysis of these data. Construct and 
interpret a 90% confidence interval for the slope of the population regression line. Assume 
that the conditions for performing inference are met. 


Predictor Coef Stdev t-ratio P 
Constant 11.547 5.600 2.06 0.056 
Armspan 0.84042 0.08091 10.39 0.000 


S = 1.613  R-Sq = 87.1%  R-Sq(adj) = 86.3% 


15. pg 783 |») Weeds among the corn Lamb’s quarters is a common weed that interferes 


with the growth of corn. An agriculture researcher planted corn at the same rate in 16 small 
plots of ground and then weeded the plots by hand to allow a fixed number of lamb’s 
quarters plants to grow in each meter of corn row. The decision on how many of these 
plants to leave in each plot was made at random. No other weeds were allowed to grow. 
Here are the yields of corn (bushels per acre) in each of the plots:2 


Weeds per meter Yield Weeds per meter Yield 
0 166.7 3 158.6 
0 172.2 3 176.4 
0 165.0 3 153.1 
0 176.9 3 156.0 
1 166.2 9 162.8 
1 15 ia 9 142.4 
1 166.7 9 162.8 
1 161.1 9 162.4 


Here is some computer output from a least-squares regression analysis of these data. Do 
these data provide convincing evidence at the a=0.05@ = 0.05 level that more lamb’s 
quarters reduce corn yield? 


Yield (bushels/acre) 
ES 
1 
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Weeds per meter 
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Residual (bushels/acre) 
Predictor Coef SE Coef T P 
Constant 166.483 2.725 61.11 0.000 
Weeds per -1.0987 0.5712 =4.92 0.075 


meter 
S = 7.97665 R-Sq = 20.9%  R-Sq(adj) = 15.3% 


16. Time at the table Does how long young children remain at the lunch table help predict 
how much they eat? Here are data on a random sample of 20 toddlers observed over several 
months.2 “Time” is the average number of minutes a child spent at the table when lunch 
was served. “Calories” is the average number of calories the child consumed during lunch, 
calculated from careful observation of what the child ate each day. 


Time Calories Time Calories 
21.4 472 42.4 450 
30.8 498 43.1 410 
Sf.t 465 29.2 504 


33.5 456 1.3 437 


32.8 423 28.6 489 


39.5 437 329 436 
22.8 508 30.6 480 
34.1 431 35.4 439 
33.9 479 33.0 444 
43.8 454 43.7 408 


Here is some computer output from a least-squares regression analysis of these data. Do 
these data provide convincing evidence at the a=0.01@ = 0.01 level of a linear relationship 
between time at the table and calories consumed in the population of toddlers? 


510 4 


3S 
= 
£ 
a 
s) 
20 25 30 35 40 45 
Time (min) 
g 
a 
s 
S 
FA 
io 4 
20 25 30 35 40 45 
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Residual (cal) 
Predictor Coef SE Coef T P 
Constant 560.65 29.37 19.09 0.000 
Time -3,.0771 0.8498 -—3.62 0.002 


S = 23.3980 R-Sq = 42.1%  R-Sq(adj) = 38.9% 


17. Is wine good for your heart? A researcher from the University of California, San Diego, 


collected data on average per capita wine consumption and heart disease death rate in a 
random sample of 19 countries for which data were available. The following table displays 
the data.12 


Alcohol from wine Heart disease death rate| Alcohol from wine Heart disease death rate 


(liters/year) (per 100,000) (liters/year) (per 100,000) 
25 211 7.9 107 
3.9 167 1.8 167 
2.9 131 1.9 266 
2.4 191 0.8 227 
2.9 220 6.5 86 
0.8 297 1.6 207 
9.1 fa! 5.8 115 
Zui 172 13 285 
0.8 211 12 199 
0.7 300 


Is there convincing evidence of a negative linear relationship between wine consumption 
and heart disease deaths in the population of countries? 

18. The professor swims Here are data on the time (in minutes) Professor Moore takes to 
swim 2000 yards and his pulse rate (beats per minute) after swimming on a random sample 
of 23 days: 


Time 34.12 35.72 34.72 34.05 34.13 35.72 
Pulse 152 124 140 152 146 128 
Time 36.17 35.57 35.37 35.57 35.43 36.05 
Pulse 136 144 148 144 136 124 
Time 34.85 34.70 34.75 33.93 34.60 34.00 
Pulse 148 144 140 156 136 148 
Time 34.35 35.62 35.68 35.28 35.97 

Pulse 148 132 124 132 139 


Is there convincing evidence of a negative linear relationship between Professor Moore’s 
swim time and his pulse rate in the population of days on which he swims 2000 yards? 
19. Turn up the volume? Nicole and Elena wanted to know if listening to music at a louder 
volume negatively impacts test performance. To investigate, they recruited 30 volunteers 
and randomly assigned 10 volunteers to listen to music at 30 decibels, 10 volunteers to 
listen to music at 60 decibels, and 10 volunteers to listen to music at 90 decibels. While 
listening to the music, each student took a 10-question math test. Here is computer output 
from a least-squares regression analysis using x=volumex = volume and y=number 
y = number correct:11 


Predictor Coef SE Coef T P 
Constant 9.9000 @.7525 13.156 0.0000 


Volume -0.0483 ©.0116 -4,163 0.0003 
S = 1.55781 R-Sq = 38.2%  R-Sq(adj) = 36.0% 


Is there convincing evidence that listening to music at a louder volume hurts test 
performance? Assume the conditions for inference are met. 


20. Pencils and GPA Is there a relationship between a student’s GPA and the number of 


21, 


pencils in his or her backpack? Jordynn and Angie decided to find out by selecting a 
random sample of students from their high school. Here is computer output from a least- 


squares regression analysis using x=number = number of pencils and y=GPA:¥ = GPA: 
i2 


Predictor Coef SE Coef T P 
Constant 3.2413 0.1809 17.920 0.0000 
Pencils -—0.0423 0.0631 -0.670 0.5062 


S = 0.738533 R-Sq = 0.9%  R-Sq(adj) = 0.0% 


Is there convincing evidence of a linear relationship between GPA and number of pencils 
for students at this high school? Assume the conditions for inference are met. 

Stats teachers’ cars A random sample of 21 AP® Statistics teachers was asked to report 
the age (in years) and mileage of their primary vehicles. Here is a scatterplot of the data: 
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Here is some computer output from a least-squares regression analysis of these data. 
Assume that the conditions for regression inference are met. 


Variable Coef SE Coef t-ratio prob 
Constant 7288.54 6591 a ree 0.2826 
Car age 11630.6 1249 9.31 <Q.0001 


S = 19280 R-Sq = 82.0% RSq(adj) = 81.1% 


a. Verify that the 95% confidence interval for the slope of the population regression line is 
(9016.4, 14,244.8). 


b. A national automotive group claims that the typical driver puts 15,000 miles per year on 
his or her main vehicle. We want to test whether AP® Statistics teachers are typical 


drivers. Explain why an appropriate pair of hypotheses for this test is HO:81=15,000 
Hy : 6; = 15,000 versus Ha:81415,000.4a : By # 15,000. 


c. Compute the standardized test statistic and PP-value for the test in part (b). What 
conclusion would you draw at the a=0.05e = 0.05 significance level? 


d. Does the confidence interval in part (a) lead to the same conclusion as the test in part 
(c)? Explain your answer. 

. Paired tires Exercise 75 in Chapter 10 (page 686) compared two methods for estimating 

tire wear. The first method used the amount of weight lost by a tire. The second method 

used the amount of wear in the grooves of the tire. A random sample of 16 tires was 

obtained. Both methods were used to estimate the total distance traveled by each tire. The 


following scatterplot displays the two estimates (in thousands of miles) for each tire.42 
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Here is some computer output from a least-squares regression analysis of these data. 
Assume that the conditions for regression inference are met. 
Predictor Coef SE Coef i P 
Constant 35d 2.105 0.64 0.531 
Weight 0.79021 0.07104 Le 0.000 


S = 2.62078  R-Sq = 89.8%  R-Sq(adj) = 89.1% 


a. Verify that the 99% confidence interval for the slope of the population regression line is 
(0.5787, 1.0017). 


b. Researchers want to test whether there is a difference in the two methods of estimating 
tire wear. Explain why the researchers might want to test the hypotheses H0:61=1 
Ho: 61 = 1 versus Ha:8141 4a 7 fi #1. 


c. Compute the standardized test statistic and PP-value for the test in part (b). What 
conclusion would you draw at the a=0.010 = 0.01 significance level? 


as: 


24. 


25. 


d. Does the confidence interval in part (a) lead to the same conclusion as the test in part 
(c)? Explain your answer. 


Multiple Choice Select the best answer for Exercises 23-28. Exercises 23-28 refer to the 
following setting. To see if students with longer feet tend to be taller, a random sample of 
25 students was selected from a large high school. For each student, x=foot length 

« = foot length (cm) and y=height¥ = height (cm) were recorded. We checked that the 
conditions for inference about the slope of the population regression line are met. Here is a 
portion of the computer output from a least-squares regression analysis using these data: 


Predictor Coef SE Coef T P 
Constant 91.9766 10.2204 8.999 0.000 
Foot length 3.0867 0.4117 7.498 0.000 


S = 6.47044 R-Sq = 72.8%  R-Sq(adj)= 71.5% 


Which of the following is the equation of the least-squares regression line for predicting 
height from foot length? 


a. height\=10.2204+0.4117 (foot length)height = 10.2204 + 0.4117 (foot length) 
b. height\=0.4117+3.0867 (foot lengthybeight = 0.4117 + 3.0867 (foot length) 

c. height\=91.9766+3.0867 (foot lengthyheight = 91.9766 + 3.0867 (foot length) 
d. height\=91.9766+6.47044 (foot lengthybeight = 91.9766 + 6.47044 (foot length) 


e, height\=3.0867+6.47044 (foot length)beight = 3.0867 + 6.47044 (foot length) 

The slope 81/1 of the population regression line describes 

a. the exact increase in height (cm) for students at this high school when foot length 
increases by 1 cm. 

b. the average increase in foot length (cm) for students at this high school when height 
increases by 1 cm. 

c. the average increase in height (cm) for students at this high school when foot length 
increases by 1 cm. 

d. the average increase in foot length (cm) for students in the sample when height increases 


by 1 cm. 


e, the average increase in height (cm) for students in the sample when foot length increases 
by 1 cm. 

Is there convincing evidence that height increases as foot length increases? To answer this 

question, test the hypotheses 

a. H0:81=0M0 : £1 = 9 versus Ha:B1&gt;0.4e * Pi > 0. 

b. H0:81=040 : 61 = 9 versus Ha: B1<0.4a * Ai Kit; 0. 


H0:81=0Ho : 61 = 9 versus Ha:B1#0.4e : Fi # 0. 


C. 
d. HO0:B1&gt;00 : 81 > 9 versus Ha:B1=0.4a : Pi = 0. 
e. H0:81=14%0 : 91 = 1 versus Ha:B1&gt;14e : fi > 1. 
26. Which of the following is the best interpretation of the value 0.4117 in the computer 
output? 
a. For each increase of 1 cm in foot length, the average height increases by about 0.4117 


27. 


28. 


d. 


e. 


cm. 


When using this model to predict height, the predictions will typically be off by about 
0.4117 cm. 


The linear relationship between foot length and height accounts for 41.17% of the 
variation in height. 


The linear relationship between foot length and height is moderate and positive. 


In repeated samples of size 25, the slope of the sample regression line for predicting 
height from foot length will typically vary from the population slope by about 0.4117. 


Which of the following is a 95% confidence interval for the population slope B1?1 


a. 


Di 


3.086740.41173.0867 = 0.4117 
3.0867+0.85183.0867 = 0.8518 


c. 3.0867+0.80693.0867 + 0.8069 


d. 


e 


3.086740.84973.0867 = 0.8497 
3.0867+0.84813.0867 = 0.8481 


Which of the following would have resulted in a violation of the conditions for inference? 


a. If the entire sample was selected from one classroom 


If the sample size was 15 instead of 25 


If the scatterplot of x=footz = foot length and y=height¥ = height did not show a 
perfect linear relationship 


If the histogram of heights had an outlier 


If the standard deviation of foot length was different from the standard deviation of 
height 


Recycle and Review Exercises 29-31 refer to the following setting. Does the color in 
which words are printed affect your ability to read them? Do the words themselves affect 
your ability to name the color in which they are printed? Mr. Starnes designed a study to 


investigate these questions using the 16 students in his AP® Statistics class as subjects. 
Each student performed the following two tasks in random order while a partner timed his 
or her performance: (1) Read 32 words aloud as quickly as possible, and (2) say the color 
in which each of 32 words is printed as quickly as possible. Try both tasks for yourself 
using the word list given. 


BROWN RED BLUE GREEN 
RED GREEN BROWN BROWN 
GREEN RED BLUE BLUE 
BROWN BLUE GREEN RED 
BLUE BROWN RED RED 
RED BLUE BROWN GREEN 
BLUE GREEN GREEN BLUE 
GREEN BROWN RED BROWN 


29. Color words (4.2) Let’s review the design of the study. 
a. Explain why this was an experiment and not an observational study. 


b. Did Mr. Starnes use a completely randomized design or randomized block design? Why 
do you think he chose this experimental design? 
c. Explain the purpose of the random assignment in the context of the study. 


Here are the data from Mr. Starnes’s experiment. For each subject, the time to perform the 
two tasks is given to the nearest second. 


Subject Words Colors Subject Words Colors 
1 13 20 9 10 16 
2 10 21 10 9 13 
) 15 22 11 11 11 
4 12 25 12 17 26 
5 13 17 13 15 20 
6 11 13 14 15 15 
rg 14 32 15 12 18 
8 16 21 16 10 18 


30. Color words (10.3) Now let’s analyze the data. 


a. Calculate the difference (Colors-Words)(\Colors ~ Words) for each subject and 
summarize the distribution of differences with a boxplot. Does the graph provide 
evidence of a difference in the average time required to perform the two tasks? Explain 
your answer. 


b. Explain why it is not safe to use paired té procedures to do inference about the mean 
difference in time to complete the two tasks. 
31. Color words (3.1, 3.2, 12.1) Can we use a student’s word task time to predict his or her 
color task time? 


a. Make an appropriate scatterplot to help answer this question. Describe what you see. 


b. Use technology to find the equation of the least-squares regression line. Define any 
variables you use. 


c. Find and interpret the residual for the student who completed the word task in 9 seconds. 


d. Assume that the conditions for performing inference about the slope of the true 
regression line are met. The PP-value for a test of HO: 81=0/40:1 = 9 versus Ha: 
B1&gt;04a:F1 > 9 is 0.0215. Interpret this value. 

Note: John Ridley Stroop is often credited with the discovery in 1935 of the fact that the 

color in which “color words” are printed interferes with people’s ability to identify the 

color. The paper outlining the so-called Stroop effect, though, was originally published by 

German researchers in 1929. 

32. Yahtzee (5.3, 6.3) In the game of Yahtzee, 5 six-sided dice are rolled simultaneously. To 
get a Yahtzee, the player must get the same number on all 5 dice. 


1 a 
a. Luis says that the probability of getting a Yahtzee in one roll of the dice is (16)5. ( E , 
Explain why Luis is wrong. 


b. Nassir decides to keep rolling all 5 dice until he gets a Yahtzee. He is surprised when he 
still hasn’t gotten a Yahtzee after 25 rolls. Should he be? Calculate an appropriate 
probability to support your answer. 


SECTION 12.2 Transforming to Achieve Linearity 


LEARNING TARGETS By the end of the section, you should be able to: 


e Use transformations involving powers and roots to find a power model that 
describes the relationship between two quantitative variables, and use the model 
to make predictions. 


Use transformations involving logarithms to find a power model that describes 
the relationship between two quantitative variables, and use the model to make 
predictions. 


Use transformations involving logarithms to find an exponential model that 
describes the relationship between two quantitative variables, and use the model 
to make predictions. 


Determine which of several transformations does a better job of producing a 
linear relationship. 


In Chapter 3, we learned how to analyze relationships between two quantitative variables that 
showed a linear pattern. When two-variable data show a curved relationship, we must develop 
new techniques for finding an appropriate model. This section describes several simple 
transformations of data that can straighten a nonlinear pattern. Once the data have been 
transformed to achieve linearity, we can use least-squares regression to generate a useful model 
for making predictions. And if the conditions for regression inference are met, we can estimate 
or test a claim about the slope of the population (true) regression line using the transformed 
data. 

The Gapminder website (www.gapminder.org) provides loads of data on the health and 
well-being of the world’s inhabitants. Figure 12.6 shows a scatterplot of data from 
Gapminder./4 The individuals are all the world’s nations for which data were available in 2015. 
The explanatory variable, income per person, is a measure of how rich a country is. The 
response variable is life expectancy at birth. 
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FIGURE 12.6 Scatterplot of the life expectancy of people in many nations against each nation’s 
income per person. The color of each circle indicates the geographic region in which that 
country is located. The size of each circle is based on the population of the country—bigger 
circles indicate larger populations. 


We expect people in richer countries to live longer because they have better access to 
medical care and typically lead healthier lives. The overall pattern of the scatterplot does show 
this, but the relationship is not linear. Life expectancy rises very quickly as income per person 
increases and then levels off. People in very rich countries such as the United States live no 
longer than people in poorer but not extremely poor nations. In some less wealthy countries, 
people live longer than in the United States. 


Four African nations are outliers. Their life expectancies are similar to those of their 
neighbors, but their income per person is higher. Gabon and Equatorial Guinea produce oil, and 
South Africa and Botswana produce diamonds. It may be that income from mineral exports 
goes mainly to a few people and so pulls up income per person without much effect on either 
the income or the life expectancy of ordinary citizens. That is, income per person is a mean, 
and we know that mean income can be much higher than median income. 

The scatterplot in Figure 12.6 shows a curved pattern. We can straighten things out using 
logarithms. Figure 12.7 plots the logarithm of income per person against life expectancy for 
these same countries. The effect is remarkable. This graph has a clear, linear pattern. 
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FIGURE 12.7 Scatterplot of life expectancy against log(income per person) for many nations. 


Applying a function such as the logarithm or square root to a quantitative variable is called 
transforming the data. Transforming data amounts to changing the scale of measurement that 
was used when the data were collected. In Chapter 2 , we discussed linear transformations, 
such as converting temperature in degrees Fahrenheit to degrees Celsius or converting distance 
in miles to kilometers. However, linear transformations cannot straighten a curved relationship 
between two variables. To do that, we resort to functions that are not linear. The logarithm 
function, applied in the income and life expectancy example, is a nonlinear function. We’|l 
return to transformations involving logarithms later. 


Transforming with Powers and Roots 


Vasko/Getty Images 


When you visit a pizza parlor, you order a pizza by its diameter—say, 10 inches, 12 inches, or 
14 inches. But the amount you get to eat depends on the area of the pizza. The area of a circle is 
m7 times the square of its radius r.’. So the area of a round pizza with diameter x is 


area = mr? = n(=) — a? = 52 
2 4 4 


This is a power model of the form y=axpY = @2” with a=n/42 = 7/4 and p=2.P = 2: 
Pp y=axp p 


area=mr2=n(x2)2=n(x24)=n4x2 


When we are dealing with things of the same general form, whether circles or fish or 
people, we expect area to go up with the square of a dimension such as diameter or height. 
Volume should go up with the cube of a linear dimension. That is, geometry tells us to expect 
power models in some settings. There are other physical relationships between two variables 
that are described by power models. Here are some examples from science. 


e The distance that an object dropped from a given height falls is related to time since release 
by the model 


; _— 
distance=a(time)2“istance = a(time) 


e The time it takes a pendulum to complete one back-and-forth swing (its period) is related to 
its length by the model 


period=alength=a(length)1/2Period = a,/length = a(length)'”” 
e The intensity of a light bulb is related to distance from the bulb by the model 


‘ ‘ ; =) 
intensity = = a(distance) 


intensity=adistance2=a(distance)—2 distance” 


Although a power model of the form y=axpY = @®” describes the nonlinear relationship 
between x and y¥ in each of these settings, there is a linear relationship between xp” and y. 
Y¥- Tf we transform the values of the explanatory variable x by raising them to the p? power, 


and graph the points (xp, y),\?"; Y) > the scatterplot should have a linear form. The following 
example shows what we mean. 


EXAMPLE |Go fish! @& 


Transforming with powers 


YZ 


iy Up 


Les Breault/Alamy 


PROBLEM: Imagine that you have been put in charge of organizing a fishing tournament in 
which prizes will be given for the heaviest Atlantic Ocean rockfish caught. You know that 
many of the fish caught during the tournament will be measured and released. You are also 
aware that using delicate scales to try to weigh a fish that is flopping around in a moving 
boat will probably not yield very accurate results. It would be much easier to measure the 
length of the fish while on the boat. What you need is a way to convert the length of the fish 


to its weight. 


You contact the nearby marine research laboratory, and it provides reference data on the 
length (in centi- meters) and weight (in grams) for Atlantic Ocean rockfish of several sizes.12 


Here is a scatterplot of the data. Note the clear curved form. 


Length 5.2 8.5 ibs) 14.3 16.8 19:2 
Weight 2 8 ZA 38 69 117 
Length 28.2 29.6 30.8 32.0 33.0 34.0 
Weight 318 Sie 455 504 518 537 
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Because length is one-dimensional and weight (like volume) is three-dimensional, a power 


a 3 
model of the form weight=a (length)3¥eight = a(length)” should describe the relationship. 
Here is a scatterplot of weight versus length?: 
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Because the transformation made the association roughly linear, we used computer software 
to perform a linear regression analysis of y=weightY = Weight versus x=length3. 
z = length*. 


Regression Analysis: Weight versus Length‘3 


Predictor Coef SE Coef T P 
Constant 4.066 6.902 0.59 0.563 
Length43 0.0146774 0.0002404 61.07 0.000 


$=18.8412 R-Sq=99.5% R- 
Sq(adj )=99 ; 5x5 = 18.8412 R-Sq = 99.5% R-Sq(adj) = 99.5% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Suppose a contestant in the fishing tournament catches an Atlantic Ocean rockfish that’s 
36 centimeters long. Use the model from part (a) to predict the fish’s weight. 


SOLUTION: 
weight — 4.066 + 0.0146774(36)° 


b. weight\=4.066+0.0146774(36)3weight\=688.9 grams Weight = 688.9 grams 


If you write the equation as y*=4.066+0.0146774x3,! = 4.066 + 0.01467742°, make 


sure to define y=weight¥ = Weight and x=length.% = length. 


FOR PRACTICE, TRY EXERCISE 33 


There’s another way to transform the data in the “Go fish!” example to achieve linearity. 


We can take the cube root of the weight values and graph weight3 V Weight versus length. 
Figure 12.8 shows that the resulting scatterplot has a linear form. 
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FIGURE 12.8 The scatterplot of weight3 V weight versus length is linear. 


3 
Why does this transformation work? Start with weight=a(length)3¥ight = a(length)” ang 
take the cube root of both sides of the equation: 


\/weight = 4) a(length)* 
weight3=a(length)33weight3=a3(length) v weight = ,/a (length) 


That is, there is a linear relationship between length and weight3. v weight. 


EXAMPLE |Go fish! @& 


Transforming with roots 


ae - 
Doug Wilson/Alamy 


PROBLEM: Figure 12.8 shows that the relationship between length and weight3V W@lsht jg 
roughly linear for Atlantic Ocean rockfish. Here is computer output from a linear regression 


analysis of y=weight3¥ ~ v weight versus x=length® = length 


Regression Analysis: weight3V “°85t versus Length 


Predictor Coef SE Coef T P 
Constant -—0.02204 0.07762 -0.28 0.780 
Length 0.246616 0.002868 86.00 0.000 


S = 0.124161 R-Sq = 99.8% R-Sq(adj) = 99.7% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Suppose a contestant in the fishing tournament catches an Atlantic Ocean rockfish that’s 
36 centimeters long. Use the model from part (a) to predict the fish’s weight. 


SOLUTION: 


ie 


Lo = 4 +. 0.246616 
) weight3A=-0.02204+0.246616 (length) \/ weight 0.02204 + 0.246616 (length) 


0.02204 +- 0.246616z2, 


If you write the equation as y3=-0.02204+0.246616x,V 4 = 


make sure to define y=weight¥ = weight and x=length.2 = length. 


b. weight3\=-0.02204+0.246616(36)=8.85 6weightA=8.8563 =694.6grams 
ae 4 g 
\/ weight = -—0.02204 + 0.246616 (36) = 8.856 


weight = §8.856° = 694.6 grams 


The least-squares regression line gives the predicted value of the cube root of 


weight. To get the predicted weight, reverse the cube root by raising the result to 
the third power. 


FOR PRACTICE, TRY EXERCISE 35 


When experience or theory suggests that the relationship between two variables is 
described by a power model of the form y=axpY = @©” where pP is known, there are two 
methods for transforming the data to achieve linearity. 


.. Raise the values of the explanatory variable x# to the pP? power and plot the points (xp, y). 
(x?,y). 


’, Take the pPth root of the values of the response variable y¥ and plot the points (xp, y). 
(x?,y). 


What if you have no idea what value of p? to use? You could guess and test until you find a 
transformation that works. Some technology comes with built-in sliders that allow you to 
dynamically adjust the power and watch the scatterplot change shape as you do. 


It turns out that there is a much more efficient method for linearizing a curved pattern in a 
scatterplot. Instead of transforming with powers and roots, we use logarithms. This more 
general method works when the data follow an unknown power model or any of several other 
common mathematical models. 


Transforming with Logarithms: Power Models 


To achieve linearity from a power model, we apply the logarithm transformation to both 
variables. Here are the details: 


.. A power model has the form y=axp,¥ = 2”, where a and p? are constants. 
’. Take the logarithm of both sides of this equation. Using properties of logarithms, we get 
log y=log (axp)=loga+log(xp)=log atp log x 
log y = log (ax’) = loga + log (x”) = loga + ploga 
The equation 
log y=log atp log xlog y = loga + plogz 


shows that taking the logarithm of both variables results in a linear relationship between 
log x@ and log y.¥-Note: You can use base-10 logarithms or natural (base-e) logarithms to 
straighten the association. 
3. Look carefully: the powerp?P in the power model becomes the slope of the straight line that 
links log y!0S ¥ to log x.!0g 2. 

If a power model describes the relationship between two variables, a scatterplot of the 
logarithms of both variables should produce a linear pattern. Then we can fit a least-squares 
regression line to the transformed data and use the linear model to make predictions. Here’s an 
example. 


EXAMPLE |Go fish! (>) 


Transforming with logarithms: Power models 


RGB Ventures/SuperStock 
/Alamy 


PROBLEM: In the preceding examples, we used powers and roots to find a model for 

predicting the weight of an Atlantic Ocean rockfish from its length. We still expect a power 
3 

model of the form weight=a(length)3¥isht = a(length)” based on geometry. Here once 

again is a scatterplot of the data from the local marine research lab: 


Weight (g) 


Length (cm) 


We took the logarithm (base 10) of the values for both variables. Here is some computer 
output from a linear regression analysis of the transformed data. 
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Regression Analysis: log(Weight) versus log(Length) 


Predictor Coef SE Coef T P 
Constant -1.89940 0.03799 -49.99 0.000 
log(Length) 3.04942 0.02764 110.31 0.000 


S = 0.0281823 R-Sq = 99.9% R-Sq(adj) = 99.8% 


a. Based on the output, explain why it would be reasonable to use a power model to describe 
the relationship between weight and length for Atlantic Ocean rockfish. 


b. Give the equation of the least-squares regression line. Be sure to define any variables you 
use. 


c. Suppose a contestant in the fishing tournament catches an Atlantic Ocean rockfish that’s 


36 centimeters long. Use the model from part (b) to predict the fish’s weight. 


SOLUTION: 


a. The scatterplot of log(weight) versus log(length) has a linear form, and the residual plot 
shows a fairly random scatter of ‘points about the residual = olineresidual = Oline. Soa 
power model seems reasonable here. 


If a power model describes the relationship between two variables x and y, thena 


linear model should describe the relationship between log x and log y. 


b. log(weight)\=-1.89940+3.04942 log (length) 


ea 


log (weight) = -—1.89940 + 3.04942 log (length) 


If you write the equation as log(y)*=—-1.89940+3.04942 log(x), 


log (y) = —1.89940 + 3.04942 log(x), make sure to define y=weight¥ = weight and 
x=length.2 = length. 


G log(weight)\=-1.89940+3.04942 [og(36)=2.84 64 log(weight)\=2.8464weight\=102.8464 
log (weight) = -—1.89940 + 3.04942 log (36) = 2.8464 


log (weight) = 2.8464 


weight = 1075 ~ 702.1 grams 


The least-squares regression line gives the predicted value of the base-10 


logarithm of weight. To get the predicted weight, undo the logarithm by raising 10 
to the 2.8464 power. 
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in (weight) 
> 


2.0 25 3.0 Ke 4.0 
In (length) 


On the TI-83/84, you can “undo” the logarithm using the function keys. To solve 


log(weight)\=2.8464, log (weight) = 2.8464, press 2.8464 


In addition to base-10 logarithms, you can also use natural (base-e) logarithms to transform 
the variables. Using the same Atlantic Ocean rockfish data, here is a scatterplot of In(weight) 
versus In(length). 


The least-squares regression line for these data is 
In(weight)\=—4.3735+3.04942 In(length)!2 (weight) = —4.3735 + 3.04942 In (length) 


To predict the weight of an Atlantic Ocean rockfish that is 36 centimeters, we start by 
substituting 36 for length. 


In(weight)\=-4.3735+3.04942 In(36)=6.55415 
In (weight) = —4.3735 + 3.04942 In (36) = 6.55415 


To get the predicted weight, we then undo the natural logarithm by raising e to the 6.55415 
power. 


weight’\=e6.55415=702.2 gramsweight = e®°*! = 702.2 grams 


On the TI-83/84, you can “undo” the natural logarithm using the function keys. To solve 


In(weight)=6.55415,m(weight) = 6.55415, procs 6.55415 |ENTER|. 


Your calculator and most statistical software will calculate the logarithms of all the values 
of a variable with a single command. The important thing to remember is that if the relationship 
between two variables is described by a power model, then we can linearize the relationship by 
taking the logarithm of both the explanatory and response variables. 


Think About It 


HOW DO WE FIND THE POWER MODEL FOR PREDICTING YY from X?X°? The 
least-squares line for the transformed rockfish data is 


poeta. 


log(weight)\=-1.89940+3.04942 log(length)!os (weight) = —1.89940 + 3.04942 log (length) 


If we use the definition of the logarithm as an exponent, we can rewrite this equation as 


weight\=10-1.89940 + 3.04942 log(length) Weight = {0 1.89940+4-3.04942 log(length) 
Using properties of exponents, we can simplify this as follows: 


weight“=10-1.89940- 103.04942log(length)using the fact 
thatbmbn=bm+nweight/=10-1.89940- 10log(length)3.04942using the fact 
that plog x=logxpweight/=0.0126(length)3.04942using the fact that 10logx=x 

weight = $f 1.89940 | 1093-04942 log(length) using the fact thath™b" — p™*" 

weight = 10~!59%0 . joleatlength)3.04942 cing the fact that plog 2 = log x? 

weight = 0.0126(length)*""” using the fact that 10logx = x 

This equation is now in the familiar form of a power model y=axpY = @2” with a=0.0126 

ja = 0.0126 and b=3.04942.b = 3.04942. Notice how close the power is to 3, as expected from 
geometry. 


We could use the power model to predict the weight of a 36-centimeter-long Atlantic 
Ocean rockfish: 


erat 9@\3-04942 
weight = 0.0126(36)3.04942%701.76 gramsWeight = 0.0126(36) ~ 701.76 grams 


This is roughly the same prediction we got earlier. Here is the scatterplot of the original 
ockfish data with the power model added. Note how well this model fits the association! 
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Transforming with Logarithms: Exponential Models 


A linear model has the form y=b0+b1x.¥ = %0 + 61”. The value of y¥ increases (or decreases) 
at a constant rate as x@ increases. The slope b12 describes the constant rate of change of a 
linear model. That is, for each 1-unit increase in x,”» the model predicts an increase of b151 
units in y.¥- You can think of a linear model as describing the repeated addition of a constant 
amount. Sometimes the relationship between y¥ and x is based on repeated multiplication by 
a constant factor. That is, each time x@ increases by 1 unit, the value of y¥ is multiplied by b.. 


An exponential model of the form y=abx¥ = @" describes such growth by multiplication. 

Populations of living things tend to grow exponentially if not restrained by outside limits 
such as lack of food or space. More pleasantly (unless we’re talking about credit card debt!), 
money also displays exponential growth when interest is compounded each time period. 
Compounding means that the last period’s income earns income in the next period. Figure 12.9 
shows the balance of a savings account where $100 is invested at 6% interest, compounded 
annually (assuming no additional deposits or withdrawals). After x2 years, the account balance 
yY is given by the exponential model y=100(1.06)x.¥ — 100(1.06)". 
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FIGURE 12.9 Scatterplot of the exponential growth of a $100 investment in a savings account 
paying 6% interest, compounded annually. 


An exponential model of the form y=abx¥ = 5° describes the relationship between x2 and 
y,¥> where a@ and bé are constants. We can use logarithms to produce a linear relationship. 
Start by taking the logarithm of each side (we’Il use base 10, but the natural logarithm In using 
base e€ would work just as well). Then use algebraic properties of logarithms to simplify the 
resulting expressions. Here are the details: 


log y=log (abx)taking the logarithm of both sideslog y=log at+log (bx)using the property 
log(mn)=log m+log nlog y=log a+x log busing the property log mp=p log m 
log y = log (ab*) taking the logarithm of both sides 
log y = loga + log (b*) using the property log (mn) = log m + logn 


logy=loga+alogb using the property logm? = plogm 
We can then rearrange the final equation as 
log y=log a+(log b)x!o8 y = loga + (log b) x 


Notice that log alg @ and log plog © are constants because a and b are constants. So the 
equation gives a linear model relating the explanatory variable x# to the transformed variable 
log y.¥- Thus, if the relationship between two variables follows an exponential model, a 
scatterplot of the logarithm of y¥ against xx should show a roughly linear association. 


EXAMPLE |Moore’s law and computer chips (>) 


Transforming with logarithms: Exponential models 


Nandana de silva/Alamy 


PROBLEM: Gordon Moore, one of the founders of Intel Corporation, predicted in 1965 that 
the number of transistors on an integrated circuit chip would double every 18 months. This is 
Moore’s law, one way to measure the revolution in computing. Here are data on the dates 


and number of transistors for Intel microprocessors:1® 


Processor Year Transistors |Processor Year Transistors 

Intel 4004 1971 2,300|Pentium III Tualatin 2001 45,000,000 
Intel 8008 1972 3,500|Itanium 2 McKinley 2002 220,000,000 
Intel 8080 1974 4,500|Itanium 2 Madison 6M 2003 410,000,000 
Intel 8086 1978 29,000 |Itanium 2 with 9 MB cache 2004 592,000,000 
Intel 80286 1982 134,000) Dual-core Itanium 2 2006 1,700,000,000 
Intel 80386 1985 275,000/Six-core Xeon 7400 2008 1,900,000,000 
Intel 80486 1989 1,180,235|8-core Xeon Nehalem-EX 2010 2,300,000,000 
Pentium 1993 3,100,000/10-core Xeon Westmere- 2011 2,600,000,000 


Pentium Pro 1995 5,500,000|61-core Xeon Phi 2012 5,000,000,000 
Pentium Il Klamath 1997 7,500,000/18-core Xeon Haswell-E5 2014 5,560,000,000 
Pentium Ill Katmai 1999  9,500,000/22-core Xeon Broadwell- 2016 7,200,000,000 


Pentium 4 2000 42,000,000 
Willamette 


Here is a scatterplot that shows the growth in the number of transistors on a computer chip 
from 1971 to 2016. Notice that we used “years since 1970” as the explanatory variable. 
We’ll explain this on page 806. If Moore’s law is correct, then an exponential model should 
describe the relationship between the variables. 


a. Here is a scatterplot of the natural (base-ee) logarithm of the number of transistors on a 
computer chip versus years since 1970. Based on this graph, explain why it would be 
reasonable to use an exponential model to describe the relationship between number of 


transistors and years since 1970. 
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b. Here is some computer output from a linear regression analysis of the transformed data. 
Give the equation of the least-squares regression line. Be sure to define any variables you 
use. 


c. Use your model from part (b) to predict the number of transistors on an Intel computer 
chip in 2020. 


Predictor Coef SE Coef T P 
Constant 7.2272 0.3058 23.64 0.000 
Years since 1970 0.3542 0.0102 34.59 0.000 


S = 0.6653 R-Sq = 98.2% R-Sq(adj) = 98.2% 
SOLUTION: 


a. The scatterplot of (n(transistors) versus years since 1970 has a fairly linear pattern. So 
an exponential model seems reasonable here. 


If an exponential model describes the relationship between two variables xz and y, 


Ys we expect a scatterplot of (x,%> In y¥) to be roughly linear. 


b. (n(transistors)A= P22 f2tOe25 42 (years since19 70) 
In(transistors) = 7.2272 + 0.3542 (years since 1970) 


2020 is 50 years since 1970. 


©. (n(transistors)A=7.22 72+0.3542 (so) = 24.9372transistorsA=e24.9372 = 
In(transistors) = 7.2272 + 0.3542 (50) = 24.9372 


a 


67,622,053,360 transistors = e71%'? — 67,622,053, 360 


This model predicts that an Intel chip made in 2020 will have about 68 billion 
transistors. 


The least-squares regression line gives the predicted value of In (transistors). To 


get the predicted number of transistors, undo the logarithm by raising ee to the 
24.9372 power. 


FOR PRACTICE, TRY EXERCISE 43 


Here is a residual plot for the linear regression in part (b) of the example: 


Residual 


T T T T T ee 
0 10 20 30 40 50 


Years since 1970 


The residual plot shows a leftover pattern, with the residuals going from positive to 
negative to positive to negative as we move from left to right. However, the residuals are small 
in size relative to the transformed y¥-values, and the scatterplot of the transformed data is 
much more linear than the original scatterplot. We feel reasonably comfortable using this 
model to make predictions about the number of transistors on a computer chip. 

Let’s recap this big idea: When an association follows an exponential model, the 
transformation to achieve linearity is carried out by taking the logarithm of the response 
variable. The crucial property of the logarithm for our purposes is that if a variable grows 
exponentially, its logarithm grows linearly. 


Think About It 


HOW DO WE FIND THE EXPONENTIAL MODEL FOR PREDICTING YY FROM 
X?.X°” The least-squares line for the transformed data in the computer chip example is 


In(transistors)\=7.2272+0.3542(years since 1970) 
In (transistors) = 7.2272 + 0.3542 (years since 1970) 


If we use the definition of the logarithm as an exponent, we can rewrite this equation as 


transistors/\=e7.2272+0.3542(years since 1970)transistors = e*?" 


2+0,3542(years since 1970) 


Using properties of exponents, we can simplify this as follows: 


transistors\=e7.2272-e0.3542(years since 1970)using the fact 
that bnbn=bm-+ntransistors\=e7.2272-(e0.3542)(years since 1970)using the fact 
that (bm)n=bmntransistors\=1376.4-(1.4250)(years since 1970)simplifying 


S = 957% FAs — id ° 7 — 

transistors = e”2272 . ¢9-3542(years since 1970) using the fact that bb" = b"™"" 
See 7.227 35: rears since 1970 . 

transistors = e7-2272 . (¢9-3542){vears since 1970) cing the fact that (b”)" = b™” 


transistors = 1376.4 - (1.4250) sme 9) simplifying 


This equation is now in the familiar form of an exponential model y=abx¥ > ab” with 
a=1376.4a = 1376.4 and b=1.4250.5 = 1.4250. Here is the scatterplot of the original transistor 
data with the exponential model added: 


Transistors (billions) 
Co = N ww + na na ~ © 


0 5 10 15 20 25 30 35 40 45 
Years since 1970 


We could use the exponential model to predict the number of transistors on an Intel chip in 
2020: transistors\=1376.4(1.4250)50*6.7529-1010. 


ETT - f 5 50 ~~ 10 i. 8 bos . 
transistors = 1376.4(1.4250)"" ~ 6.7529 - 10". This is roughly the same prediction we obtained 


earlier. 


The calculation at the end of the Think About It feature might give you some idea of why 
we used years since 1970 as the explanatory variable in the example. To make a prediction, we 
substituted the value x=502 = 50 into the equation for the exponential model. This value is the 
exponent in our calculation. If we had used year as the explanatory variable, our exponent 


would have been 2020. Such a large exponent can lead to overflow errors on a calculator. 


Putting It All Together: Which Transformation Should We Choose? 


Suppose that a scatterplot shows a curved relationship between two quantitative variables xx 
and y.¥- How can we decide whether a power model or an exponential model better describes 
the relationship? The following example shows the strategy we should use. 


EXAMPLE |What’s a planet, anyway? (>) 
Choosing a model 


NASA/JPL-Caltech 


PROBLEM: On July 31, 2005, a team of astronomers announced that they had discovered 
what appeared to be a new planet in our solar system. They had first observed this object 
almost two years earlier using a telescope at Caltech’s Palomar Observatory in California. 
Originally named UB313, the potential planet is bigger than Pluto and has an average 
distance of about 6.3 billion miles from the sun. (For reference, Earth is about 93 million 
miles from the sun.) Could this new astronomical body, now called Eris, be a new planet? 


At the time of the discovery, there were nine known planets in our solar system. Here are 
data on the distance from the sun and period of revolution of those planets, along with a 
scatterplot. Note that distance is measured in astronomical units (AU), the number of Earth 
distances the object is from the sun.12 There appears to be a strong curved relationship 
between distance from the sun and period of revolution. 


In August 2006, the International Astronomical Union agreed on a new definition of planet. Both 


Pluto and Eris were classified as “dwarf planets.” 


Distance from __ Period of revolution 
Planet sun (AU) (Earth years) 


Mercury 0.387 0.241 


Venus 0.723 0.615 
Earth 1.000 1.000 
Mars 1.524 1.881 
Jupiter 5.203 11.862 
Saturn 9.539 29.456 
Uranus 19.191 84.070 
Neptune 30.061 164.810 
Pluto 39.529 248.530 
250 
200 
5 150 
3 
100 
oui 
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0 es 
T T | T T 
0 10 20 30 40 


Distance (AU) 


The following graphs show the results of two different transformations of the data. The 
graph on the left plots the natural logarithm of period against distance from the sun for all 
nine planets. The graph on the right plots the natural logarithm of period against the natural 
logarithm of distance from the sun for the nine planets. 
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. Based on the scatterplots, would an exponential model or a power model provide a better 


description of the relationship between distance and period? Justify your answer. 


. Here is computer output from a linear regression analysis of the transformed data in the 


graph on the right. Give the equation of the least-squares regression line. Be sure to define 
any variables you use. 


Predictor Coef SE Coef Al P 
Constant 0.0002544 ©0.0001759 1.45 0.191 
In(distance) 1.49986 0.00008 18598.27 0.000 


S = 0.000393364 R-Sq = 100.0% R-Sq(adj) = 100.0% 


c. Use your model from part (b) to predict the period of revolution for Eris, which is about 
68.05 AU from the sun. 


d. Here is a residual plot for the linear regression in part (b). Do you expect your prediction 
in part (c) to be too large, too small, or about right? Justify your answer. 


0.00075 
0.00050 
0.00025 


0.00000 


Residual 


—).00025 


—0.00050 - 


In (distance) 


SOLUTION: 


a. The graph of (n(period) Versus (n(distance) ts much more linear than the graph of 


(n(period) Versus distance, SO a power model would be more appropriate. 


If an association follows an exponential model, a graph of In(y) versus x should be 


roughly linear. If an association follows a power model, a graph of In(y) versus In(x) 
should be roughly linear. 


b. (n(period)A = 0.000254441.49986 (n(distance) 
In (period) = 0.0002544 + 1.49986 In (distance) 


iG (n(period)\=0.0002544+1.49986 [n(68.05)=6.330period=e6.330%561 years 
In (period) = 0.0002544 + 1.49986 In (68.05) 
= 6.330 


Sen 6.330 
period = e””” = 561 years 


To predict the period, we have to undo the natural logarithm by raising e to the 


6.330 power. 


d. Eris’s value for (n(distance) is (n(68.05) = 4.220 (68.05) = 4.22, which would fall at the 
far right of the residual plot, where all the residuals are positive. Because it ts likely 


that residual=actualy-predictedy>otesidual = actual y — predicted y > 0, we would 


expect Our prediction to be too small. 
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Here is the scatterplot of the original planetary data with the power model added. It seems 
remarkable that period of revolution is closely related to the 1.5 power of distance from the 
sun. Johannes Kepler made this fascinating discovery about 400 years ago without the aid of 
modern technology—a result known as Kepler’s third law. 
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What if the scatterplots of (log x,”> log y¥) and (x,”> log y¥) both look linear? Fit a least- 
squares regression line to both sets of transformed data. Then compare residual plots and look 
for the one with the most random scatter. If the residual plots look roughly the same, use the 
values of ss and r2r” to decide whether a power model or an exponential model is a better 


choice. 


We have used statistical software to do all the transformations and linear regression 
analysis in this section so far. Now let’s look at how the process works on a graphing 


calculator. 


32. Technology Corner | TRANSFORMING TO ACHIEVE 
LINEARITY 


TI-Nspire and other technology instructions are on the book’s website at 


highschool. bfwpub.com/tps6e. 


We’ll use the planet data to illustrate a general strategy for performing transformations with 
logarithms on the TI-83/84. A similar approach could be used for transforming data with 


powers and roots. 


e Enter the values of the explanatory variable in L1 and the values of the response variable 
in L2. Make a scatterplot of y¥ versus x2 and confirm that there is a curved pattern. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


Ploti:LisL2 


e Define L3 to be the natural logarithm (In) of L1 and L4 to be the natural logarithm of L2. 
To see whether a power model fits the original data, make a plot of In y¥ (L4) versus In x 
© (L3) and look for linearity. To see whether an exponential model fits the original data, 
make a plot of In y¥ (L4) versus x2 (L1) and look for linearity. 


fl NORMAL FLOAT AUTO REAL RADIAN CL fl 


Ploti:LisL4 


X=".9493306 ¥="1.422958 Y="1.422958 


e Ifa linear pattern is present, calculate the equation of the least-squares regression line. For 
the planet data, we executed the command LinReg(at+bx)L3,L4,LinReg (a + bx) L3, L4. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 
y=atbx 


a=2.5444423E -4 
b=1.499860986 


r2=, 9999999798 
r=. 9999999899 


e Construct a residual plot to look for any departures from the linear pattern. For Xlist, 
enter the list you used as the explanatory variable in the linear regression calculation. For 
Ylist, use the RESID list stored in the calculator. For the planet data, we used L3 as the 
Xlist. 


NORMAL FLOAT AUTO REAL RADIAN CL Hl 


Ploti:Ls,sRESID 


X=".9493306 Y=6.5112E"4 


CHECK YOUR UNDERSTANDING 


. 


y One sad fact about life is that we’ll all die someday. Many adults plan ahead for their 
eventual passing by purchasing life insurance. Many different types of life insurance policies 
are available. Some provide coverage throughout an individual’s life (whole life), while 
others last only for a specified number of years (term life). The policyholder makes regular 
payments (premiums) to the insurance company in return for the coverage. When the insured 
person dies, a payment is made to designated family members or other beneficiaries. 


How do insurance companies decide how much to charge for life insurance? They rely on a 
staff of highly trained actuaries—people with expertise in probability, statistics, and 
advanced mathematics—to establish premiums. For an individual who wants to buy life 
insurance, the premium will depend on the type and amount of the policy as well as personal 
characteristics like age, sex, and health status. 


The table shows monthly premiums for a 10-year term-life insurance policy worth 
$1,000,000.18 


Age (years) Monthly premium 
40 $29 
45 $46 
50 $68 
55 $106 
60 $157 
65 $257 


The output shows three possible models for predicting monthly premium from age. Option 1 


is based on the original data, while Options 2 and 3 involve transformations of the original 
data. Each set of output includes a scatterplot with a least-squares regression line added and 
a residual plot. 


insurance [ Scatter Plot] #] [Scatter Plot]9}) | Insurance [ Scatter Plot] ¥] 
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Age In(Age) 
—— premium =~—343 + 8.63 (age) —— In(premium) = -12.98 + 4.416 In(age) —— In(premium) = -0.063 + 0.0859 (age) 


rr 2 


1. Use each model to predict how much a 58-year-old would pay for such a policy. 


2. What type of function—linear, power, or exponential—best describes the relationship 
between age and monthly premium? Explain your answer. 


Section 12.2 Summary 


e Curved relationships between two quantitative variables can sometimes be changed into 
linear relationships by transforming one or both of the variables. Once we transform the 
data to achieve linearity, we can fit a least-squares regression line to the transformed data 
and use this linear model to make predictions. 


e When theory or experience suggests that the relationship between two variables follows a 
power model of the form y=axp¥ = @©” transformations involving powers and roots can 
linearize a curved pattern in a scatterplot. 

: Option 1: Raise the values of the explanatory variable xz to the power p, then look at a 
graph of (xp,y).(2"»¥) 
« Option 2: Take the pth root of the values of the response variable y,¥ then look at a graph 
of (xyp).(" VI) 
e Another useful strategy for straightening a curved pattern in a scatterplot is to take the 


logarithm of one or both variables. When a power model describes the relationship between 
two variables, a plot of log y¥ versus log x# (or In y¥ versus In x) should be linear. 


e For an exponential model of the form y=abx,¥ = @"; the predicted values of the response 
variable are multiplied by a factor of b for each increase of 1 unit in the explanatory variable. 
When an exponential model describes the relationship between two variables, a plot of log y 


Y versus x (or In y¥ versus x) should be linear. 


12.2 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


32. Transforming to achieve linearity Page 809 


Section 12.2 Exercises 


33. PQ 797 |») The swinging pendulum Mrs. Hanrahan’s precalculus class collected data 


on the length (in centimeters) of a pendulum and the time (in seconds) the pendulum took 
to complete one back-and-forth swing (called its period). The theoretical relationship 
between a pendulum’s length and its period is 


at 
period =—— ,/length 


period=2nglength v9 


where g@ is a constant representing the acceleration due to gravity (in this case, 


9=980 cm/s29 = 980em/s") Here is a graph of period versus length, V !@®8*4; along with 
output from a linear regression analysis using these variables. 


2.2 
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Regression Analysis: Ciength \/length, period) 


Predictor Coef SE Coef T P 
Constant -0.08594 0.05046 -1.70 0.123 
sqrt(length) @.209999 0.008322 25.23 0.000 
S = 0.0464223 R-Sq = 98.6% R-Sq(adj) = 98.5% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Use the model from part (a) to predict the period of a pendulum with length 80 cm. 


34. Boyle’s law If you have taken a chemistry or physics class, then you are probably familiar 
with Boyle’s law: for gas in a confined space kept at a constant temperature, pressure 
times volume is a constant (in symbols, PV=kPV = k). Students in a chemistry class 
collected data on pressure and volume using a syringe and a pressure probe. If the true 
relationship between the pressure and volume of the gas is PV=k,PV = &; then 


1 


P=k 
P=k 1V V 


l 
Here is a graph of pressure versus 1volume, volume’ along with output from a linear 
regression analysis using these variables: 


3.0 


2.0 ° 


Pressure (atm) 


i 
0.050 0.075 0.100 0.125 0.150 0.175 
1/V 


(- pressure ) 
Regression Analysis: (1volume,pressure) \ ‘°° 


Predictor Coef SE Coef T P 
Constant 0.36774 0.04055 9.07 0.000 
1/V 15.8994 0.4190 37.95 0.000 


S = 0.044205 R-Sq = 99.6% R-Sq(adj) = 99.5% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Use the model from part (a) to predict the pressure in the syringe when the volume is 17 
cubic centimeters. 


35. PQ 799 |p) The swinging pendulum Refer to Exercise 33. Here is a graph of period? 


versus length, along with output from a linear regression analysis using these variables. 


Period” 
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Length (cm) 

Regression Analysis: (length, period?) 
Predictor Coef SE Coef T P 
Constant -0.15465 0.05802 -2.67 0.026 
Length 0.042836 0.001320 32.46 0.000 


S = 0.105469 R-Sq = 99.2% R-Sq(adj) = 99.1% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Use the model from part (a) to predict the period of a pendulum with length 80 
centimeters. 


i 
36. Boyle’s law Refer to Exercise 34. Here is a graph of 1Pressure Pressure versus volume, 


along with output from a linear regression analysis using these variables: 


5.0 LS 10.0 12.5 15.0 17.5 20.0 
Volume (cm*) 


Regression Analysis: 


(volume, ats) 

(volume, 1Pressure) a 
Predictor Coef SE Coef T P 
Constant 0.100170 0.003779 26.51 0.000 
Volume ©.0398119 0.0002741 145.23 0.000 


S = 0.003553 R-Sq = 100.0% R-Sq(adj) = 100.0% 


a. Give the equation of the least-squares regression line. Define any variables you use. 


b. Use the model from part (a) to predict the pressure in the syringe when the volume is 17 
cubic centimeters. 


37. pg 800 _») The swinging pendulum Refer to Exercise 33. We took the logarithm (base 


10) of the values for both length and period. Here is some computer output from a linear 
regression analysis of the transformed data. 
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-0.1 
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Residual 
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12 #13 #14 #15 #16 17 #18 «#219 20 221 
log (length) 


Regression Analysis: log(Period) versus log(Length) 


Predictor Coef SE Coef T P 
Constant -0.73675 0.03808 -19.35 0.000 
log(Length) 0.51701 0.02511 20.59 0.000 


S = @.0185568 R-Sq = 97.9% R-Sq(adj) = 97.7% 
a. Based on the output, explain why it would be reasonable to use a power model to 
describe the relationship between the length and period of a pendulum. 


b. Give the equation of the least-squares regression line. Be sure to define any variables 
you use. 


c. Use the model from part (b) to predict the period of a pendulum with length 80 cm. 


38. Boyle’s law Refer to Exercise 34. We took the logarithm (base 10) of the values for both 
volume and pressure. Here is some computer output from a linear regression analysis of 
the transformed data. 
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0.8 0.9 1.0 1.1 1.2 1.3 
log (volume) 


Regression Analysis: log(Pressure) versus log(Volume) 


Predictor Coef SE Coef T P 
Constant 1.11116 0.01118 99.39 0.000 
log( Volume ) -0.81344 0.01020 -79.78 0.000 


S = 0.00486926 R-Sq = 99.9% R-Sq(adj) = 99.9% 


a. Based on the output, explain why it would be reasonable to use a power model to 
describe the relationship between pressure and volume. 


b. Give the equation of the least-squares regression line. Be sure to define any variables 
you use. 


c. Use the model from part (b) to predict the pressure in the syringe when the volume is 
17 cubic centimeters. 


39. Brawn versus brain How is the weight of an animal’s brain related to the weight of its 
body? Researchers collected data on the brain weight (in grams) and body weight (in 


40. 


kilograms) for 96 species of mammals./2 The following figure is a scatterplot of the 
logarithm of brain weight against the logarithm of body weight for all 96 species. The 
least-squares regression line for the transformed data is 


log y\=1.01+0.72 log xlog y = 1.01 + 0.72 log x 


log (brain weight) 


log (body weight) 


Based on footprints and some other sketchy evidence, some people believe that a large 
ape-like animal, called Sasquatch or Bigfoot, lives in the Pacific Northwest. Bigfoot’s 
weight is estimated to be about 127 kilograms (kg). How big do you expect Bigfoot’s 
brain to be? 


Determining tree biomass It is easy to measure the diameter at breast height (in 
centimeters) of a tree. It’s hard to measure the total aboveground biomass (in kilograms) 
of a tree, because to do this, you must cut and weigh the tree. The biomass is important for 
studies of ecology, so ecologists commonly estimate it using a power model. The 
following figure is a scatterplot of the natural logarithm of biomass against the natural 
logarithm of diameter at breast height (DBH) for 378 trees in tropical rain forests.22 The 
least-squares regression line for the transformed data is 


In yA=-2,00+2.42 In xAiny =—2.00 + 2.42In@ 


Use this model to estimate the biomass of a tropical tree 30 cm in diameter. 


In (biomass) 


2.5 3 3.5 4 4.5 5 
In(DBH) 


41. Braking distance How is the braking distance for a motorcycle related to the speed at 


42. 


which the motorcycle was traveling when the brake was applied? Statistics teacher Aaron 
Waggoner gathered data to answer this question. The table shows the speed (in miles per 
hour) and the distance needed to come to a complete stop when the brake was applied (in 
feet). 


Speed (mph) Distance (ft) Speed (mph) Distance (ft) 
6 1.42 32 52.08 
9 4.92 40 84.00 
19 18.00 48 110.33 
30 44.75 


a. Transform both variables using logarithms. Then calculate and state the least-squares 


regression line using the transformed variables. 


b. Use the model from part (a) to calculate and interpret the residual for the trial when the 


motorcycle was traveling at 48 mph. 


Braking distance, again How is the braking distance for a car related to the amount of 
tread left on the tires? Here are the braking distances (measured in car lengths) for a car 
making a panic stop in standing water, along with the tread depth of the tires (in 1/32 
inch):24 


Tread depth (1/32 Braking distance (car 
in.) lengths) 
11 9.7 
10 9.8 


10.1 
10.4 
10.8 
11.2 
11.8 
12.4 
13.6 
15.2 
18.3 
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a. Transform both variables using logarithms. Then calculate and state the least-squares 
regression line using the transformed variables. 


b. Use the model from part (a) to calculate and interpret the residual for the trial when the 
tread depth was 3/32 inch. 


43. pg 804 |») Killing bacteria Expose marine bacteria to X-rays for time periods from 1 


to 15 minutes. Here is a scatterplot showing the number of surviving bacteria (in 


hundreds) on a culture plate after each exposure time:22 
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a. Below is a scatterplot of the natural logarithm of the number of surviving bacteria 
versus time. Based on this graph, explain why it would be reasonable to use an 
exponential model to describe the relationship between count of bacteria and time. 


In (count) 


6 8 10 12 14 16 
Time (min) 


b. Here is output from a linear regression analysis of the transformed data. Give the 
equation of the least-squares regression line. Be sure to define any variables you use. 


Predictor 
Constant 


Time 


SE Coef T P 
0.05978 99.92 0.000 
0.006575 -33.22 0.000 


S = 0.110016 R-Sq = 98.8% R-Sq(adj) = 98.7% 


c. Use your model to predict the number of surviving bacteria after 17 minutes. 


44. Light through water Some college students collected data on the intensity of light at 
various depths in a lake. Here is a scatterplot of their data: 


Light intensity (lumens) 


Depth (m) 


a. At top right is a scatterplot of the natural logarithm of light intensity versus depth. 
Based on this graph, explain why it would be reasonable to use an exponential model to 
describe the relationship between light intensity and depth. 
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b. Here is computer output from a linear regression analysis of the transformed data. Give 
the equation of the least-squares regression line. Be sure to define any variables you 
use. 


Predictor Coef SE Coef T P 
Constant 6.78910 0.00009 78575.46 0.000 
Depth (m) -0.333021 0.000010 -31783.44 0.000 
S = 0.000055 R-Sq = 100.0% R-Sq(adj) = 100.0% 


c. Use your model to predict the light intensity at a depth of 12 meters. 


45. European population growth Many populations grow exponentially. Here are the data 
for the estimated population of Europe (in millions) from 1700 to 2012.22 The dates are 
recorded as years since 1700 so that x=312% = 312 is the year 2012. 


Years since 1700 0 50 100 150 200 
Population (in millions) 125 163 203 276 408 
Years since 1700 250 299 308 310 312 


Population (in millions) 547 729 732 738 740 


a. Use a logarithm to transform population size. Then calculate and state the least-squares 
regression line using the transformed variable. 


b. Use your model from part (a) to predict the population size of Europe in 2020. 


46. North American population growth Many populations grow exponentially. Here are the 
data for the estimated population of North America (in millions) from 1700 to 2012.24 
The dates are recorded as years since 1700 so that x=3122 = 312 is the year 2012. 


Years since 1700 0 50 100 150 200 
Population (in millions) 2 2 7 426 82 
Years since 1700 250 299 308 310 312 


Population (in millions) 172 307 337 345 351 


a. Use a logarithm to transform population size. Then calculate and state the least-squares 
regression line using the transformed variable. 


b. Use your model from part (a) to predict the population size of North America in 2020. 


47. pg 807 |») Putting success How well do professional golfers putt from various 


distances to the hole? The scatterplot shows various distances to the hole (in feet) and the 
percent of putts made at each distance for a sample of golfers.22 
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The graphs show the results of two different transformations of the data. The first graph 
plots the natural logarithm of percent made against distance. The second graph plots the 
natural logarithm of percent made against the natural logarithm of distance. 
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a. Based on the scatterplots, would an exponential model or a power model provide a 
better description of the relationship between distance and percent made? Justify your 
answer. 


b. Here is computer output from a linear regression analysis of In(percent made) and 
distance. Give the equation of the least-squares regression line. Be sure to define any 
variables you use. 


Predictor Coef SE Coef T P 
Constant 4.6649 0.0825 56.511 0.000 
Distance -0.1091 0.0067 -16.238 0.000 


S = 0.160385 R-Sq = 93.9% R-Sq(adj) = 93.4% 


c. Use your model from part (b) to predict the percent made for putts of 21 feet. 


d. Here is a residual plot for the linear regression in part (b). Do you expect your 
prediction in part (c) to be too large, too small, or about right? Justify your answer. 
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48. Counting carnivores Ecologists look at data to learn about nature’s patterns. One pattern 
they have identified relates the size of a carnivore (body mass in kilograms) to how many 
of those carnivores exist in an area. A good measure of “how many” is to count carnivores 
per 10,000 kg of their prey in the area. The scatterplot shows this relationship between 


body mass and abundance for 25 carnivore species.2° 


Abundance (per 10,000 kg prey) 
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The following graphs show the results of two different transformations of the data. The 
first graph plots the logarithm (base 10) of abundance against body mass. The second 
graph plots the logarithm (base 10) of abundance against the logarithm (base 10) of body 
mass. 
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a. Based on the scatterplots, would an exponential model or a power model provide a 
better description of the relationship between abundance and body mass? Justify your 
answer. 


b. Here is computer output from a linear regression analysis of log(abundance) and 
log(body mass). Give the equation of the least-squares regression line. Be sure to define 
any variables you use. 


Predictor Coef SE Coef T P 
Constant 1.9503 0.1342 14.53 0.000 
log(body mass) -1.04811 0.09802 -10.69 0.000 
S = 0.423352 R-Sq = 83.3% R-Sq(adj) = 82.5% 


c. Use your model from part (b) to predict the abundance of black bears, which have a 
body mass of 92.5 kg. 


d. Here is a residual plot for the linear regression in part (b). Do you expect your 
prediction in part (c) to be too large, too small, or about right? Justify your answer. 
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49. Heart weights of mammals Here are some data on the hearts of various mammals:22 


Length of cavity of left Heart 
Mammal ventricle (cm) weight (g) 
Mouse 0.55 0.13 
Rat 1.00 0.64 
Rabbit 2.20 5.80 
Dog 4.00 102.00 
Sheep 6.50 210.00 
Ox 12.00 2030.00 
Horse 16.00 3900.00 


a. Make an appropriate scatterplot for predicting heart weight from length. Describe what 
you see. 


b. Use transformations to linearize the relationship. Does the relationship between heart 
weight and length seem to follow an exponential model or a power model? Justify your 
answer. 


c. Perform least-squares regression on the transformed data. Give the equation of your 
regression line. Define any variables you use. 


d. Use your model from part (c) to predict the heart weight of a human who has a left 
ventricle 6.8 cm long. 


50. Click-through rates Companies work hard to have their website listed at the top of an 
Internet search. Is there a relationship between a website’s position in the results of an 
Internet search (1=top position,2=2nd position,! = top position, 2 = 2nd position, etc.) 
and the percentage of people who click on the link for the website? Here are click-through 
rates for the top 10 positions in searches on a mobile device:22 

Click- 
Click-through through rate 


Position rate (%) Position (%) 


1 23.53 6 3.80 
Z 14.94 t 2.79 
3 11.19 8 2.11 
4 1.47 9 1.57 
3 5.29 10 1.18 


a. Make an appropriate scatterplot for predicting click-through rate from position. 
Describe what you see. 


b. Use transformations to linearize the relationship. Does the relationship between click- 


through rate and position seem to follow an exponential model or a power model? 
Justify your answer. 


c. Perform least-squares regression on the transformed data. Give the equation of your 
regression line. Define any variables you use. 


d. Use your model from part (c) to predict the click-through rate for a website in the 11th 
position. 


Multiple Choice Select the best answer for Exercises 51-54. 


ol. 


O2. 


51. Suppose that the relationship between a response variable y¥ and an explanatory 
variable xv is modeled by y=2.7(0.316)x.¥ — 2.7(0.316)". Which of these scatterplots 
would approximately follow a straight line? 


a. A plot of y¥ against xz 

b. A plot of y¥ against log xlog x 

c. A plot of log ylS ¥ against xa 

d. A plot of log ylog Y against log xlog x 


e. A plot of yv¥ against x@ 


Some high school physics students dropped a ball and measured the distance fallen (in 
centimeters) at various times (in seconds) after its release. If you have studied physics, 
you probably know that the theoretical relationship between the variables is 

. ° 2 
distance=490(time)2.“istance = 490(time)". Which of the following scatterplots would 
not approximately follow a straight line? 


a. A plot of distance versus (time)2(time)” 


. / tot < ‘oO . 
b. A plot of distancev distance versus time 


c. A plot of distance versus timev time 


53. 


54. 


50. 


d. A plot of In(distance) versus In(time) 


e. A plot of log(distance) versus log(time) 


Students in Mr. Handford’s class dropped a kickball beneath a motion detector. The 
detector recorded the height of the ball (in feet) as it bounced up and down several times. 
Here is computer output from a linear regression analysis of the transformed data of 
log(height) versus bounce number. Predict the highest point the ball reaches on its seventh 
bounce. 


Predictor Coef SE Coef T P 
Constant 0.45374 0.01385 32.76 0.000 
Bounce -—0.117160 0.004176 -28.06 0.000 


S = 0.0132043 R-Sq = 99.6% R-Sq(adj) = 99.5% 


a. 0.35 feet 
b. 2.26 feet 
c. 0.37 feet 
d. 2.32 feet 
e. 0.43 feet 


A scatterplot of y¥ versus x shows a positive, nonlinear association. Two different 
transformations are attempted to try to linearize the association: using the logarithm of the 
y¥-values and using the square root of the y¥-values. Two least-squares regression lines 
are calculated, one that uses x@ to predict log(y)oaly ) and the other that uses x@ to 


predict yv¥ Which of the following would be the best reason to prefer the least-squares 
regression line that uses x@ to predict log(y)7!08(¥)’ 


a. The value of r2r” is smaller. 

b. The standard deviation of the residuals is smaller. 
c. The slope is greater. 

d. The residual plot has more random scatter. 


e. The distribution of residuals is more Normal. 
Recycle and Review 


Shower time (1.3, 2.2, 6.3, 7.3) Marcella takes a shower every morning when she gets up. 
Her time in the shower varies according to a Normal distribution with mean 4.5 minutes 
and standard deviation 0.9 minute. 


a. Find the probability that Marcella’s shower lasts between 3 and 6 minutes on a 


randomly selected day. 


b. If Marcella took a 7-minute shower, would it be classified as an outlier by the 1.5JQR 
rule? Justify your answer. 


c. Suppose we choose 10 days at random and record the length of Marcella’s shower each 
day. What’s the probability that her shower time is 7 minutes or greater on at least 2 of 
the days? 


d. Find the probability that the mean length of her shower times on these 10 days exceeds 
5 minutes. 


56. Tattoos (8.2) What percent of U.S. adults have one or more tattoos? The Harris Poll 
conducted an online survey of 2302 adults and found 29% of respondents had at least 1 
tattoo. According to the published report, “Respondents for this survey were selected from 
among those who have agreed to participate in Harris Interactive surveys.”22 Explain why 
it would not be appropriate to use these data to construct a 95% confidence interval for the 
proportion of all U.S. adults who have tattoos. 

Exercises 57 and 58 refer to the following setting. About 1100 high school teachers attended a 
weeklong summer institute for teaching AP® Statistics classes. After learning of the survey 
described in Exercise 56, the teachers in the AP® Statistics class wondered whether the results 
of the tattoo survey would be similar for teachers. They designed a survey to find out. The class 
opted to take a random sample of 100 teachers at the institute. One of the questions on the 
survey was: Do you have any tattoos on your body? 


(Circle one) YES NO 


57. Tattoos (8.2, 9.2) Of the 98 teachers who responded, 23.5% said that they had one or 
more tattoos. 


a. Construct and interpret a 95% confidence interval for the true proportion of all teachers 
at the AP® institute who would say they have tattoos. 


b. Does the interval in part (a) provide convincing evidence that the proportion of all 
teachers at the institute who would say they have tattoos is different from 0.29 (the 
value cited in the Harris Poll report)? Justify your answer. 


c. Two of the selected teachers refused to respond to the survey. If both of these teachers 
had responded, could your answer to part (b) have changed? Justify your answer. 


58. Tattoos (4.1) One of the first decisions the class had to make was what kind of sampling 
method to use. 


a. They knew that a simple random sample was the “preferred” method. With 1100 
teachers in 40 different sessions, the class decided not to use an SRS. Give at least two 
reasons why you think they made this decision. 


b. The AP® Statistics class believed that there might be systematic differences in the 
proportions of teachers who had tattoos based on the subject areas that they taught. 
What sampling method would you recommend to account for this possibility? Explain a 
Statistical advantage of this method over an SRS. 


a 


MM _. FRAPPY! FREE RESPONSE AP® 
PROBLEM, YAY! 


The following problem is modeled after actual AP® Statistics exam free response 
questions. Your task is to generate a complete, concise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you use, because you will be 
scored on the correctness of your methods as well as on the accuracy and completeness of 
your results and explanations. 


A random sample of 14 golfers was selected from the 147 players on the Ladies 
Professional Golf Association (LPGA) tour in a recent year. The total amount of money 
won during the year (in dollars) and the scoring average for each player in the sample was 
recorded. Lower scoring averages are better in golf. 

The scatterplot below displays the relationship between money and scoring average 
for these 14 players. 


1,800,000 
1,600,000 
1,400,000 
1,200,000 
1,000,000 


Scoring average 


a. Explain why it would not be appropriate to construct a confidence interval for the slope 
of the least-squares regression line relating money to scoring average. 
A scatterplot of the natural logarithm of money versus scoring average is shown, along 
with some computer output for a least-squares regression using the transformed data. 


In (money) 


Scoring average 


Predictor Coeff SE Coef lr P 
Constant 77.537 7.035 11.02 0.000 
Scoring average -0.90470 0.09679 -9.35 0.000 
S = 0.475059 R-Sq = 87.9% R-Sq(adj) = 86.9% 


b. Predict the amount of money won for an LPGA golfer with a scoring average of 70. 


c. Calculate and interpret a 95% confidence interval for the slope of the least-squares 
regression line relating In(money) to scoring average. Assume that the conditions for 


inference have been met. 


After you finish, you can view two example solutions on the book’s website 
(highschool.bfwpub.com/tops6e). Determine whether you think each solution is 
“complete,” “substantial,” “developing,” or “minimal.” If the solution is not 
complete, what improvements would you suggest to the student who wrote it? 
Finally, your teacher will provide you with a scoring rubric. Score your response 
and note what, if anything, you would do differently to improve your own score. 


Chapter 12 Review 


Section 12.1: Inference for Linear Regression 


In this section, you learned that the sample regression line y\=b0+b1x¥ = 4 + 61% estimates 
the population (true) regression line py=B0+B1x.v = 90 + 91: The sampling distribution of 
the sample slope b11 is the foundation for doing inference about the population (true) slope 
81.%1- When the conditions are met, the sampling distribution of b141 has an approximately 


oh, = 


Or /n 


Normal distribution with mean pb1=B1/% ~ 9 and standard deviation ob1=coxn. 


There are five conditions for performing inference about the slope of a population (true) 
least-squares regression line. Remember them with the acronym LINER. 
e The linear condition says that the mean value of the response variable py" falls on the 
population (true) regression line py=R0+B1x.v = 9 + P1%- To check the linear condition, 
verify that there are no leftover curved patterns in the residual plot. 


e The independent condition says that individual observations are independent of each other. 
To check the independent condition, verify that the sample size is less than 10% of the 
population size when sampling without replacement from a population. Also, convince 
yourself that knowing the response for one individual won’t help you predict the response 
for another individual. 


e The Normal condition says that the distribution of y¥-values is approximately Normal for 
each value of x.2- To check the Normal condition, graph a dotplot, histogram, stemplot, 
boxplot, or Normal probability plot of the residuals and verify that there are no outliers or 
strong skewness. 


e The equal SD condition says that for each value of x,” the distribution of y¥ should have 
the same standard deviation. To check the equal SD condition, verify that the residuals have 
roughly the same amount of scatter around the residual=Oresidual = 0 line for each value of 
x& on the residual plot. 


e The random condition says that the data are from a random sample or a randomized 
experiment. To check the random condition, verify that randomness was properly used in the 
data collection process. 


To construct and interpret a confidence interval for the slope of the population (true) least- 
squares regression line, follow the familiar four-step process. The formula for the confidence 
interval is b1+t*«SEb1,"! +t SEv., where txt” is the tt critical value with df=n-2.4f = n — 2. 
The standard error of the slope SEb1°2% describes how far the sample slope typically varies 


from the population (true) slope in repeated random samples or random assignments. The 


SE,, = ——-. 
formula for the standard error of the slope is SEb1=ssxn-1. *\ " sevi-l’ The standard error 


of the slope is typically provided with standard computer output for least-squares regression. 


When you conduct a significance test for the slope of the population (true) least-squares 
regression line, use the standardized test statistic t=b1—hypothesized slopeSEb1 


b, hypothesized slope 
SEp, with df=n-2.d4f = m — 2. In most cases, the null hypothesis is H0:$1=0. 
Ho : 6; = 0. This hypothesis says that a straight-line relationship between x@ and yY is no 
better at predicting y¥ than using the mean value y .¥: The value of the standardized test 
statistic for a test of HO:81=0,40 : 1 = 9, along with a two-sided PP-value, is typically 
provided with standard computer output for least-squares regression. 


Section 12.2: Transforming to Achieve Linearity 
When the association between two variables is nonlinear, transforming one or both of the 
variables can result in a linear association. 

If the association between two variables follows a power model in the form y=axp,¥ = 22" 
there are several transformations that will result in a linear association. 


e Raise the values of x# to the power of p? and plot y¥ versus xp.2”. 
e Calculate the pPth root of the y¥-values and plot ypv¥ versus X.@. 


e Calculate the logarithms of the x@-values and the y¥-values, and plot log(y)losly ) versus 
log(x).lo8(@ )- You can use base-10 logarithms (log) or base-ee logarithms (In). 

If the association between two variables follows an exponential model in the form y=abx, 
y = ab", transform the data by computing the logarithms of the y¥-values and plot log(y)loaly ) 
versus x@ (or In(y¥) versus x). 

Once you have achieved linearity, calculate the equation of the least-squares regression line 
using the transformed data. Remember to include the transformed variables when you are 
writing the equation of the line. Likewise, when using the line to make predictions, make sure 
that the prediction is in the original units of y.¥- If you transformed the y¥ variable, you will 
need to undo the transformation after using the least-squares regression line. 

To decide between two or more transformations, choose the one that produces the most 
linear association and whose residual plot has the most random scatter. 


What Did You Learn? 


Related 
Example on Relevant Chapter Review 
Learning Target Section Page(s) Exercise(s) 
Check the conditions for 12.1 775 R12.2 


performing inference about 
the slope B11 of the 
population (true) 
regression line. 


Interpret the values of 
b0,b1,s¥%, 51,8 and SE b 
45», in context, and 


determine these values 
from computer output. 


Construct and interpret a 
confidence interval for the 
slope B11 of the 
population (true) 
regression line. 


Perform a significance test 
about the slope p11 of 
the population (true) 
regression line. 


Use transformations 
involving powers and roots 
to find a power model that 
describes the relationship 
between two quantitative 
variables, and use the 


model to make predictions. 


Use transformations 
involving logarithms to find 
a power model that 
describes the relationship 
between two quantitative 
variables, and use the 


model to make predictions. 


Use transformations 
involving logarithms to find 
an exponential model that 
describes the relationship 
between two quantitative 
variables, and use the 


model to make predictions. 


Determine which of 
several transformations 
does a better job of 
producing a linear 
relationship. 
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Chapter 12 Review Exercises 


These exercises are designed to help you review the important ideas and methods of the 


chapter. 


Exercises R12.1—R12.4 refer to the following setting. Do taller students require fewer steps to 
walk a fixed distance? The scatterplot shows the relationship between x=height® = height (jn 
inches) and y=numberY > number of steps required to walk the length of a school hallway for 


a random sample of 36 students at a high school. 
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A least-squares regression analysis was performed on the data. Here is some computer output 


from the analysis: 


Residual 
ie 


-10 
60 62 64 66 68 70 72 74 
Height (in.) 
10 
8 
B 6 
) 
= 
— 2 
E 4 
2 
-8 -6 4 —2 0 2 4 6 8 10 
Residual 
Predictor Coef SE Coef T P 
Constant 113.57 13.085 8.679 0.000 
Height -0.9211 0.1938 bias aae 


S = 3.50429 R-Sq = 39.9% R-Sq(adj) = 38.1% 


R12.1 > Long legs 
a. Describe what the scatterplot tells you about the relationship between height and 
number of steps. 


b. What is the equation of the least-squares regression line? Define any variables you 
use. 


c. Identify the value of each of the following from the computer output. Then provide an 
interpretation of each value. 


i. bO2%. 
ii. b1P1 


iii. ss 
iv. SEbISE®, 
R12.2 |») Long legs Verify that the conditions for inference about the slope of the least- 
squares regression line are met in this context. 
R12.3 |») Long legs Do these data provide convincing evidence at the a=0.050 = 0.05 level 


that taller students at this school require fewer steps to walk a fixed distance? Assume 
that the conditions for inference are met. 


R12.4 |») Long legs Construct and interpret a 95% confidence interval for the slope of the 


population regression line. Assume that the conditions for inference are met. Explain 
how the interval provides more information than the test in R12.3. 


R12.5 |») Light intensity In a physics class, the intensity of a 100-watt lightbulb was 


measured by a sensor at various distances from the light source. Here is a scatterplot of 
the data. Note that a candela is a unit of luminous intensity in the International System of 
Units. 
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Physics textbooks suggest that the relationship between light intensity y¥ and 
distance x® should follow an “inverse square law,” that is, a power law model of the 


_9 1 
y=ar ~“=—a-. : 
form y=ax-2=a1x2.” rv? We transformed the distance measurements by 


squaring them and then taking their reciprocals. Here is some computer output and a 
residual plot from a least-squares regression analysis of the transformed data. Note that 
the horizontal axis on the residual plot displays predicted light intensity. 


0.006 


0.005 ° 
0.004 
— 0.003 
BS 
= 0.002 
2 0.001 . . 
3 ; 
0.000 
0.001 
0,002 : 
0,003 - 
0.10 0.15 0.20 0.25 0.30 
Predicted intensity (candelas) 
Predictor Coef SE Coef T P 
Constant -0.000595 0.001821 -0.33 0.751 


Distance\(-2) 0.299624 0.003237 92.56 0.000 
S=0.0024837 R-Sq=99.9% R-Sq(adj)=99.9% 


a. Did this transformation achieve linearity? Give appropriate evidence to justify your 
answer. 


b. What is the equation of the least-squares regression line? Define any variables you 
use. 


c. Predict the intensity of a 100-watt bulb at a distance of 2.1 meters. 


R12.6 |») Pricey diamonds Here is a scatterplot showing the relationship between the 


weight (in carats) and price (in dollars) of round, clear, internally flawless diamonds with 


excellent cuts:22 
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a. Explain why a linear model is not appropriate for describing the relationship between 
price and weight of diamonds. 


b. We used software to transform the data in hopes of achieving linearity. The output 
shows the results of two different transformations. Would an exponential model or a 
power model describe the relationship better? Justify your answer. 


Transformation 1: 


z 

‘= 

& 

= 

In (weight) 

Term Coef SE Coef T-Value P-Value 
Constant 9.7062 0.0209 465.10 0.000 
lnWeight 2.2913 0.0332 68.91 0.000 
S=0.171328 R-sq=98.10% R-sq(adj )=98.08% 


Transformation 2: 


In (price) 


Weight (carats) 


Term Coef SE Coef' T-Value P-Value 
Constant 8.2709 0.0988 83.72 0.000 
Weight 1.3791 0.0549 25.12 0.000 
S=0.443287 R-Sq=87.28% R-sq(adj )=87.14% 


c. Use each model to predict the price for a diamond of this type that weighs 2 carats. 
Which prediction do you think will be better? Explain your reasoning. 


Chapter 12 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


112.1 Which of the following is not one of the conditions that must be satisfied in order to 
perform inference about the slope of a least-squares regression line? 


a. For each value of x,”» the population of y-values is Normally distributed. 


b. The standard deviation o@ of the population of y-values corresponding to a given 
value of x@ is always the same, regardless of the specific value of x.@. 


c. The sample size—that is, the number of paired observations (x, y)—exceeds 30. 


d. There exists a straight line such that, for each value of x,” the mean py“ of the 
corresponding population of y¥-values lies on that straight line. 
e. The data come from a random sample or a randomized experiment. 
T12.2 Students in a statistics class drew circles of varying diameters and counted how many 
Cheerios could be placed in the circle. The scatterplot shows the results. 
The students want to determine an appropriate equation for the relationship 
between diameter and the number of Cheerios. The students decide to transform the 


data to make it appear more linear before computing a least-squares regression line. 
Which of the following transformations would be reasonable for them to try? 


Cheerios 


Diameter 
I. Plot the square root of the number of Cheerios against diameter. 
II. Plot the cube of the number of Cheerios against diameter. 
Il. Plot the log of the number of Cheerios against the log of the diameter. 


IV. Plot the number of Cheerios against the log of the diameter. 


I and II 

I and III 
c. IL and II 
d. Il andIV 
e. ITand IV 


aT pp 


T12.3 Inference about the slope 81/1 of a least-squares regression line is based on which of 
the following distributions? 


a. The té distribution with n-1” ~ 1 degrees of freedom 
b. The standard Normal distribution 

c. The chi-square distribution with n—1” ~ | degrees of freedom 
d. The té distribution with n-2" — 2 degrees of freedom 


e. The Normal distribution with mean p and standard deviation of 


Exercises T12.4—T12.8 refer to the following setting. An old saying in golf is “You 
drive for show and you putt for dough.” The point is that good putting is more 
important than long driving for shooting low scores and hence winning money. To see 
if this is the case, data from a random sample of 69 of the nearly 1000 players on the 
PGA Tour’s world money list are examined. The average number of putts per hole 
(fewer is better) and the player’s total winnings for the previous season are recorded 
and a least-squares regression line was fitted to the data. Assume the conditions for 
inference about the slope are met. Here is computer output from the regression 


analysis: 
Predictor Coef SE Coef T P 
Constant 7897179 3023782 6.86 0.000 
Avg. putts -4139198 1698371 **** **** 


S = 281777 R-Sq = 8.1% R-Sq(adj) = 7.8% 


T12.4 By about how much does the sample slope typically vary from the population slope in 
repeated random samples of n=69n = 69 golfers? 


a. 7,897,179 
b. 1,698,371 

c. 3,023,782 

d. 281,777 

e. —4,139,198~4; 139, 198 


T12.5 Suppose that the researchers test the hypotheses HO: B1=0/0 : 91 = 9 versus 
Ha:81<0.40:f1 &lt; 0. Which of the following is the value of the t¢ statistic for this 
test? 


o p 


e. 
d. 


e. 


2.61 


~) 44-2.44 
2.44 
-20.24—20.24 
0.081 


T12.6 The PP-value for the test in Exercise T12.5 is 0.0087. Which of the following is a 
correct interpretation of this result? 


a. 


e. 


The probability there is no linear relationship between average number of putts per 
hole and total winnings for these 69 players is 0.0087. 


. The probability there is no linear relationship between average number of putts per 


hole and total winnings for all players on the PGA Tour’s world money list is 
0.0087. 


If there is no linear relationship between average number of putts per hole and total 

winnings for the players in the sample, the probability of getting a random sample 

of 69 players that yields a least-squares regression line with a slope of —4,139,198 
4, 139, 198 or less is 0.0087. 


. If there is no linear relationship between average number of putts per hole and total 


winnings for the players on the PGA Tour’s world money list, the probability of 
getting a random sample of 69 players that yields a least-squares regression line 
with a slope of —4,139,198 4,139,198 or less is 0.0087. 


The probability of making a Type I error is 0.0087. 


T12.7 Which of the following is the 95% confidence interval for the slope 81°: of the 
population regression line? 


a. 


b. 


C; 
d. 


e. 


7,897,179+3,023,782/:897,179 + 3,023,782 
7,897,179+2.000(3,023,782)" 897,179 + 2.000 (3,023,782) 
~4,139,198+1,698,371—4:139,198 + 1,698,371 
~4,139,198+1.960(1,698,371)~4:139,198 + 1.960 (1,698,371) 
~4,139,198+2.000(1,698,371)~4:139,198 + 2.000 (1,698,371) 


T12.8 Which of the following would make the calculation in Exercise T12.7 invalid? 


- 


a. If the scatterplot of the sample data wasn’t perfectly linear 
b. 


C. 


If the distribution of earnings has an outlier 
If the distribution of earnings wasn’t approximately Normal 


If the earnings for golfers with small putting averages was much more variable than 
the earnings for golfers with large putting averages 


If the standard deviation of earnings is much larger than the standard deviation of 


putting average 
T12.9 Which of the following would provide evidence that a power model of the form 
y=axp,¥ = 22", where p~0P #0 and p#1,? * 1, describes the relationship between a 
response variable y¥ and an explanatory variable x? 
a. A scatterplot of y¥ versus x@ looks approximately linear. 
b. A scatterplot of In y¥ versus x# looks approximately linear. 
c. A scatterplot of y¥ versus In x# looks approximately linear. 
d. A scatterplot of In y¥ versus In x2 looks approximately linear. 
e. None of these 


T12.10 We record data on the population of a particular country from 1960 to 2010. A 
scatterplot reveals a clear curved relationship between population and year. However, a 
different scatterplot reveals a strong linear relationship between the logarithm (base 10) 
of the population and the year. The least-squares regression line for the transformed 
data is 


log (population)\=—13.5+0.01(year)!°8 (population) =—13.5 + 0.01 (year) 


Based on this equation, which of the following is the best estimate for the population of 
the country in the year 2020? 


a. 6.7 
b. 812 

c. 5,000,000 
d. 6,700,000 
e. 8,120,000 


Section Il: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


T12.11 Growth hormones are often used to increase the weight gain of chickens. In an 
experiment using 15 chickens, 3 chickens were randomly assigned to each of 5 different 
doses of growth hormone (0, 0.2, 0.4, 0.8, and 1.0 milligrams). The subsequent weight 
gain (in ounces) was recorded for each chicken. A researcher plots the data and finds 
that a linear relationship appears to hold. Here is computer output from a least-squares 
regression analysis of these data. Assume that the conditions for performing inference 
about the slope B11 of the true regression line are met. 


Predictor Coef SE Coef T P 
Constant 4.5459 0.6166 7.37 <0.0001 
Dose 4.8323 1.0164 4.75 0.0004 


S = 3.135 R-Sq = 38.4% R-Sq(adj) = 37.7% 


a. Interpret each of the following in context: 
i. The slope 
ii. The y¥ intercept 
iii. The standard deviation of the residuals 
iv. The standard error of the slope 


b. Do the data provide convincing evidence of a linear relationship between dose and 
weight gain? Carry out a significance test at the a=0.05a = 0.05 level. 


c. Construct and interpret a 95% confidence interval for the slope parameter. 


T12.12 Foresters are interested in predicting the amount of usable lumber they can harvest 
from various tree species. They collect data on the diameter at breast height (DBH) in 
inches and the yield in board feet of a random sample of 20 Ponderosa pine trees that 
have been harvested. (Note that a board foot is defined as a piece of lumber 12 inches 
by 12 inches by 1 inch.) Here is a scatterplot of the data. 


300 


Yield (board feet) 
7 


DBH (in.) 


a. Here is some computer output and a residual plot from a least-squares regression on 
these data. Explain why a linear model may not be appropriate in this case. 


Predictor Coef SE Coef T P 
Constant -191.12 16.98 -11.25 0.000 
DBH (inches) 11.0413 0.5752 19.19 0.000 


S = 20.3290 R-Sq = 95.3% R-Sq(adj) = 95.1% 


Residual 


20 25 30 35 40 
DBH (in.) 


The foresters are considering two possible transformations of the original data: (1) 
cubing the diameter values or (2) taking the natural logarithm of the yield 
measurements. After transforming the data, a least-squares regression analysis is 


performed. Here is some computer output and a residual plot for each of the two 
possible regression models: 


Option 1: Cubing the diameter values 


Predictor Coef SE Coef T P 
Constant 2.078 5.444 0.38 0.707 
DBHA3 0.0042597 0.0001549 27.50 0.000 


S = 14.3601 R-Sq = 97.7% R-Sq(adj) = 97.5% 
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Option 2: Taking natural logarithm of yield measurements 


Predictor Coef SE Coef T P 
Constant 1.2319 0.1795 6.86 0.000 
DBH 0.113417 0.006081 18.65 0.000 


S = 0.214894 R-Sq = 95.1% R-Sq(adj) = 94.8% 


Residual 


3.0 3.5 4.0 4.5 5.0 5.5 6.0 
Predicted In (yield) 


b. Use both models to predict the amount of usable lumber from a Ponderosa pine with 
diameter 30 inches. 


c. Which of the predictions in part (b) seems more reliable? Give appropriate evidence 
to support your choice. 


Final Project 


In this project, your team will formulate a statistical question, design a study to answer the 
question, conduct the study, collect the data, analyze the data, and use statistical inference to 
answer the question. You may do your study on any topic, but you must be able to include the 
six steps listed above. 


.. Write a proposal describing the design of your study. Make sure to include the following: 


Describe the statistical question you are trying to answer. 


List the explanatory and response variables (or just the response variable when 
appropriate). 


State the null and alternative hypothesis you will be testing, along with the name of the 
test you will use to analyze the results. 


Describe how you will collect the data, so the conditions for inference will be satisfied. 


Explain how your study will be safe and ethical if you are using human subjects. 


’, Once your teacher has approved your proposal, carry out the study. 


3. Create a poster to summarize your project. Make sure to include the following: 


Title (in the form of a question). 


Introduction. The introduction should discuss what question you are trying to answer, why 
you chose this topic, what your hypotheses are, and how you will analyze your data. 


Data Collection. In this section, you will describe how you obtained your data. Be 
specific. 


Graphs and Summary Statistics, Including Raw Data. Begin by providing the raw data. If 
the data are quantitative, list them in a table. If the data are categorical, summarize them in 
a two-way table. Then make graphs that are well labeled and easy to compare, and list 
appropriate summary Statistics. Use the graphs and summary statistics to describe the 
evidence for the alternative hypothesis. 


Analysis and Conclusion. Identify the inference procedure you used and discuss the 
conditions for inference. Give the (standardized) test statistic and PP-value (with 
interpretation), along with the appropriate conclusion. Then provide the corresponding 
confidence interval (with interpretation) or follow-up analysis (for chi-square tests). 


Reflections. In this section, you should also discuss any possible errors (e.g., Type I or 
Type II), limitations to your conclusion, what you could do to improve the study the next 
time, and any other critical reflections. 


The key to a good statistical poster is communication and organization. Make sure all 
components of the poster are focused on answering the question of interest and that 


statistical vocabulary is used correctly. Include live action pictures of your data collection 
in progress. 


Cumulative AP® Practice Test 4 


Section |: Multiple Choice Choose the best answer for Questions AP4.1—AP4.40. 


AP4.1 A major agricultural company is testing a new variety of wheat to determine whether 
it is more resistant to certain insects than the current wheat variety. The proportion of a 
current wheat crop lost to insects is 0.04. Thus, the company wishes to test the 
following hypotheses: 


Hy : p= 0.04 
H0:p=0.04Ha:p<0.04 H, : p< 0.04 


Which of the following significance levels and sample sizes would lead to the highest 
power for this test? 


a. 
b. 
ic 
d. 


e. 


n=200n = 200 and a=0.01e = 0.01 
n=400n = 400 and a=0.05a = 0.05 
n=400n = 400 and a=0.01e = 0.01 
n=500n = 500 and a=0.01e = 0.01 
n=500n = 500 and a=0.05e = 0.05 


AP4.2 If P(A)=0.24P(A) = 0.24 and P(B)=0.52/(B) = 0.52 and events A and B are 
independent, what is P(A or B)? 


a. 0.1248 
b. 

G 
d. 


e. 


0.28 
0.6352 
0.76 


The answer cannot be determined from the given information. 


AP4.3 Sam has determined that the weights of unpeeled bananas from his local store have a 
mean of 116 grams with a standard deviation of 9 grams. Assuming that the 
distribution of weight is approximately Normal, to the nearest gram, the heaviest 30% 
of these bananas weigh at least how much? 


P. 


a. 107 ¢ 
b. 


C. 


121 ¢ 
lite 
125 ¢ 
116 ¢g 


AP4.4 The school board in a certain school district obtained a random sample of 200 
residents and asked if they were in favor of raising property taxes to fund the hiring of 
more statistics teachers. The resulting confidence interval for the true proportion of 
residents in favor of raising taxes was (0.183, 0.257). Which of the following is the 
margin of error for this confidence interval? 


a. 0.037 
b. 0.074 
c. 0.183 
d. 0.220 
e. 0.257 


AP4.5 After a name-brand drug has been sold for several years, the Food and Drug 
Administration (FDA) will allow other companies to produce a generic equivalent. The 
FDA will permit the generic drug to be sold as long as there isn’t convincing evidence 
that it is less effective than the name-brand drug. For a proposed generic drug intended 
to lower blood pressure, the following hypotheses will be used: 


H0:1G=pN versus Ha:pG<pNo:He = py versus H,:ig &lt; py 


where 


1G=true mean reduction in blood pressureusing the generic druguG=true mean 
reduction in blood pressureusing the name@brand drug 
ig = true mean reduction in blood pressure 


using the generic drug 
Hg = true mean reduction in blood pressure 


using the name@brand drug 


In the context of this situation, which of the following describes a Type I error? 


a. The FDA finds convincing evidence that the generic drug is less effective, when in 
reality it is less effective. 


b. The FDA finds convincing evidence that the generic drug is less effective, when in 
reality it is equally effective. 


c. The FDA finds convincing evidence that the generic drug is equally effective, when 
in reality it is less effective. 


d. The FDA fails to find convincing evidence that the generic drug is less effective, 
when in reality it is less effective. 


e. The FDA fails to find convincing evidence that the generic drug is less effective, 
when in reality it is equally effective. 


AP4.6 The town council wants to estimate the proportion of all adults in their medium-sized 
town who favor a tax increase to support the local school system. Which of the 


following sampling plans is most appropriate for estimating this proportion? 
a. A random sample of 250 names from the local phone book 
b. A random sample of 200 parents whose children attend one of the local schools 


c. A sample consisting of 500 people from the city who take an online survey about 
the issue 


d. A random sample of 300 homeowners in the town 


e. A random sample of 100 people from an alphabetical list of all adults who live in 
the town 


AP4.7 Which of the following is a categorical variable? 
a. The weight of an automobile 
b. The time required to complete the Olympic marathon 
c. The fuel efficiency (in miles per gallon) of a hybrid car 
d. The brand of shampoo purchased by shoppers in a grocery store 
e. The closing price of a particular stock on the New York Stock Exchange 


AP4.8 A large machine is filled with thousands of small pieces of candy, 40% of which are 
orange. When money is deposited, the machine dispenses 60 randomly selected pieces 
of candy. The machine will be recalibrated if a group of 60 candies contains fewer than 
18 that are orange. What is the approximate probability that this will happen if the 
machine is working correctly? 


P | z &lt; 
a. P(z<0.3-0.4(0.4)(0.6)60) {a 

P | z &lt; = 
b. P(z<0.3-0.4(0.3)(0.7)60) — 

P| gait: 224 
c. P(z<0.3-0.4(0.4)(0.6)60) a 

P (- &lt eae) 
d. P(z<0.3-0.4(0.4)(0.6)60) - 


P oo &lt: 0.4-—0.3 


y/ (0.3)(0.7) 


e. P(z<0.4-0.3(0.3)(0.7)60) > 


AP4.9 A random sample of 900 students at a very large university was asked which social 
networking site they used most often during a typical week. Their responses are shown 


in the table. 


Gender 

Male Female Total 

Facebook 504 

Twitter 80 

Networking site ea oe 
Pinterest 49 

Snapchat 72 

Total 900 


Assuming that gender and preferred networking site are independent, how many 
females do you expect to choose LinkedIn? 


a. 87.00 
b. 90.00 
c. 95.40 
d. 97.50 
e. 103.35 


AP4.10 Insurance adjusters are always concerned about being overcharged for accident 
repairs. The adjusters suspect that Repair Shop 1 quotes higher estimates than Repair 
Shop 2. To check their suspicion, the adjusters randomly select 12 cars that were 
recently involved in an accident and then take each of the cars to both repair shops to 
obtain separate estimates of the cost to fix the vehicle. The estimates are given in 


hundreds of dollars. 

Car 1 2 2 4 5 6 
Shop 1 212 25.2 39.0 11.3 15.0 18.1 
Shop 2 21 24.1 36.8 11.5 13.7 17.6 
Car 7 8 9 10 11 12 
Shop 1 25.3 2a.2 12.4 42.6 27.6 12.9 
Shop 2 24.8 21.3 12.1 42.0 26.7 12.5 


Assuming that the conditions for inference are met, which of the following significance 
tests should be used to determine whether the adjusters’ suspicion is correct? 


a. A paired té test 

b. A two-sample té test 

c. A té test to see if the slope of the population regression line is 0 
d. A chi-square test for homogeneity 


e. A chi-square test for goodness of fit 


AP4.11 A survey firm wants to ask a random sample of adults in Ohio if they support an 
increase in the state sales tax from 5.75% to 6%, with the additional revenue going to 
education. Let p/P denote the proportion in the sample who say that they support the 
increase. Suppose that 40% of all adults in Ohio support the increase. If the survey firm 
wants the standard deviation of the sampling distribution of p/P to equal 0.01, how 
large a sample size is needed? 


a. 1500 
b. 2400 
c. 2401 
d. 2500 
e. 9220 


AP4.12 A set of 10 cards consists of 5 red cards and 5 black cards. The cards are shuffled 
thoroughly, and you choose one at random, observe its color, and replace it in the set. 
The cards are thoroughly reshuffled, and you again choose a card at random, observe 
its color, and replace it in the set. This is done a total of four times. Let XX be the 
number of red cards observed in these four trials. The random variable XX has which 
of the following probability distributions? 


a. The Normal distribution with mean 2 and standard deviation 1 
b. The binomial distribution with n=10n = 10 and p=0.5P = 9.5 
c. The binomial distribution with n=5n = 5 and p=0.5? = 9.5 

d. The binomial distribution with n=4n = 4 and p=0.5P = 0.5 

e. The geometric distribution with p=0.5? = 9-5 


AP4.13 A study of road rage asked random samples of 596 men and 523 women about their 
behavior while driving. Based on their answers, each respondent was assigned a road 
rage score on a scale of 0 to 20. The respondents were chosen by random-digit dialing 
of telephone numbers. Are the conditions for inference about a difference in means 
satisfied? 


a. Maybe; the data came from independent random samples, but we should examine 
the data to check for Normality. 


b. No; road rage scores on a scale of 0 to 20 can’t be Normal. 
c. No; a paired té test should be used in this case. 


d. Yes; the large sample sizes guarantee that the corresponding population 
distributions will be Normal. 


e. Yes; we have two independent random samples and large sample sizes, and the 10% 
condition is met. 


AP4.14 Do hummingbirds prefer store-bought food made from concentrate or a simple 
mixture of sugar and water? To find out, a researcher obtains 10 identical hummingbird 


feeders and fills 5, chosen at random, with store-bought food from concentrate and the 
other 5 with a mixture of sugar and water. The feeders are then randomly assigned to 
10 possible hanging locations in the researcher’s yard. Which inference procedure 
should you use to test whether hummingbirds show a preference for store-bought food 
based on amount consumed? 

a. A one-sample z2 test for a proportion 

b. A two-sample z2 test for a difference in proportions 

c. A chi-square test for independence 

d. A two-sample t? test 

e. A paired té test 


AP4.15 A Harris poll found that 54% of American adults don’t think that human beings 
developed from earlier species. The poll’s margin of error for 95% confidence was 3%. 
This means that 


a. there is a 95% chance the interval (51%, 57%) contains the true percent of 
American adults who do not think that human beings developed from earlier 
species. 


b. the poll used a method that provides an estimate within 3% of the truth about the 
population in 95% of samples. 

c. if Harris conducts another poll using the same method, the results of the second poll 
will lie between 51% and 57%. 

d. there is a 3% chance that the interval is incorrect. 

e. the poll used a method that would result in an interval that contains 54% in 95% of 


all possible samples of the same size from this population. 


AP4.16 Two six-sided dice are rolled and the sum of the faces showing is recorded after 
each roll. Let X=theX = the number of rolls required to obtain a sum greater than 7. If 
100 trials are conducted, which of the following is most likely to be the result of the 
simulation? 


a. 


Number of rolls 
Xx Number of trails 


34 


NOOB WN BF 
K 
oO 


- 
oO 
FOF OF NM 


Number of rolls 


XX Number of trails 
0 34 
1 20 
2 16 
3 10 
4 6 
5 6 
5 3 
7 2 
8 1 
9 0 

10 1 
11 6) 
12 1 


Number of rolls 


XX Number of trails 
1 18 
2 23 
3 26 
4 15 
5 9 
6 6 
¢ 1 
8 0 
9 1 

10 0 
11 6) 
12 0 


13 1 


d. 
Number of rolls 
XX Number of trails 
1 10 
2 9 
3 10 
4 12 
5 7 
6 13 
7 10 
8 7 
9 9 
10 10 
11 2 
12 1 
e, 


Number of rolls 


XX Number of trails 
1 zZ 
2 2 
3 5 
4 10 
5 11 
6 15 
7 22 
8 17 
9 9 
10 4 
14 Zz 
12 0 
13 1 


AP4.17 Women who are severely overweight suffer economic consequences, a study has 
shown. They have household incomes that are $6710 less than other women, on 
average. The findings are from an eight-year observational study of 10,039 randomly 
selected women who were 16 to 24 years old when the research began. If the difference 


in average incomes is statistically significant, does this study give convincing evidence 
that being severely overweight causes a woman to have a lower income? 


a. Yes; the study included both women who were severely overweight and women 
who were not. 


b. Yes; the subjects in the study were selected at random. 


c. Yes, because the difference in average incomes is larger than would be expected by 
chance alone. 


d. No; the study showed that there is no connection between income and being 
severely overweight. 


e. No; the study suggests an association between income and being severely 
overweight, but we can’t draw a cause-and-effect conclusion. 


Questions AP 4.18 and 4.19 refer to the following situation. Could mud wrestling be 
the cause of a rash contracted by University of Washington students? Two physicians 
at the university’s student health center wondered about this when one male and six 
female students complained of rashes after participating in a mud-wrestling event. 
Questionnaires were sent to a random sample of students who participated in the event. 
The results, by gender, are summarized in the following table. 


Gender 
Male Female Total 
Yes 24 
Developed 
rash? NO 50 
Total 50 24 74 


Here is some computer output for the preceding table. The output includes the observed 
counts, the expected counts, and the chi-square statistic. 


Expected counts are printed below observed counts 


MALE FEMALE Total 

Yes 12 12 24 
16.22 7.78 

No 38 12 50 
33.78 16.22 

Total 50 24 74 


ChiSq = 5.002 


AP4.18 The cell that contributes most to the chi-square statistic is 
. men who developed a rash. 


a 
b. men who did not develop a rash. 


ie) 


women who developed a rash. 


A. 


women who did not develop a rash. 


e. 


both (a) and (d). 


AP4.19 From the chi-square test performed in this study, we may conclude that 


a. 


there is convincing evidence of an association between the gender of an individual 
participating in the event and development of a rash. 


mud wrestling causes a rash, especially for women. 


there is absolutely no evidence of any relationship between the gender of an 
individual participating in the event and the subsequent development of a rash. 


development of a rash is a real possibility if you participate in mud wrestling, 
especially if you do so regularly. 


the gender of the individual participating in the event and the development of a rash 
are independent. 


AP4.20 Random assignment is part of a well-designed comparative experiment because 


a 


d. 


ec. 


it is more fair to the subjects. 


it helps create roughly equivalent groups before treatments are imposed on the 
subjects. 


it allows researchers to generalize the results of their experiment to a larger 
population. 


it helps eliminate any possibility of bias in the experiment. 


it prevents the placebo effect from occurring. 


AP4.21 The following back-to-back stemplots compare the ages of players from two minor- 


league hockey teams (1|7=17 years). 


(1|7 = 17 years). 


Team A Team B 
98777 1 788889 
44333221 2 00123444 
7766555 2 556679 
521 3 023 
86 2 55 


Which of the following cannot be justified from the plots? 


a. Team A has the same number of players in their 30s as does Team B. 
b. 

C. 
d. 


e. 


The median age of both teams is the same. 
Both age distributions are skewed to the right. 
The range of age is greater for Team A 


There are no outliers by the 1.5/QR rule in either distribution. 


AP4.22 A distribution that represents the number of cars XX parked in a randomly selected 
residential driveway on any night is given by 


xivi 0 4. 2 3 4 
piPi 0.10 0.20 0.35 0.20 0.15 


Given that there is at least 1 car parked on a randomly selected residential driveway on 
a particular night, which of the following is closest to the probability that exactly 4 cars 
are parked on that driveway? 


a. 0.10 
b. 0.15 
O47 
d. 0.75 
e. 0.90 


AP4.23 Which sampling method was used in each of the following settings, in order from I 
to IV? 


I. I. A student chooses to survey the first 20 students to arrive at school. 


II. II. The name of each student in a school is written on a card, the cards are well 
mixed, and 10 names are drawn. 


Il. III. A state agency randomly selects 50 people from each of the state’s senatorial 
districts. 


IV. IV. A city council randomly selects eight city blocks and then surveys all the 
voting-age residents on those blocks. 


a. Voluntary response, SRS, stratified, cluster 
b. Convenience, SRS, stratified, cluster 
c. Convenience, cluster, SRS, stratified 
d. Convenience, SRS, cluster, stratified 
e. Cluster, SRS, stratified, convenience 


AP4.24 Western lowland gorillas, whose main habitat is in central Africa, have a mean 
weight of 275 pounds with a standard deviation of 40 pounds. Capuchin monkeys, 
whose main habitat is Brazil and other parts of Latin America, have a mean weight of 6 
pounds with a standard deviation of 1.1 pounds. Both distributions of weight are 
approximately Normally distributed. If a particular western lowland gorilla is known to 
weigh 345 pounds, approximately how much would a capuchin monkey have to weigh, 
in pounds, to have the same standardized weight as the gorilla? 


a. 4.08 
b. 7.27 
@ 7.93 
8.20 


aan 


e. There is not enough information to determine the weight of a capuchin monkey. 


AP4.25 Suppose that the mean weight of a certain breed of pig is 280 pounds with a 
standard deviation of 80 pounds. The distribution of weight for these pigs tends to be 
somewhat skewed to the right. A random sample of 100 pigs is taken. Which of the 
following statements about the sampling distribution of the sample mean weight x @ is 
true? 


a. It will be Normally distributed with a mean of 280 pounds and a standard deviation 
of 80 pounds. 


b. It will be Normally distributed with a mean of 280 pounds and a standard deviation 
of 8 pounds. 


c. It will be approximately Normally distributed with a mean of 280 pounds and a 
standard deviation of 80 pounds. 


d. It will be approximately Normally distributed with a mean of 280 pounds and a 
standard deviation of 8 pounds. 


e. There is not enough information to determine the mean and standard deviation of 
the sampling distribution. 


AP4.26 Which of the following statements about the té distribution with degrees of freedom 
df is (are) true? 


I. I. It is symmetric. 


II. II. It has more variability than the t¢ distribution with df+14f + 1 degrees of 
freedom. 


III. Il. As df increases, the té distribution approaches the standard Normal distribution. 
a. I only 

b. Il only 

c. I only 

d. I and III 

e. I, IL, and III 


Questions AP4.27—AP4.29 refer to the following situation. Park rangers are interested 
in estimating the weight of the bears that inhabit their state. The rangers have data on 
weight (in pounds) and neck girth (distance around the neck in inches) for 10 randomly 
selected bears. Here is some regression output for these data: 
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Constant -241.70 38.57 -6.270.000 

Neck 20.230 1.695 11.930.000 

girth 

Ss R-Sq = 94.7% 

26.7565 


AP4.27 Which of the following is the correct value of the correlation and its corresponding 
interpretation? 


a. 


e. 


The correlation is 0.947, and 94.7% of the variability in a bear’s weight can be 
accounted for by the least-squares regression line using neck girth as the 
explanatory variable. 


. The correlation is 0.947. The linear association between a bear’s neck girth and its 


weight is strong and positive. 


The correlation is 0.973, and 97.3% of the variability in a bear’s weight can be 
accounted for by the least-squares regression line using neck girth as the 
explanatory variable. 


. The correlation is 0.973. The linear association between a bear’s neck girth and its 


weight is strong and positive. 


The correlation cannot be calculated without the data. 


AP4.28 Which of the following represents a 95% confidence interval for the slope of the 
population least-squares regression line relating the weight of a bear and its neck girth? 


a. 
b. 
C. 
d. 


e. 


20.230+1.69520.230 = 1.695 
20.23043.8320.230 + 3.83 
20.230+3.9120.230 = 3.91 
20.230+20.2220.230 = 20.22 
26.7565+3.8326.7565 + 3.83 


AP4.29 A bear was recently captured whose neck girth was 35 inches and whose weight 
was 466.35 pounds. If this bear were added to the data set, what would be the effect on 


the value of s?s? 


a. It would decrease the value of ss because the added point is an outlier. 


. It would decrease the value of ss because the added point lies on the least-squares 


regression line. 
It would increase the value of ss because the added point is an outlier. 


It would increase the value of ss because the added point lies on the least-squares 
regression line. 


It would have no effect on the value of ss because the added point lies on the least- 
squares regression line. 


AP4.30 An experimenter wishes to test if one of two types of fish food (a standard fish food 
and a new product) is better for producing fish of equal weight after a two-month 
feeding program. The experimenter has two identical fish tanks (1 and 2) and is 
considering how to assign 40 fish, each of which has a numbered tag, to the tanks. The 
best way to do this would be to 


a. 


put all the odd-numbered fish in Tank 1 and the even-numbered fish in Tank 2. Give 
the standard food to Tank 1 and the new product to Tank 2. 


. obtain pairs of fish whose weights are roughly equal at the start of the experiment 


and randomly assign one of the pair to Tank 1 and the other to Tank 2. Give the 
standard food to Tank 1 and the new product to Tank 2. 


proceed as in option (b), but put the heavier of each pair into Tank 2. Give the 
standard food to Tank 1 and the new product to Tank 2. 


. assign the fish completely at random to the two tanks using a coin flip: heads means 


Tank 1 and tails means Tank 2. Give the standard food to Tank 1 and the new 
product to Tank 2. 


divide the 40 fish into two groups, with the 20 heaviest fish in one group. Randomly 
choose which tank to assign the heaviest fish and assign the lightest fish to the other 
tank. Give the standard food to Tank 1 and the new product to Tank 2. 


AP4.31 A city wants to conduct a poll of taxpayers to determine the level of support for 
constructing a new city-owned baseball stadium. Which of the following is the main 
reason for using a large sample size in constructing a confidence interval to estimate 
the proportion of city taxpayers who would support such a project? 


a. To increase the confidence level 
b. 

C. 
d. 


e. 


To eliminate any confounding variables 
To reduce nonresponse bias 
To increase the precision of the estimate 


To reduce undercoverage 


AP4.32 A standard deck of playing cards contains 52 cards, of which 4 are aces and 13 are 


hearts. You are offered a choice of the following two wagers: 


I. 


Il. 


Draw one card at random from the deck. You win $10 if the card drawn is an ace. 
Otherwise, you lose $1. 


Draw one card at random from the deck. If the card drawn is a heart, you win $2. 
Otherwise, you lose $1. 


Which of the two wagers should you prefer? 


a 
b. 
C. 
d. 


e. 


. Wager 1, because it has a greater expected value 


Wager 2, because it has a greater expected value 
Wager 1, because it has a greater probability of winning 
Wager 2, because it has a greater probability of winning 


Both wagers are equally favorable. 


AP4.33 Here are boxplots of SAT Critical Reading and Math scores for a randomly selected 
group of female juniors at a highly competitive suburban school: 


400 500 600 700 800 
Score 


Which of the following cannot be justified by the plots? 


The maximum Critical Reading score is greater than the maximum Math score. 


. Critical Reading scores are skewed to the right, whereas Math scores are somewhat 


skewed to the left. 


The median Critical Reading score and the median Math score for females are about 
the same. 


There appear to be no outliers in the distributions of SAT Critical Reading score. 


The mean Critical Reading score and the mean Math score for females are about the 
same. 


AP4.34 A distribution of exam scores has mean 60 and standard deviation 18. If each score 
is doubled, and then 5 is subtracted from that result, what will the mean and standard 
deviation of the new scores be? 


a. 


b. 


mean=115mean = 115 and standard deviation=31standard deviation = 31 


mean=115mean = 115 and standard deviation=36standard deviation = 36 


C. mean=120mean = 120 and standard deviation=6standard deviation = 6 
d. mean=120mean = 120 and standard deviation=31standard deviation = 31 
e. mean=12Qmean = 120 and standard deviation=36standard deviation = 36 


AP4.35 In a clinical trial, 30 patients with a certain blood disease are randomly assigned to 
two groups. One group is then randomly assigned the currently marketed medicine, and 
the other group receives the experimental medicine. Every week, patients report to the 
clinic where blood tests are conducted. The clinic technician is unaware of the kind of 
medicine each patient is taking, and the patient is also unaware of which medicine he or 
she has been given. This design can be described as 


a. a double-blind, completely randomized experiment, with the currently marketed 
medicine and the experimental medicine as the two treatments. 


b. asingle-blind, completely randomized experiment, with the currently marketed 
medicine and the experimental medicine as the two treatments. 


c. a double-blind, matched pairs design, with the currently marketed medicine and the 
experimental medicine forming a pair. 


d. a double-blind, block design that is not a matched pairs design, with the currently 
marketed medicine and the experimental medicine as the two blocks. 


e. a double-blind, randomized observational study. 


AP4.36 A local investment club that meets monthly has 200 members ranging in age from 
27 to 81. A cumulative relative frequency graph of age is shown. Approximately how 
many members of the club are more than 60 years of age? 


Cumulative relative frequency (%) 


20 30 40 50 60 70 80 
Age of member (years) 


20 
44 
c. 78 
90 
e. 110 


aT pp 


A. 


AP4.37 A manufacturer of electronic components is testing the durability of a newly 
designed integrated circuit to determine whether its life span is longer than that of the 
earlier model, which has a mean life span of 58 months. The company takes a simple 
random sample of 120 integrated circuits and simulates typical use until they stop 
working. The null and alternative hypotheses used for the significance test are HO: 
p=58Ho:# = 58 and Ha: p&gt;58.4a'# > 58. The PP-value for the resulting one- 
sample té test is 0.035. Which of the following best describes what the PP-value 
measures? 


a. The probability that the new integrated circuit has the same life span as the current 
model is 0.035. 


b. The probability that the test correctly rejects the null hypothesis in favor of the 
alternative hypothesis is 0.035. 


c. The probability that a single new integrated circuit will not last as long as one of the 
earlier circuits is 0.035. 


d. The probability of getting a sample mean as far or farther above 58 if there really is 
no difference between the new and the old circuits is 0.035. 


e. The probability of getting a sample mean for the new integrated circuit that is less 
than the mean for the earlier model is 0.035. 


Questions AP4.38 and AP4.39 refer to the following situation. Do children’s fear levels 
change over time and, if so, in what ways? Little research has been done on the 
prevalence and persistence of fears in children. Several years ago, two researchers 
surveyed a randomly selected group of 94 third- and fourth-grade children, asking them 
to rate their level of fearfulness about a variety of situations. Two years later, the 
children again completed the same survey. The researchers computed the overall fear 
rating for each child in both years and were interested in the relationship between these 
ratings. They then assumed that the true regression line was 


later rating=B0+B1 (initial rating)Miater rating = 0 + A) (initial rating) 
and that the assumptions for regression inference were satisfied. This model was fitted 


to the data using least-squares regression. The following results were obtained from 
Statistical software. 


Predictor Coefficient St. Dev. 
Constant 0.877917 0.1184 
Initial rating @.397911 0.0676 
S = 0.2374 R-Sq = 0.274 


Here is a scatterplot of the later ratings versus the initial ratings and a plot of the 
residuals versus the initial ratings: 
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AP4.38 Which of the following statements is supported by these plots? 


a. 


The abundance of outliers and influential observations in the plots means that the 
assumptions for regression are clearly violated. 


These plots contain dramatic evidence that the standard deviation of the response 
about the true regression line is not approximately the same for each x#-value. 


These plots call into question the validity of the assumption that the later ratings 
vary Normally about the least-squares line for each value of the initial ratings. 


A linear model isn’t appropriate here because the residual plot shows no association. 


There is no striking evidence that the assumptions for regression inference are 
violated. 


AP4.39 George’s initial fear rating was 0.2 higher than Jonny’s. What does the model 


predict about their later fear ratings? 
a. George’s will be about 0.96 higher than Jonny’s. 
b. George’s will be about 0.40 higher than Jonny’s. 


c. George’s will be about 0.20 higher than Jonny’s 
d. George’s will be about 0.08 higher than Jonny’s. 
e. George’s will be about the same as Jonny’s. 
AP4.40 The table provides data on the political affiliation and opinion about the death 


penalty of 850 randomly selected voters from a congressional district. 


Opinion about death penalty 


Favor Oppose Total 

Republican 397 

- Democrat 248 
Political party 

Other 205 

Total 850 


Which of the following does not support the conclusion that being a Republican and 
favoring the death penalty are not independent? 


299 , 98 
a. 299494498356 491 7 356 
299 , 397 


b. 299494¥397850 494 7 850 


494 of 299 
~ 397 


c. 4948504299397 850 


d. 4948504397850 850 * 850 


(397)(494) 


e. (397)(494)850#299 850 7 799 


Section II: Free Response Show all your work. Indicate clearly the methods you use, 
because you will be graded on the correctness of your methods as well as on the accuracy and 
completeness of your results and explanations. 


AP4.41 The body’s natural electrical field helps wounds heal. If diabetes changes this field, it 
might explain why people with diabetes heal more slowly. A study of this idea 
compared randomly selected normal mice and randomly selected mice bred to 
spontaneously develop diabetes. The investigators attached sensors to the right hip and 
front feet of the mice and measured the difference in electrical potential (in millivolts) 
between these locations. Graphs of the data for each group reveal no outliers or strong 
skewness. The following computer output provides numerical summaries of the data.24 


Variable N Mean StDev 
Diabetic mice 2413.090 4.839 
Normal mice 1810.022 2.915 


Minimum Q1 Median Q3 Maximum 
1.050 10.038 12.650 17.038 22.600 
4.950 8.238 9.250 12.375 16.100 


Is there convincing evidence at the a=0.05a = 0.05 level that the mean electrical 
potential differs for normal mice and mice with diabetes? 


AP4.42 Can physical activity in youth lead to mental sharpness in old age? A 2010 study 
investigating this question involved 9344 randomly selected, mostly white women over 
age 65 from four U.S. states. These women were asked about their levels of physical 
activity during their teenage years, 30s, 50s, and later years. Those who reported being 
physically active as teens enjoyed the lowest level of cognitive decline—only 8.5% had 
cognitive impairment—compared with 16.7% of women who reported not being 
physically active at that time. 


a. State an appropriate pair of hypotheses that the researchers could use to test whether 
the proportion of women who suffered a cognitive decline was significantly smaller 
for women who were physically active in their youth than for women who were not 
physically active at that time. Be sure to define any parameters you use. 


b. Assuming the conditions for performing inference are met, what inference method 
would you use to test the hypotheses you identified in part (a)? Do not carry out the 
test. 


c. Suppose the test in part (b) shows that the proportion of women who suffered a 
cognitive decline was significantly smaller for women who were physically active in 
their youth than for women who were not physically active at that time. Can we 
generalize the results of this study to all women aged 65 and older? Justify your 
answer. 


d. We cannot conclude that being physically active as a teen causes a lower level of 
cognitive decline for women over 65, due to possible confounding with other 
variables. Explain the concept of confounding and give an example of a potential 
confounding variable in this study. 


AP4.43 In a recent poll, randomly selected New York State residents at various fast-food 
restaurants were asked if they supported or opposed a “fat tax” on sugared soda. Thirty- 
one percent said that they were in favor of such a tax and 66% were opposed. But when 
asked if they would support such a tax if the money raised were used to fund health care 
given the high incidence of obesity in the United States, 48% said that they were in 
favor and 49% were opposed. 


a. In this situation, explain how bias may have been introduced based on the way the 
questions were worded and suggest a way that the questions could have been 
worded differently in order to avoid this bias. 


b. In this situation, explain how bias may have been introduced based on the way the 
sample was taken and suggest a way that the sample could have been obtained in 
order to avoid this bias. 


c. This poll was conducted only in New York State. Suppose the pollsters wanted to 
ensure that estimates for the proportion of people who would support a tax on 


sugared soda were available for each state as well as an overall estimate for the 
nation as a whole. Identify a sampling method that would achieve this goal and 
briefly describe how the sample would be taken. 


AP4.44 Each morning, coffee is brewed in the school workroom by one of three faculty 
members, depending on who arrives first at work. Mr. Worcester arrives first 10% of 
the time, Dr. Currier arrives first 50% of the time, and Mr. Legacy arrives first on the 
remaining mornings. The probability that the coffee is strong when brewed by Dr. 
Currier is 0.1, while the corresponding probabilities when it is brewed by Mr. Legacy 
and Mr. Worcester are 0.2 and 0.3, respectively. Mr. Worcester likes strong coffee! 


a. What is the probability that on a randomly selected morning the coffee will be 
strong? 


b. If the coffee is strong on a randomly selected morning, what is the probability that it 
was brewed by Dr. Currier? 


AP4.45 The following table gives data on the mean number of seeds produced in a year by 
several common tree species and the mean weight (in milligrams) of the seeds 
produced. Two species appear twice because their seeds were counted in two locations. 
We might expect that trees with heavy seeds produce fewer of them, but what 
mathematical model best describes the relationship?=2 


Tree species Seed count Seed weight (mg) 
Paper birch Zi 200 0.6 
Yellow birch 12,158 1.6 
White spruce 7202 2.0 
Engelmann spruce 3671 3.3 
Red spruce 5051 3.4 
Tulip tree 13,509 9.1 
Ponderosa pine 2667 off 
White fir 5196 40.0 
Sugar maple 1751 48.0 
Sugar pine 1159 216.0 
American beech 463 247.0 
American beech 1892 247.0 
Black oak 93 1851.0 
Scarlet oak 525 1930.0 
Red oak A411 2475.0 
Red oak 253 2475.0 
Pignut hickory 40 3423.0 
White oak 184 3669.0 
Chestnut oak 107 4535.0 


a. Describe the association between seed count and seed weight shown in the 


scatterplot. 
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b. Two alternative models based on transforming the original data are proposed to 
predict the seed weight from the seed count. Here are graphs and computer output 
from a least-squares regression analysis of the transformed data. 


Model A: 
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Predictor Coef SE Coef T P 
Constant 6.1394 0.5726 10.72 0.000 
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Model B: 


In (weight) 
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Fitted value 
Predictor Coef SE T P 
Coef 
Constant 15.491 1.081 14.33 0.000 
1n(count ) -1.5222 0.1470 -10.35 0.000 
5 = R-Sq = R-Sq(adj) = 85.5% 


1.16932 86 . 3% 


Which model, A or B, is more appropriate for predicting seed weight from seed 
count? Justify your answer. 


c. Using the model you chose in part (b), predict the seed weight if the seed count is 
3700. 


AP4.46 A company manufactures plastic lids for disposable coffee cups. When the 
manufacturing process is working correctly, the diameters of the lids are approximately 
Normally distributed with a mean diameter of 4 inches and a standard deviation of 0.02 
inch. To make sure the machine is not producing lids that are too big or too small, each 
hour a random sample of 25 lids is selected and the sample mean x @ is calculated. 


a. Describe the shape, center, and variability of the sampling distribution of the sample 


mean diameter, assuming the machine is working properly. 


The company decides that it will shut down the machine if the sample mean 
diameter is less than 3.99 inches or greater than 4.01 inches, because this indicates 
that some lids will be too small or too large for the cups. If the sample mean is less 
than 3.99 or greater than 4.01, all the lids manufactured that hour are thrown away 
because the company does not want to sell bad products. 


b. Assuming that the machine is working properly, what is the probability that a 
random sample of 25 lids will have a mean diameter less than 3.99 inches or greater 
than 4.01 inches? 


Also, to identify any trends, each hour the company records the value of the sample 
mean on a chart, like the one given here. 
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One benefit of using this type of chart is that out-of-control production trends can be 
noticed before it is too late and lids have to be thrown away. For example, if the 
sample mean is consistently greater than 4 (but less than 4.01), this would suggest 
that something might be wrong with the machine. If such a trend is noticed before 
the sample mean gets larger than 4.01, then the machine can be fixed without having 
to throw away any lids. 


c. Assuming that the manufacturing process is working correctly, what is the 
probability that the sample mean diameter will be above the desired mean of 4.00 
but below the upper boundary of 4.01? 


d. Assuming that the manufacturing process is working correctly, what is the 
probability that in 5 consecutive samples, 4 or 5 of the sample means will be above 
the desired mean of 4.00 but below the upper boundary of 4.01? 


e. Which of the following results gives more convincing evidence that the machine 
needs to be shut down? Explain your answer. 


. Getting a single sample mean below 3.99 or above 4.01 


or 


’. Taking 5 consecutive samples and having at least 4 of the sample means be 
between 4.00 and 4.01 
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Chapter 4 Notes 


i This is an important topic, but it is not required for the AP® Statistics exam. 


ii This is an important topic, but it is not required for the AP® Statistics exam. 


iii This is an important topic, but it is not required for the AP® Statistics exam. 


Chapter 6 Notes 


i This topic is not required for the AP® Statistics exam. Some teachers prefer to discuss this 
topic when presenting the sampling distribution of p/? (Chapter 7). 


Chapter 1 

Introduction 

Answers to Check Your Understanding 

page 5: 1. The individuals are the cars in the student parking lot. 


2. He recorded the car’s license plate to identify the individuals. The variables recorded for each 
individual are the model (categorical), year (quantitative), color (categorical), highway gas mileage 
(quantitative), weight (quantitative), and whether or not it has a navigation system (categorical). 


Answers to Odd-Numbered Introduction Exercises 

1.1 (a) The AP® Statistics students who completed a questionnaire on the first day of class. (b) gender 
(categorical), grade level (categorical), GPA (quantitative), children in family (quantitative), homework 
last night (min) (categorical), and type of phone (categorical). 

1.3 The individuals are movies. The variables are year (quantitative), rating (categorical), time (min) 
(quantitative), genre (categorical), and box office ($) (quantitative). Note: Year might be considered 
categorical if we want to know how many of these movies were made each year, rather than the average 
year. 

1.5 The categorical variables are type of wood, type of water repellent, and paint color. The quantitative 
variables are paint thickness and weathering time. 

1.7 Student answers will vary. Examples of categorical variables could include region of the country and 
type of institution (2-year college, 4-year college, university). Examples of quantitative variables could 
include retention rate, graduation rate, class sizes, and faculty salaries. 
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Section 1.1 
Answers to Check Your Understanding 
page 12: 


Famous Rich Healthy Happy 
Preference 


A slight majority (52.5%) of students in the sample said they would prefer to be Happy. Rich and Famous 
were preferred about equally (20% and 17.5%). The least popular choice was Healthy (10%). 


page 16: 1. 113/338 = 0.334. This value makes sense because there were three treatments, so we would 
expect about one-third of the subjects to be assigned to this treatment. 


2. The distribution of change in depression is: Full response: 91/338 = 0.269, Partial response: 55/338 = 
0.163, and No response: 192/338 = 0.568. 


3, 27/338 = 0.08 5 8% 

page 22: 1. 27/91 = 0.297 

2. 70/113 = 0.619 = 61.9% 

3. The distribution of change in depression for the subjects receiving each of the three treatments is: 

St. John’s wort Full response: 27/113 = 0.239; Partial response: 16/113 = 0.142; No response: 70/113 = 
0.619 

Zoloft Full response: 27/109 = 0.248; Partial response: 26/109 = 0.239; No response: 56/109 = 0.514 
Placebo Full response: 37/116 = 0.319; Partial response: 13/116 = 0.112; No response: 66/116 = 0.569 


BB No response {J Partial response 
[] Full response 
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4. There does not appear to be a strong association between treatment and change in depression for these 
subjects because the distribution of response status is very similar for the three different treatments. Also, 
note that the treatment with the highest rate of “full response” was the placebo. 


Answers to Odd-Numbered Section 1.1 Exercises 


1.11 (a) The individuals are the babies. (b) It appears that births occurred with similar frequencies on 
weekdays (Monday through Friday), but with noticeably smaller frequencies on the weekend days 
(Saturday and Sunday). 


Births 


1.13 First, a relative frequency table must be constructed. 


Camera brand Relative frequency 
Canon 23/45 = 51.1% 
Sony 6/45 = 13.3% 

Nikon 11/45 = 24.4% 
Fujifilm 3/45 = 6.7% 
Olympus 2/45 = 4.4% 


The relative frequency bar graph is given here. 
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Camera brand 


The most popular brand of camera among the 45 most recent purchases on the Internet auction site is 
Canon, followed by Nikon, Sony, Fujifilm, and Olympus. Canon is the overwhelming favorite with over 
50% of the customers purchasing this brand. Also noteworthy is that almost 25% of the customers 
purchased a Nikon camera. 


1.15 (a) 2% (b) The most popular color of vehicles sold that year was white, followed by black, silver, 


and gray. It appears that a majority of car buyers that year preferred vehicles that were shades of black 
and white. 
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Vehicle color 


(c) It would be appropriate to make a pie chart of these data (including the other category) because the 
numbers in the table refer to parts of a single whole. 

1.17 Estimates will vary, but should be close to 63% Mexican and 9% Puerto Rican. 

1.19 The areas of the pictures should be proportional to the numbers of students they represent. As drawn, 
it appears that most of the students arrived by car but in reality, most came by bus (14 took the bus, 9 
came in Cars). 

1.21 By starting the vertical scale at 12 instead of 0, it looks like the percent of binge-watchers who think 
that 5 to 6 episodes is too many to watch in one viewing session is almost 20 times higher than the percent 


of binge-watchers who think that 3 to 4 episodes is too many to watch in one viewing session. In truth, the 
percent of binge-watchers who think that 5 to 6 episodes is too many to watch in one viewing session 
(31%) is less than 3 times higher than the percent of binge-watchers who think that 3 to 4 episodes is too 
many to watch in one viewing session (13%). Similar arguments can be made for the relative sizes of the 
other categories represented in the bar graph. 

1.23 (a) 50/150 = 0.333 (b) 10.7% said they saw broken glass at the accident; 89.3% said they did not. (c) 
10.67% 

1.25 (a) 71.25% (b) 0.633 (c) 12.1% 

1.27 (a) 0.241 (b) 68% 


1.29 (a) The distributions of responses for the three treatment groups are: 


“Smashed into” “Hit” Control 
Yes: 16/50 = 32% Yes: 7/50 = 14% Yes: 6/50 = 12% 
No: 34/50 = 68% No: 43/50 = 86% No: 44/50 = 88% 


The segmented bar graph is shown here. 
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(b) The segmented bar graph reveals that there is an association between opinion about broken glass at the 
accident and treatment received for subjects in the study. Knowing which treatment a subject received 
helps us predict whether or not that person will respond that he or she saw broken glass at the accident. 


1.31 (a) 0.655 (b) 73.7% (c) Based on the segmented bar graph, there is an association between perceived 
body image and gender. Males are about 4 times as likely as females to perceive that they are 
underweight. Males are also less likely to perceive that they are overweight. However, the overwhelming 
majority of both genders perceive that their body image is about right. 

1.33 Answers may vary. Regardless of whether a student went to a private or public college, most 
students chose a school that was at least 11 miles from home. Those who went to a public university were 
most likely to choose a school that was 11 to 50 miles from home (about 30%), while those who went to a 
private university were most likely to choose a school that was 101 to 500 miles from home (about 29%). 
1.35 (a) The graph reveals that as age increases, the percent that use smartphones for navigation 
decreases. (b) It would not be appropriate to make a pie chart of these data because the category 


percentages are not parts of the same whole. 


1.37 Answers will vary. Two possible tables are given here. 


10 40 30 20 


50 0 30 20 


1.39 (a) 64 of the 200 patients transported by helicopter, or 32% of patients died. 260 of the 1100 patients 
transported by ambulance, or 23.6% of patients died. (b) Of the patients who were in serious accidents, 
48% who were transported by helicopter died and 60% who were transported by ambulance died. Of the 
patients who were in less serious accidents, 16% who were transported by helicopter died and 20% who 
were transported by ambulance died. Whether the patients were in a serious accident or less serious 
accident, the percentage who died was greater for those who were transported by ambulance. (c) Overall, 
a greater percentage of patients who were transported by helicopter died, but when broken down by 
seriousness of the accident, in both instances, a greater percentage of patients who were transported by 
ambulance died. This is because people in more serious accidents were also more likely to be transported 
by helicopter and were more likely to die. People who were involved in less serious accidents were less 
likely to be transported by helicopter and were less likely to die. Overall, this makes it appear that those 
who are transported by helicopters are more likely to die; but when the patients are broken out by 
seriousness of the accident, we can see that the driving factor for the overall death rates is the seriousness 
of the accident, not the method of transportation. 
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Section 1.2 
Answers to Check Your Understanding 


L.O00 1.50 2.00 2.50 3.00 
Cost ($) 


page 34: 1. 


2. The distribution of cost is skewed to the right, with a single peak at $1.50. There are two small gaps at 
$2.25 and $2.75. 

page 37: 1. Shape: The distribution of time to eat ice cream (in seconds) for the female sample is skewed 
to the right with no clear peak. The distribution of time to eat ice cream (in seconds) for the male sample 
is skewed to the right, with a single peak at about 17.5 seconds. Outliers: The female distribution appears 
to have one outlier: the female who took approximately 105 seconds to eat the ice cream. The male 
distribution does not appear to contain any outliers. Center: The time it took female students to eat ice 
cream was generally longer (median ~ 45 seconds) than for male students (median * 20 seconds). 
Variability: The ice cream eating times for female students varied more (from about 13 seconds to about 
107 seconds) than for the male students (from about 5 seconds to about 50 seconds). 


page 40: 1. Shape: The distribution of resting pulse rates is skewed to the right, with a single peak on the 
70s stem. The distribution of after-exercise pulse rates is also skewed to the right, with a single peak on 
the 90s stem. Outliers: The distribution of resting pulse rates appears to have one outlier: the student 
whose resting pulse rate was120 bpm. The distribution of after-exercise pulse rates appears to have one 
outlier as well: the student whose after-exercise pulse rate was 146 bpm. Center: The students’ resting 


pulse rates tended to be lower (median = 76 bpm) than their “after exercise” pulse rates (median = 98 
bpm). Variability: The “after exercise” pulse rates vary more (from 86 bpm to 146 bpm) than the resting 
pulse rates (which vary from 68 bpm to 120 bpm). 


2.b 
3.e 
4.c 
page 44: 1. One possible histogram is displayed here. 
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2. The distribution of IQ scores is roughly symmetric and bell shaped with a single peak. There are no 
obvious outliers. The typical IQ score appears to be between 110 and 120 (median = 114). The IQ scores 
vary from about 80 to 150. 

page 46: 1. The distribution of word length for both the journal and the magazine have shapes that are 
skewed to the right and single-peaked. Neither distribution appears to have any outliers. The centers for 
both distributions are about the same at approximately 5-6 letters per word. Both distributions have 
similar variability as the length of words in the journal varies from 1 letter to 14 letters and the length of 
words in the magazine varies from 2 letters to 14 letters. 

2. This is a bar graph. It displays categorical data about first-year students’ planned field of study. 

3. No, because the variable is categorical and the categories could be listed in any order on the horizontal 


axis. 


Answers to Odd-Numbered Section 1.2 Exercises 
1.45 (a) The graph is shown here. 


i Bac LJ Ihe 5 
3 4 5 6 7 8 9 10 1 
Amount of sleep (h) 


(b) 0.179 

1.47 (a) The dot above 3 indicates that there was one game in which the U.S women’s soccer team scored 
3 more goals than their opponent. (b) All 20 of the values are zero or more, which indicates that the U.S. 
women’s soccer team had a very good season. They won 17/20 = 85% of their games, tied the other team 
in 3/20 = 15% of their games, and never lost. 

1.49 The shape of the distribution is left-skewed with a peak between 90 and 100 years. There is a small 
gap around 70 years. 


1.51 The shape of the distribution is roughly symmetric with a single peak at 7. 


1.53 The distribution of difference (U.S. — Opponent) in goals scored is skewed to the right, with two 
potential outliers: when the U.S. team outscored their opponents by 9 and 10 goals. The median difference 
was 2 goals and the differences varied from 0 to 10 goals. 

1.55 The distribution of total family income for Indiana is roughly symmetric, while the distribution of 
total family income for New Jersey is slightly skewed right. The value of $125,000 may be an outlier in 
the Indiana distribution. There are no obvious outliers in the New Jersey distribution. The median for both 
distributions is about the same, approximately $49,000. The distribution of total family income in Indiana 
is less variable than the New Jersey distribution. The incomes in Indiana vary from $0 to about $125,000. 
The incomes in New Jersey vary from $0 to about $170,000. 

1.57 (a) Both distributions have about the same amount of variability. The “external reward” distribution 
varies from 5 to about 24. And the “internal reward” distribution varies from about 12 to 30. (b) The 
center of the internal distribution is greater than the center of the external distribution, indicating that 
external rewards do not promote creativity. 


1.59 (a) The stemplot is shown here. 
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Key: 15 | 9 = 15.9 grams 


(b) The graph reveals that there was one Fun Size Snickers® bar that is “gigantic”! It weighs 19.2 grams. 
(c) 0.412 

1.61 (a) The area of the largest South Carolina county is 1,220 square miles (rounded to the nearest 10 
mi’). (b) The distribution of the area for the 46 South Carolina counties is right-skewed with distinct 
peaks on the 500 mi* and 700 mi* stems. There are no clear outliers. A typical South Carolina county has 
an area of about 655 square miles. The area of the counties varies from about 390 square miles to 1,220 


square miles. 


1.63 (a) If we had not split the stems, most of the data would appear on just a few stems, making it hard to 
identify the shape of the distribution. (b) Key: 16 | 0 means that 16.0% of that state’s residents are aged 25 
to 34. (c) The distribution of percent of residents aged 25—34 is roughly symmetric with a possible outlier 
at 16.0%. 

1.65 The distribution of acorn volume for the Atlantic coast is skewed to the right. The distribution of 
acorn volume for California is roughly symmetric with one high outlier of 17.1 cubic centimeters. The 
distribution of volume of acorn for the Atlantic coast has 3 potential outliers: 8.1, 9.1, and 10.5 cubic 
centimeters. The typical acorn volume for Atlantic coast oak tree species (median = 1.7 cubic centimeters) 
is less than the typical acorn volume for California oak tree species (median = 4.1 cubic centimeters). The 
Atlantic coast distribution (with acorn volumes from 0.3 to 10.5 cubic centimeters) varies less than the 
California distribution (with acorn volumes from 0.4 to 17.1 cubic centimeters.) 


1.67 (a) The histogram is shown here. 


Frequency of CO, 


0 2 4 6 8 10 12 14 16 18 20 
CO, emissions (metric tons per person) 
(b) The distribution of amount of CO» emissions per person in these 48 countries is right-skewed. 
Visually, none of the countries appear to be outliers. 
1.69 The data vary from 14 to 54. We chose intervals of width 6, beginning at 12. 
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The distribution of DRP scores is roughly symmetric. There do not appear to be any outliers. The center 
of the DRP score distribution is between 30 and 36 (with median = 35). The DRP scores vary from 14 to 
54. 

1.71 (a) The shape of the distribution is slightly skewed to the left with a single peak. (b) The center is 
between 0% and 2.5% return on common stocks. (c) The exact value for the minimum return cannot be 
identified because we only have a histogram of the returns, not the actual data. The lowest return was in 
the interval —22.5% to —25%. (d) About 37% of these months (102 out of 273) had negative returns. 


1.73 It is difficult to effectively compare the salaries of the two teams with these two histograms because 
the scale on the horizontal axis is very different from one graph to the other. It also does not help that the 
scales on the y-axis differ as well. 

1.75 (a) No, it would not be appropriate to use frequency histograms instead of relative frequency 
histograms in this setting because there were many more graduates surveyed (314) than non-graduates 
(57). (b) The distribution of total personal income for each group is skewed to the right and single peaked. 
There are some possible high outliers in the graduate distribution. There do not appear to be any outliers 
in the non-graduate distribution. The center of the personal income distribution is larger for graduates than 
non-graduates, indicating that graduates typically have higher incomes than non-graduates in this sample. 
The incomes for graduates vary a lot more (from $0 to $150,000) than non-graduates (from $0 to 
$60,000). 

1.77 A bar graph should be used because birth month is a categorical variable. A possible bar graph is 
given here. 
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1.79 Score earned on the AP® Statistics exam is quantitative, so the histograms shown here use relative 


frequencies because there are many more students who take the AP® Calculus AB exam. 


35 + 
2-1 
ms Fs 

10 
5 
0 t t 

l 2 3 4 5 l 2 3 4 5 

AP® Calculus AB exam scores AP® Statistics exam scores 


The shapes of the two distributions are very different. The distribution of scores on the AP® Calculus AB 
exam has a peak at 1 and another slightly lower peak at 5. The distribution of scores on the AP® Statistics 
exam, however, is more uniform with scores of 1, 3, and 4 being the most frequent and scores of 5 being 
the least frequent. Neither distribution has any outliers. The center of both distributions is 3. Although 
scores on both exams vary from 1 to 5, there are more scores close to the center on the AP® Statistics 
exam and more scores at the extremes on the AP® Calculus exam. 

1.81a 

Lee 


1.85b 


Section 1.3 


Answers to Check Your Understanding 
page 59: 1. The mean weight of the pumpkins is 
x =3.6+4,0+9.6+...+5.4+31+3323=9.935 pounds. 
3.6+4.0+9.6+. ..+5.4+31+433 

= STENT Rr TOPOL TOS — 9,935 pounds. 

23 
2. First we must put the weights in order: 2.0, 2.8, 3.4, 3.6, 4.0, 4.0, 5.4, 5.4, 6.0, 6.0, 6.1, 6.6, 9.6, 9.6, 
11.0, 11.9, 12.4, 12.7, 13.0, 14.0, 15.0, 31.0, and 33.0. Because there are 23 weights, the median is the 
12th weight in this ordered list. The median weight of the pumpkins is 6.6 pounds. 


Si 


3. I would use the median to summarize the typical weight of a pumpkin in this contest because the 
distribution of pumpkin weights is skewed to the right with two possible upper outliers: 31.0 and 33.0 
pounds. 


page 65: 1. Range = Max — Min = 23.3 - 21.5 = 1.8 mpg 


2. The standard deviation of 0.363 mpg tells us that the highway fuel economy of these 25 model year 
2018 Toyota 4 Runners typically varies by about 0.363 mpg from the mean of 22.404 mpg. 
3. Q1=22.24+22.22=22.2; Q3=22.6+22.62=22.6; IQR=Q3-Q1=22.6—-22.2=0.4 mpg 


Q, = BH? = 22.2; Q;, = BA — 99.6; IQR=Q3 — Qi = 22.6 — 22.2 = 0.4mpg 


4. I would use the interquartile range (IQR) to describe variability because there appears to be one upper 
outlier and one lower outlier. The [QR is resistant to outliers, but the range and standard deviation are not. 
page 70: 1. M=166+1672=166.5 grams; Q1=163 grams; Q3=170 grams. 

M= = = 166.5 grams; Q; = 163 grams; Q3 = 170 grams. The IQR = 170 - 163 = 7 grams. 
An outlier is any value below Q, — 1.5(/QR) = 163 - 1.5(7) = 152.5 grams or above Q3 + 1.5(JQR) = 170 
1 1.5(7) = 180.5 grams. This means that the value 152 grams is an outlier. The boxplot is displayed here. 
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2. No, the graph does not support their suspicion. The first quartile of the distribution of weight is 163 
grams, which means that at least 75% of the large fries that they purchased weighed more than the 
advertised weight of 160 grams. 


Answers to Odd-Numbered Section 1.3 Exercises 

1.87 (a) The mean of Joey’s first 14 quiz scores is 85. (b) Including a 15th quiz score of 0, Joey’s mean 
would be 79.3. This illustrates the property of nonresistance. The mean is not resistant. It is sensitive to 
extreme values. 

1.89 (a) 85 (b) Since there are now 15 quiz scores, the median is 84. Notice that the median did not 
change much. This shows that the median is resistant to outliers. 

1.91 (a) 8 electoral votes (b) Since the distribution of number of electoral votes is skewed to the right, the 
mean of this distribution is greater than its median. 

1.93 The mean house price is $276,200 and the median is $234,200. The distribution of house prices is 
likely to be quite skewed to the right because of a few very expensive homes, some of which may be 
outliers. When a distribution is skewed to the right, the mean is bigger than the median. 


1.95 (a) The median is 2 servings of fruit per day. 


(b) Xx =19474=2.62" — 74 2.62 


1.97 (a) Range = Max — Min = 98 — 74 = 24. The range of Joey’s quiz grades after his unexcused absence 


servings of fruit per day 


is 98 — 0 = 98. (b) The range may not be the best way to describe variability for a distribution of 
quantitative data because the range can be heavily affected by outliers. 

1.99 The standard deviation is 2.52 cm. The foot lengths of these 14-year-olds from the United Kingdom 
typically vary by about 2.52 cm from the mean of 24 cm. 


1.101 (a) The size of these 18 files typically varies by about 1.9 megabytes from the mean of 3.2 
megabytes. (b) If the music file that takes up 7.5 megabytes of storage space is replaced with another 
version of the file that only takes up 4 megabytes, the mean would decrease slightly. The standard 
deviation would decrease as well because a file of size 4 megabytes will be closer to the new mean than 
the file of 7.5 megabytes was to the former mean. 

1.103 Variable B has a smaller standard deviation because more of the observations have values closer to 
the mean than in Variable A’s distribution. That is, the typical distance from the mean is smaller for 
Variable B than for Variable A. 

1.105 Q; = 1.9 megabytes; Q3 = 4.7; IQR = Q3 — Q, = 4.7 — 1.9 = 2.8 megabytes 

1.107 An outlier is any value below Q, — 1.5(/QR) = 1.9 - 1.5(2.8) = 22.3 megabytes or above Q3 + 
1.5(7.QR) = 4.7 + 1.5(2.8) = 8.9 megabytes. There are no files in the data set that have fewer than —2.3 
megabytes or more than 8.9 megabytes, so there are no outliers in the distribution. 

1.109 (a) The distribution is skewed to the right because the mean is much larger than the median. Also, 
Q3 is much further from the median than Q,. (b) The amount of money spent typically varies by about 
$21.70 from the mean of $34.70. (c) The first quartile is 19.27 and the third quartile is 45.40, so the IQR 
is 45.40 — 19.27 = 26.13. Any points below 19.27 — 1.5(26.13) = -19.925 or above 45.40 + 1.5(26.13) = 
84.595 are outliers. Because the maximum of 93.34 is greater than 84.595, there is at least one outlier. 
1.111 (a) The median is 9, the first quartile is 3, and the third quartile is 43. The IQR is 43 — 3 = 40. An 
outlier would be any value below 3 — 1.5(40) = -57 or above 43 + 1.5(40) = 103. This means the value of 
118 is an outlier. The boxplot is shown here. 
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(b) The article claims that teens send 1742 texts a month, which works out to be about 58 texts a day 
(assuming a 30-day month). Nearly all of the members of the class (21 of 25) sent fewer than 58 texts per 
day, which seems to contradict the claim in the article. 


1.113 (a) the median and IQR would be a better choice for summarizing the center and variability of the 
distribution of electoral votes than the mean and standard deviation because the boxplot reveals that there 
are three outliers in the data set. The mean and standard deviation are not resistant measures of center and 
variability, so their values are sensitive to these extreme values. (b) The stemplot reveals that the 
distribution has a single peak, which cannot be discerned from the boxplot. Also, the stemplot reveals that 
there are actually four upper outliers rather than three. The value of 29, which is an outlier, gives the 
number of electoral votes for two states. In the boxplot, this appears as one asterisk. However, there are 
two states that have that many electoral votes, not one. 

1.115 Shape: The distribution of energy cost (in dollars) for top freezers looks roughly symmetric. The 
distribution of energy cost (in dollars) for side freezers looks roughly symmetric. The distribution of 


energy cost (in dollars) for bottom freezers looks skewed to the right. Outliers: There are no outliers for 
the top or side freezers. There are at least two bottom freezers with unusually high energy costs (over 
$140 per year). Center: The typical energy cost for the side freezers (median * $75) is greater than the 
typical cost for the bottom freezers (median * $69), which is greater than the typical cost for the top 
freezers (median © $56). Variability: There is much more variability in the energy costs for bottom 
freezers (IQR * $20) than for side freezers (IQR ~ $12) or for top freezers (IQR ~ $8). 


1.117 (a) 
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(b) Numerical summaries are given here. 


nr x & Sy, Minimum Q1 Median Q3 Maximum 
Male 15 62.4 71.4 0 6 28 83 214 
Female 16 128.3 116.0 0 34 107 191 379 


(c) The data from this survey project give very strong evidence that male and female texting habits differ 
considerably at the school. A typical female sends and receives about 79 more text messages in a 2-day 
period than a typical male. The males as a group are also much more consistent in their texting frequency 
than the females. 

1.119 (a) All five income distributions are skewed to the right. This tells us that within each level of 
education, there is much more variability in incomes for the upper 50% of the distribution than for the 
lower 50% of the distribution. (b) Even though the boxplots are not modified (visually showing outliers), 
it is clear that the upper whisker for the advanced degree boxplot is much longer than 1.5 times its IQR, 
indicating that there is at least one upper outlier in the group that earned an advanced degree. (c) As 
education level rises, the median, quartiles, and extremes rise as well—that is, every value in the 5- 
number summary gets larger. This makes sense because one would expect that individuals with more 
education tend to attain jobs that yield more income. (d) The variability in income increases as the 
education level increases. Both the width of the box (the JQR) and the distance from one extreme to the 
other increase as education levels increase. 

1.121 (a) One possible answer is 1, 1, 1, and 1. (b) 0, 0, 10, 10 (c) For part (a), any set of four identical 
numbers will have s, = 0. For part (b), however, there is only one possible answer. We want the values to 
be as far from the mean as possible, so the squared deviations from the mean can be as big as possible. 
Our best choice is two values at each extreme, which makes all four squared deviations equal to 52. 
1.123 d 

1.125e 


1.127 A histogram is given here. 
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Height (cm) 

This distribution is roughly symmetric with a single peak at 170 cm. There do not appear to be any 

outliers. The center of the distribution of heights can be described by the mean of 169.88 cm or the 

median of 169.5. The heights vary from 145.5 cm to 191 cm, so the range is 45.5 cm. The standard 

deviation of heights is 9.687 cm and the JQR is 177 — 163 = 14cm. 


Answers to Chapter 1 Review Exercises 

R1.1 (a) The individuals are buyers. (b) The variables are zip code (categorical), gender (categorical), 
buyer’s distance from the dealer (in miles) (quantitative), car model (categorical), model year 
(categorical), and price (quantitative). 


R1.2 First, a relative frequency table must be constructed. 


Candy selected Relative frequency 
Snickers® 8/30 = 26.7% 

Milky Way® 3/30 = 10% 
Butterfinger® 7/30 = 23.3% 
Twix® 10/30 = 33.3% 

3 Musketeers® 2/30 = 6.7% 


The relative frequency bar graph is given here. 


Percent 


Candy selected 


Students preferred Twix the most (one-third of the students chose this candy), followed by Snickers (27% 
relative frequency), Butterfinger (23% relative frequency), Milky Way (10% relative frequency), and 
lastly 3 Musketeers (7% relative frequency). 

R1.3 (a) The graph is misleading because the “bars” are different widths. For example, the bar for 
“Send/receive text messages” should be roughly twice the size of the bar for “Camera,” but it is actually 
much more than twice as large in area. (b) It would not be appropriate to make a pie chart for these data 


because they do not describe parts of the same whole. Students were free to answer in more than one 
category. 

R1.4 (a) 148/219 = 0.676 = 67.6% were Facebook users. (b) 67/219 = 0.306 = 30.6% were aged 28 and 
over. (c) 21/219 = 0.096 = 9.6% were older Facebook users. (d) 78/148 = 0.527 = 52.7% of the Facebook 
users were younger students. 

R1.5 (a) 


fi Not a Facebook user 
[_] Is a Facebook user 


Percent 


Younger Middle Older 
(18-22) (23-27) (28 and up) 


Age group 


(b) From both the table and the graph, we can see that there is an association between age and Facebook 
use. As age increases, the percent of Facebook users decreases. For younger students, about 95% use 
Facebook. That drops to 70% for 23- to 27-year-old students and drops even further to 31.3% for older 
students. 


R1.6 (a) A stemplot is shown here. 
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49 

50 7 

51 O 
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Key: 48 | 8 = 4.88 


(b) The distribution is roughly symmetric with one possible outlier at 4.88. The center of the distribution 
is between 5.4 and 5.5. The densities vary from 4.88 to 5.85. (c) The mean of the distribution of 
Cavendish’s 29 measurements is 5.45. So these estimates suggest that the Earth’s density is about 5.45 
times the density of water. The currently accepted value for the density of the earth (5.51 times the density 
of water) is slightly larger than the mean of the distribution of density measurements. 


R1.7 (a) 


Frequency 


120 240 360 480 600 
Survival time (days) 


The survival times are right-skewed, as expected. 


(b) 


T T T T 
0 100 200 300 400 500 600 


Survival time (days) 
(c) While both graphs clearly show that the distribution is strongly skewed to the right, the boxplot shows 
that there are several high outliers. 
R1.8 (a) About 11% of low-income and 40% of high-income households consisted of four or more 
people. (b) The shapes of both distributions are skewed to the right. However, the skewness is much 
stronger in the distribution for low income households. On average, household size is larger for high 
income households. In fact, the majority of low-income households consist of only one person. Only 
about 7% of high-income households consist of one person. One-person households might have less 
income because they would include many young single people who have no job or retired single people 
with a fixed income. 


R1.9 (a) The amount of mercury per can of tuna will typically vary by about 0.3 ppm from the mean of 
0.285 ppm. (b) The IQR = 0.380 — 0.071 = 0.309, so any point below 0.071 — 1.5(0.309) = —0.393 or 
above 0.38 + 1.5(0.309) = 0.8435 would be considered an outlier. Because the smallest value is 0.012, 
there are no low outliers. According to the histogram, there are values above 0.8435, so there are several 
high outliers. (c) The mean is much larger than the median of the distribution because the distribution is 
strongly skewed to the right and there are several high outliers. 

R1.10 The distribution for light tuna is skewed to the right with several high outliers. The distribution for 
albacore tuna is more symmetric with just a couple of high outliers. The albacore tuna generally has more 
mercury. Its minimum, first quartile, median, and third quartile are all greater than the respective values 
for light tuna. But that doesn’t mean that light tuna is always better. It has much larger variation in 
mercury concentration, with some cans having as much as twice the amount of mercury as the largest 
amount in the albacore tuna. 
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(b) The first quartile is 30 contacts. The third quartile is 77. IQR = 77 — 30 = 47. Any value below 30 - 
1.5(47) = —40.5 or above 77 + 1.5(47) = 147.5 is an outlier. So the observation of 151 contacts is an 
outlier. (c) It would be better to use the median and JQR to describe the center and variability because the 
distribution of number of contacts is skewed to the right and has a high outlier. 


T1.12 (a) 53/1207 = 0.044 (b) 22/1207 = 1.8% (c) The conditional probabilities are shown in the table. 


Diabetic status 


Number of birth defects Nondiabetic Prediabetic Diabetic 
None 96.1% 96.5% 80.9% 
One or more 3.9% 3.5% 19.1% 


fH One or more birth defects 
[_] No birth defects 


Percent 


Diabetic status 


(d) There is an association between diabetic status and number of birth defects for the women in this 
study. Nondiabetics and prediabetics appear to have babies with birth defects at about the same rate. 
However, those with diabetes have a much higher rate of babies with birth defects. 


T1.13 (a) The longest that any battery lasted was between 550 and 559 hours. (b) Someone might prefer 
to use Brand X because it has a higher minimum lifetime or because its lifetimes are more consistent (less 
variable). (c) Someone might prefer Brand Y because it has a higher median lifetime. 

T1.14 The distribution of reaction time for the Athlete group is slightly skewed to the right. The 
distribution of reaction time for the Other group is roughly symmetric with two high outliers. It appears 
that the Athlete distribution has one high outlier while the Other distribution has two high outliers. The 
reaction times for the students who have not been varsity athletes tended to be slower (median = 292.0 
milliseconds) than for the athletes (median = 261.0 milliseconds). The distribution of reaction time for the 
Other group also has more variability, as their reaction times have an IQR of 70 milliseconds and the 
athletes’ reaction times have an JQR of 64 milliseconds. 


Chapter 2 

Section 2.1 

Answers to Check Your Understanding 

page 95: 1.c 

2. Her daughter weighs more than 87% of girls her age and she is taller than 67% of girls her age. 

3. About 65% of calls lasted less than 30 minutes. This means that about 35% of calls lasted 30 minutes or 
longer. 


4. Q; = 13 minutes; Q3 = 32 minutes; (QR = 32 — 13 = 19 minutes. 
__ 62-67 


page 97: 1. 2=62-674,29=-1.166.° 429. ~ i Lynette’s height is 1.166 standard deviations 
below the mean height of the class. 

_ 74-76 
2. Because Brent’s z-score is —0.85, we know that ~0.85=74-760,— 0-85 ~ ¢ * Solving for o we find 


that o = 2.35 inches. 


page 103: 1. Converting the cost of the rides from dollars to cents will not change the shape. However, it 
will multiply the mean and standard deviation by 100. 

2. Adding 25 cents to the cost of each ride will not change the shape of the distribution, nor will it change 
the variability. It will, however, add 25 cents to the measures of center (mean, median). 


3. Converting the costs to z-scores will not change the shape of the distribution. It will change the mean to 
O and the standard deviation to 1. 


Answers to Odd-Numbered Section 2.1 Exercises 

2.1 (a) Because 17 of the 20 students (85%) own fewer pairs of shoes than Jackson (who owns 22 pairs of 
shoes), Jackson is at the 85th percentile in the number of pairs of shoes distribution. (b) 45% of the boys 
had fewer pairs of shoes than Raul did. Raul was at the 45th percentile. This means that 45% of the 20 
boys, or 9 boys, have fewer pairs of shoes. Therefore, Raul’s response is the 10th value in the ordered list. 
Raul owns 8 pairs of shoes. 

2.3 (a) Because 10 of the 30 observations (33.3%) are below Antawn’s head circumference (22.4 inches), 
Antawn is at the 33.3rd percentile in the head circumference distribution. (b) The player with a head 
circumference of 24 inches is at the 90th percentile of the distribution. 


2.5 This means that the speed limit is set at such a speed that 85% of the vehicle speeds are slower than 
the posted speed. 

2.7 The girl in question weighs more than 48% of girls her age, but is taller than 78% of the girls her age. 
Because she is taller than 78% of girls, but only weighs more than 48% of girls, she is probably fairly 
thin. 

2.9 (a) No; a sprint time of 8 seconds is not unusually slow. A student with an 8-second sprint is at the 
75th percentile, so 25% of the students took that long or longer. (b) The 20th percentile of the distribution 
is approximately 6.7 seconds. 20% of the students completed the 50-yard sprint in less than 6.7 seconds. 
2.11 (a) The first quartile is the 25th percentile. Find 25 on the y-axis, read over to the line and then down 
to the x-axis to get about Q; = 4%. The 3rd quartile is the 75th percentile. Find 75 on the y-axis, read over 
to the line and then down to the x-axis to get about Q3 = 14%. The IQR is approximately 14 — 4 = 10%. 
(b) Arizona, which had 15.1% foreign-born residents that year, is at the 90th percentile. (c) The graph is 
fairly flat between 20% and 27.5% foreign-born residents because there were very few states that had 
20% to 27.5% foreign-born residents that year. 


(d) 
35 
# 30- 
o 25 
5 20 
E is- 
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zs 
0 
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Percent foreign-born residents 
ya 19-873 _ 449 
2.13 (a) The z-score for Montana is z=1.9-8.736.12=-1.12.~ 612. =" "" Montana’s percent of 


foreign-born residents is 1.12 standard deviations below the mean percent of foreign-born residents for all 
states. (b) If we let x denote the percent of foreign-born residents in New York at that time, then we can 


_ 2-873 
solve for x in the equation 2.10=x-8.736.12.7 19 612 ° Thus, x = 21.582% foreign-born residents. 


2.15 (a) The number of pairs of shoes owned by Jackson is 1.10 standard deviations above the average 
number of pairs of shoes owned by the students in the sample. (b) If we let x @ denote the mean number 
of pairs of shoes owned by students in the sample, then we can solve for x © in the equation 

1.10 = 


22—z 


9.42 ° Thus, x =11.64¢ = 11.64 is the mean number of pairs of shoes owned 


1.10=22-x 9.42. 
by the students in the sample. 

2.17 (a) The fact that your standardized score is negative indicates that your bone density is below the 
average for your peer group. In fact, your bone density is about 1.5 standard deviations below average 


among 25-year-old women. (b) If we let o denote the standard deviation of the bone density in Judy’s 
1.45 — 248-956 


reference population, then we can solve for o in the equation —1.45=948-9560. o * Thus, 0 
= 5.52¢/cm?. 

, — 880-500 _j8 
2.19 Eleanor’s standardized score of z=680-500100=1.8° 100 = is higher than Gerald’s 


gas DM 4g 
standardized score of z=29-215=1.6. 5 © 


2.21 (a) The shape of the distribution of corrected long-jump distance will be the same as the original 
distribution of long-jump distance: roughly symmetric with a single peak. (b) The mean of the distribution 
of corrected long-jump distance is 577.3 — 20 = 557.3. The median of the distribution of corrected long- 
jump distance is 577 — 20 = 557 centimeters. (c) The standard deviation of the distribution of corrected 
long-jump distance is the same as the standard deviation of the distribution of long-jump distance, 4.713 
centimeters. The JQR of the distribution of corrected long-jump distance is the same as the JQR of the 
distribution of long-jump distance, 7 centimeters. 


2.23 (a) The shape of the new salary distribution will be the same as the shape of the original salary 
distribution. (b) The mean and median salaries will each increase by $1000. (c) The standard deviation 
and IQR of the new salary distribution will each be the same as they were for the original salary 
distribution. 

2.25 (a) The shape of the distribution of corrected long-jump distance in meters will be the same as the 
distribution of corrected long-jump distance in centimeters: roughly symmetric with a single peak. (b) 
The mean of the distribution of corrected long-jump distance, in meters, is 577.3 — 20 = 557.3 cm + 100 
cm/m = 5.573 meters. (c) The standard deviation of the distribution of corrected long-jump distance, in 
meters, is 4.713 centimeters + 100 = 0.04713 meters. 


2.27 (a) The shape of the resulting salary distribution will be the same as the original distribution of 
salaries. (b) The median will increase by 5% because each value in the distribution is being multiplied by 
1.05. (b) The [QR will increase by 5% because each value in the distribution is being multiplied by 1.05. 


5 100 9 
2.29 (a) The mean temperature reading is 59(77)—1609=25 9 (77) 9 we degrees Celsius. (b) The 


standard deviation of the temperature reading is 59(3)=1.667 9 (3) = 1.667 


2.31 We are told that Mean(Fare) = 2.85 + 2.7 Mean(miles): 15.45 = 2.85 + 2.7 Mean(miles) — 
Mean(miles) = 4.667 miles. 


degrees Celsius. 


To determine the standard deviation of the lengths of his cab rides in miles, we use the following 
equation: 

SD(Fare) = 2.7 SD(miles) = 10.20 = 2.7 SD(miles) — SD(miles) = 3.778 miles 

The mean and standard deviation of the lengths of his cab rides, in miles, are 4.667 miles and 3.778 miles, 
respectively. 

Ze C 

Zaod 

Za? © 


2.39 The distribution is skewed to the right. The two largest values appear to be outliers. The data are 
centered roughly around a median of 15 minutes and the interquartile range is approximately 10 minutes. 


Section 2.2 
Answers to Check Your Understanding 
page 119: 1. The graph is shown here. 


a 
57.0 59.5 62.0 64.5 67.0 69.5 72.0 
Height (in.) 


2. Because 69.5 inches is 2 standard deviations above the mean, approximately 100%-95%2=2.5% 


2 2.5% of young women have heights greater than 69.5 inches. Therefore, 97.5% of young 


women have heights less than 69.5 inches. 


3. Because 62 is 1 standard deviation below the mean, approximately 100%-68%2=16% 


100%-68% _ seg 
2 = 164 of young women have heights below 62 inches. This is not unusually short. 
y — 20-170 _ 5 99 
page 127: 1. (i) z=240-17030=2.33° 30 ~~ ~~“. Table A: The proportion of z-scores above 2.33 


is 1 — 0.9901 = 0.0099. Tech: normalcdf(lower: 2.33, upper: 1000, mean: 0, SD: 1) = 0.0099 (ii) 
normalcdf(lower: 240, upper: 1000, mean: 170, SD: 30) = 0.0098 About 1% of 14-year-old boys have 
cholesterol above 240 mg/dl. 2. (i) z=200-17030=1 and z=240-17030=2.33. 

200—170 240-170 _ 9 99 


ar Tiles m.e= 30 Table A: The proportion of z-scores below z = 1.00 is 0.8413 


and the proportion of z-scores below 2.33 is 0.9901. Thus, the proportion of z-scores between 1 and 2.33 
is 0.9901 — 0.8413 = 0.1488. Tech: normalcdf(lower: 1, upper: 2.33, mean: 0, SD: 1) = 0.1488 (ii) 
normalcdf(lower: 200, upper: 240, mean: 170, SD: 30) = 0.1488. About 15% of 14-year-old boys have 
cholesterol between 200 and 240 mg/dl. 

page 133: 1. (i) Table A: Look in the body of Table A for the value closest to 0.10. A z-score of —1.28 
gives the closest value (0.1003). Tech: invNorm(area: 0.10, mean: 0, SD: 1) = -1.28 


B= ayes eee | . 
~1,28=x-17030 +-38.4=x-170 +x=131.6 2 Om I = LO 


(ii) invNorm(area: 0.10, mean: 170, SD: 30) = 131.6 About 10% of 14-year old boys have cholesterol 
levels that are less than 131.6, so a 14-year-old boy who has a cholesterol level of 131.6 would be at the 


10th percentile of the distribution. 


Answers to Odd-Numbered Section 2.2 Exercises 


2.41 (a) The density curve is shown here. 


0.1 


7 > 
8 1 QRS 5 OE &E 9 10 


Time spent waiting for 
the bus (min) 


(b) Area 5 (base)(height) = (5.3 — 2.5)(0.1) = 0.28 = 28%. On about 28% of days, Sally waits between 2.5 
and 5.3 minutes for the bus. (c) The 70th percentile will have 70% of the wait times to the left of it, so the 


70th percentile of Sally’s wait times is 7 minutes. 

2.43 (a) The density curve must have the height 0.25 because the area must equal 1. (base)(height) = (4) 
(0.25) = 1. (b) Area = (base) (height) = (5 — 3.75)(0.25) = 0.3125 = 31.25%. About 31.25% of the time, 
the light will flash more than 3.75 seconds after the subject clicks “Start.” (c) The 38th percentile of this 
distribution is the time for which 38% of the observations are below it. 

Area = (base)(height) — 0.38 = (x -— 1)(0.25) = 1.52 =x-1 5 x=2.52 

Therefore, the 38th percentile of this distribution is 2.52 seconds. Thirty-eight percent of the time the light 
will flash within 2.52 seconds after the subject clicks “Start.” 

2.45 (a) Mean is C, median is B (the right skew pulls the mean to the right of the median). (b) Mean is B, 
median is B (this distribution is symmetric, so mean = median). 


2.47 The Normal density curve with mean 9.12 and standard deviation 0.05 is shown here. 


T T T T T ’ 
8.97 9,02 9.07 9.12 9.17 9,22 9.27 


Weight (oz) 


2.49 The mean is at the balance point of the distribution, which appears to be 10. We know that the curve 
gets very close to the horizontal axis around 3 standard deviations from the mean. Observing the positive 
side of the curve, the curve shown seems to approach the horizontal axis around the value 16, so 16 
should be approximately 3 standard deviations above the mean. Because 16 — 10 = 6, we estimate 3 
standard deviations to be 6 and therefore 1 standard deviation would be about 2 units. 


2.51 (a) The value 9.02 is 2 standard deviations below the mean, so approximately 100%-95%2=2.5% 


100 %—95 % 
OOK WN — 2.5% 


of bags weigh less than 9.02 ounces. (b) The value 9.07 is 1 standard deviation below 
100%-68% _ se7 

the mean. About 100%-68%2=16% 2 = 167% of the bags weigh less than 9.07 ounces. In other 

words, 9.07 is approximately the 16th percentile of the weights of these potato chip bags. 

9—9.12 


2.53 U) 2=9-9.120.05=—2.40;~ = aoe AE the proportion of z-scores below —2.4 is 0.0082. (ii) 


normalcdf(lower: —1000, upper: 9, mean: 9.12, SD: 0.05) = 0.0082 


About 0.82% of 9-ounce bags of this brand of potato chips weigh less than the advertised 9 ounces. This 
is not likely to pose a problem because the percentage of bags that weigh less than the advertised amount 
is very small. 

2400-2000 


2.0 (1) 2=2400-2000500=0.80; 500 0.9; the proportion of z-scores above 0.80 is 1 - 
0.7881 = 0.2119. (ii) normalcdf(lower: 2400, upper: 1000000, mean: 2000, SD: 500) = 0.2119. About 
21.19% of the meals ordered exceeded the recommended daily allowance of 2400 mg of sodium. 


2.57 (i) z=1200-2000500=-1.60, z=1800-—2000500=-—0.40; 

» — 1200-2000 _ _j gp » — 1800-2000 _ _g ag. 

i 500 — 500 ~~" the proportion of z-scores between —1.60 and —0.40 is 
0.3446 — 0.0548 = 0.2898. (ii) normalcdf(lower: 1200, upper: 1800, mean: 2000, SD: 500) = 0.2898. 


About 28.98% of meals ordered had between 1200 mg and 1800 mg of sodium. 


2.59 (a) z = —1.66 

Table A: The proportion of z-scores above z = —1.66 is 1 — 0.0485 = 0.9515. Tech: normalcdf(lower: 
—1.66, upper: 1000, mean: 0, SD: 1) = 0.9515. The proportion of observations in a standard Normal 
distribution that satisfy z > —1.66 is 0.9515. (b) z = -1.66 and z = 2.85 Table A: The proportion of z-scores 
between —1.66 and 2.85 is 0.9978 — 0.0485 = 0.9493. Tech: normalcdf(lower: —1.66, upper: 2.85, mean: 
0, SD: 1) = 0.9494. The proportion of observations in a standard Normal distribution that satisfy —1.66 < z 


< 2.85 is 0.9494. 
,— 3:53 _ _o we. 
2.61 (a) (i) z=3-5.30.9=-2.56;5 0.9 ~~’ the proportion of z-scores below —2.56 is 0.0052. (ii) 


normalcdf(lower: —1000, upper: 3, mean: 5.3, SD: 0.9) = 0.0053. About 53 out of every 10,000 times 
Mrs. Starnes completes an easy Sudoku puzzle, she does so in less than 3 minutes. This is a proportion of 


_ 6-53 _ ; 
0.0053. (b) (i) z=6-5.30.9=0.78;" 0.9 a0 the proportion of z-scores above 0.78 is 1 — 0.7823 = 


0.2177. 
(ii) normalcdf(lower: 6, upper: 1000, mean: 5.3, SD: 0.9) = 0.2183 


About 21.83% of the time, it takes Mrs. Starnes more than 6 minutes to complete an easy puzzle. (c) (i) 
— 6-53 _g7g , 853 _ 9. 

z=6-5.30.9=0.78, z=8-5.30.9=3;, 0.9 OetB) 2 0.9 35 the proportion of z-scores between 

0.78 and 3.00 is 0.9987 — 0.7823 = 0.2164. (ii) normalcdf(lower: 6, upper: 8, mean: 5.3, SD: 0.9) = 


0.2170. About 21.7% of easy puzzles take Mrs. Starnes between 6 and 8 minutes to complete. 


N 


_ -2-5,3 
2.63 (i) 0.20 area to the left of z-z=-0.84.2 > 2 = —0.84. Solving ~0.84=x-5.30.9 2°34 = G9 
gives x = 4.544. (ii) invNorm(area: 0.2, mean: 5.3, SD: 0.9) = 4.543 minutes. The 20th percentile of Mrs. 


Starnes’s Sudoku times for easy problems is about 4.54 minutes. 


2.65 (i) 0.10 area to the left of z — z = —1.28 and 0.90 area to the left of z — z= 1.28. (ii) invNorm(area: 
0.10, mean: 0, SD: 1) = —1.282 and invNorm(area: 0.90, mean: 0, SD: 1) = 1.282. 


The first and last deciles of the standard Normal distribution are z = —1.282 and z = 1.282, respectively. 
z — 25-0 _gg , — 150-110 _ 1 &. 

2.67 (a) (i) z=125-11025=0.6, z=150-11025=1.6; 25 , 25 ’ the 

proportion of z-scores between 0.6 and 1.6 is 0.9452 — 0.7257 = 0.2195. (ii) normalcdf(lower: 125, upper: 

150, mean: 110, SD: 25) = 0.2195. About 22% of 20- to 34-year-olds have IQ scores between 125 and 


one. =110 
150. (b) (i) 98% area to the left of z — z = 2.05. Solving 2.05=x-110257""° 25 gives x = 161.25. 


(ii) invNorm(area: 0.98, mean: 110, SD: 25) = 161.34. Scores greater than 161.34 qualify for MENSA. 
3.95—-3.98 __ : 
2.69 (a) (i) z=3.95-3.980.02=-1.5;~ “a the proportion of z-scores below —1.5 is 


0.0668. (ii) normalcdf(lower: —1000, upper: 3.95, mean: 3.98, SD: 0.02) = 0.0668. About 7% of the large 
4.05-3.98 _ of, 

lids are too small to fit. (b) (i) z=4.05-3.980.02=3.5;~ 0.02“ *"* the proportion of z-scores 

above 3.50 is approximately 0. (ii) normalcdf(lower: 4.05, upper: 1000, mean: 3.98, SD: 0.02) = 0.0002. 


Approximately 0% of the large lids are too big to fit. (c) It makes more sense to have a larger proportion 


of lids too small rather than too big. If lids are too small, customers will just try another lid. But if lids are 
too large, the customer may not notice and then spill the drink. 

2.71 We are looking for the value of z with an area of 0.15 to the right and 0.85 to the left, and the value 
of z with an area of 0.03 to the right and 0.97 to the left. We get the values z = 1.04 and z = 1.88, 


respectively. Now we need to solve the following system of equations for p and 
75— 


‘ _ 60-p = 
06:1.04=60-1o and 1.88=75-10. and ¢ : 1.04 = —~ and 1.88 = —~. Multiplying both sides of the 


equations by o and subtracting yields 0.840 = 15 or o = 17.86 minutes. Substituting this value back into 
the first equation, we obtain 1.04=60-117.86, or p=60-1.04(17.86)=41.43 
1.04 = —*, or ys = 60 — 1.04(17.86) = 41.43 


17.86 ” minutes. 


2.73 The distribution of highway gas mileage is not approximately Normal because the distribution is 
skewed to the right. 


2.75 The histogram of these data is roughly symmetric and bell-shaped. The mean and standard deviation 
of these data are x =15.825% = 15.825 cubic feet and s, = 1.217 cubic feet. 


e x +1sx=(14.608, 17.042);2 = 182 = (14.608, 17.042); 94 of 36 observations, or 66.7% of 
the observations, are within 1 standard deviation of the mean. 


© x +2sx=(13.391, 18.259);2 = 282 = (13.391, 18.259); 34 of 36 observations, or 94.4% of 
the observations, are within 2 standard deviations of the mean. 


e x +3sx=(12.174, 19.476);2 + 382 = (12.174, 19.476); 36 of 36 observations, or 100% of the 
observations, are within 3 standard deviations of the mean. 

These percentages are quite close to what we would expect based on the 68—95-99.7 rule. Combined with 

the graph, this provides good evidence that this distribution is approximately Normal. 

2.77 The distribution of tuitions in Michigan is not approximately Normal. If it were Normal, then the 


minimum value would be around 3 standard deviations below the mean. However, the actual minimum 
1873—10,614 
has a z-score of just Z=1873-10,6148049=-1.09.~ 8049 = — 1.09. Also, if the distribution were 


Normal, the minimum and maximum would be about the same distance from the mean. However, the 
mean is much farther from the maximum (30,823 — 10,614 = 20,209) than from the minimum (10,614 — 
1873 = 8741). 


2.79 The distribution is approximately Normal because the Normal probability plot is nearly linear. 


2.81 The sharp curve in the Normal probability plot suggests that the data are right-skewed. This can be 
seen in the steep, nearly vertical section in the lower left. These numbers were much closer to the mean 
than would be expected in a Normal distribution, meaning that the values that would be in the left tail are 
piled up close to the center of the distribution. 


2.83 (a) A Normal probability plot is shown. 
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(b) The plot is fairly linear, indicating that the distribution of usable capacity is approximately Normal. 


2.85 b 
2.87 b 
2.89 a 


2.91 36 of the 38 family incomes in Indiana are below $95,000. Because 36/38 = 0.95, or 95%, this 
individual’s income is at the 95th percentile. 35 of the 44 family incomes in New Jersey are below 


$95,000. Because 35/44 = 0.80, or 80%, this individual’s income is at the 80th percentile. 
, = 95,000-47,400 _ | go 

Indiana: z=95,000-47,40029,400=1.62" 29,400 —_ 

95,000—58,100 


New Jersey: 2=95,000-58, 10014,900=0.88~ = 14,900 


The individual from Indiana has an income that is 1.62 standard deviations above the mean of 


= 0.88 


$47,400. The individual from New Jersey has an income that is 0.88 standard deviation above the mean of 
$58,100. 

The individual from Indiana has a higher income, relative to others in his or her state because he or 
she had a higher percentile (95th versus 80th) and had a higher z-score (1.62 versus 0.88) than the 
individual from New Jersey. 


Answers to Chapter 2 Review Exercises 


_ 179-170 _ ; 
R2.1 (a) z=179-1707.5=1.20;,5 0 75 4.28) Paul’s height is 1.20 standard deviations above the 


average male height for his age. (b) 85% of boys Paul’s age are shorter than Paul. 


R2.2 (a) Reading up from 7 hours on the x-axis to the graphed line and then across to the y-axis, we see 
that 7 hours corresponds to about the 58th percentile. (b) To find Q, start at 25 on the y-axis, move across 
to the line and down to the x-axis. Q; is approximately 2.5 hours. To find Q3, start at 75 on the y-axis, 
move across to the line and down to the x-axis. Q3 is approximately 11 hours. Thus, IQR = 11 - 2.5=8.5 


hours per week. 


R2.3 (a) If we converted the guesses from feet to meters, the shape of the distribution would not change. 
43.7 _ 


—— = 13.32 
The new mean would be 43.73.28=13.32 3.28 3.3 meters, the median would be 423.28=12.80 


2. — 12.80 335 — 3-81 

3.28 “~~ meters, the standard deviation would be 12.53.28=3.81 3.28 meters, and the IQR 
12.5 _ « 

would be 12.53.28=3.81 3.28 Paes meters. (b) The mean error would be 43.7 — 42.6 = 1.1 feet. The 


standard deviation of the errors would be the same as the standard deviation of the guesses, 12.5 feet. (c) 
A total of 61 of the 66 students estimated the width of their classroom to be less than 63 feet, so the 
student who estimated the classroom width as 63 feet is at the 61/66 = 0.924 = 92.4th percentile. 


R2.4 (a) The percent of observations that have values less than 13 is 1 — 0.08 = 0.92 = 92%. (b) Answers 
will vary, but the line indicating the median (line A in the graph) should be slightly to the right of the 
main peak, with half of the area to the left and half to the right. (c) Answers will vary, but the line 
indicating the mean (line B in the graph) should be slightly to the right of the line for the median at the 
balance point. 


AB 


R2.5 (a) About 99.7% of the observations will fall within 3 standard deviations of the mean. About 97.7% 
of the babies had birth weights between 2135 and 5201 grams. (b) (i) z=2500-3668511=-2.29; 

2500-3668 
sc TT eI the proportion of z-scores below —2.29 is 0.0110. (ii) normalcdf(lower: —1000, 
upper: 2500, mean: 3668, SD: 511) = 0.0111. About 1% of babies will be identified as low birth weight. 
(c) (i) The first quartile is the boundary value with 25% of the area to its left = z = —0.67. The third 
quartile is the boundary value with 75% of the area to its left — z = 0.67. Solving -0.67=x-3668511 


0.67 — 23668 0.67 — 223068 
511 gives Q3 = 3325.63. Solving 0.67=x-3668511 511 gives Qs = 4010.37. (ii) 


invNorm(area: 0.25, mean: 3668, SD: 511) gives Q; = 3323.34 invNorm(area: 0.75, mean: 3668, SD: 
511) gives Q3 = 4012.66. The quartiles are Q; = 3323.34 grams and Q3 = 4012.66 grams. 


pero 500—694 _ ae _ 900—694 694 
R2.6 (a) (i) z=500-694112=-1.73, 2=900-694112=1.84;7 — — 112 1.73, z= —a— = 1.84; 


the proportion of z-scores below —1.73 is 0.0418. The proportion of z-scores above 1.84 is 1 — 0.9671 = 
0.0329. (ii) normalcdf(lower: —1000, upper: 500, mean: 694, SD: 112) = 0.0418 and normalcdf(lower: 
900, upper: 100000, mean: 694, SD: 112) = 0.0329. The percent of test-takers who earn a score less than 
500 or greater than 900 on the GRE Chemistry test is 4.18% + 3.29% = 7.47%. 


9 = x—694 
22-33 = Ta gives x = 954.96. (ii) 


(b) (i) 99% area to left of z — z = 2.33. Solving 2.33=x-69411 
invNorm(area: 0.99, mean: 694, SD: 112) = 954.55 


The 99th percentile score on the GRE Chemistry test is 954.55. 


R2.7 (a) (i) 2=1.2-1.050.08=1.88, 2=1-1.050.08=-0.063; 00s 1-88, 2 = = —0.063; 


the proportion of z-scores between —0.63 and 1.88 is 0.9699 — 0.2643 = 0.7056. (ii) normalcdf(lower: 1, 

upper: 1.2, mean: 1.05, SD: 0.08) = 0.7036. About 70% of the time the dispenser will put between 1 and 
1.2 ounces of ketchup on a burger. (b) Because the mean of 1.1 is in the middle of the interval from 1 to 
1.2, we are looking for the middle 99% of the distribution. This leaves 0.5% in each tail. 


eat 
(i) 0.005 area to the left of z + z = -2.58. Solving -2.58=1-1.10 2-28 = —3— gives o = 0.039. A 


standard deviation of at most 0.039 ounce will result in at least 99% of burgers getting between 1 and 1.2 


ounces of ketchup. 


R2.8 The distribution of percent of residents aged 65 and older in the 50 states and the District of 
Columbia is roughly symmetric and somewhat bell-shaped. The mean and standard deviation of these 
data are x =13.255%® = 13.255 % and Sy = 1.668%. 


e x +1sx=(11.587, 14.923); + 1s, = (11.587, 14.923); 49 of 51 observations, or 78.4% of the 
observations, are within 1 standard deviation of the mean. 


e x £2sx=(9.919, 16.591);2 = 282 = (9-919, 16.591); 4g of 51 observations, or 94.1% of the 
observations, are within 2 standard deviations of the mean. 


e x +3sx=(8.251, 18.259);7 = 38, = (8.251, 18.259); 59 of 51 observations, or 98.0% of the 


observations, are within 3 standard deviations of the mean. 
These percentages are close to what we would expect based on the 68—95-99.7 rule. Combined with the 
graph, this provides good evidence that this distribution is approximately Normal. 
R2.9 The curve in the Normal probability plot suggests that the data are slightly right-skewed. This can be 
seen in the steep, nearly vertical section in the lower left. These numbers were much closer to the mean 
than would be expected in a Normal distribution, meaning that the values that would be in the left tail are 
piled up close to the center of the distribution. 


Answers to Chapter 2 AP® Practice Test 
T2.1e 
220d 
T2.3b 
T2.4b 
T2.5a 
T2.6e 
ec 
T2.8d 
T2.9b 
T2.10c¢ 


T2.11 (a) A total of 26 of the 40 sale prices were less than the house indicated in red on the dotplot, so 
that home is at the 26/40 = 0.65 = 65th percentile. (b) z=234000—20338887609=0.35. 
= — 234000-203388 __ 9 gx 

87609 Interpretation: The sale price for this home is 0.35 standard deviation above 
the average sale price for the homes in the sample. 

6—7.11 

T2.12 (a) (i) 2=6-7.110.74=-1.5;" a ee the proportion of z-scores below —1.5 is 0.0668. 
(ii) normalcdf(lower: —1000, upper: 6, mean: 7.11, SD: 0.74) = 0.0668. About 6.68% of the students ran 
the mile in less than 6 minutes, which means that (0.0668)(12000) = 801.6, or about 802, students ran the 
mile in less than 6 minutes. (b) (i) 0.90 area to the left of z — z = 1.28. Solving 1.28=x-—7.110.74 


_ z—-7.11 
= 0.74 gives x = 8.06 minutes. (ii) invNorm(area: 0.90, mean: 7.11, SD: 0.74) = 8.06 minutes. It 


took about 8.06 minutes for the slowest 10% of students to run the mile. (c) If the mile run times were 


converted from minutes to seconds, the mean would be (0.74)(60) = 44.4 seconds and the standard 


deviation would be (0.74)(60) = 44.4 seconds. (i) z=400-426.644.4=-0.60, z=500-426.644.4=1.65; 


, — 100-4266 _ _ yey 5 — 500-4266 _ 4 gr. 
eo 444 Se AC 44.4 ~ +99) the difference of z-scores between —0.60 and 1.65 is 


0.9505 — 0.2743 = 0.6762. (ii) normalcdf(lower: 400, upper: 500, mean: 426.6, SD: 44.4) = 0.6763. 
About 67.6% of students who ran the mile had times between 400 and 500 seconds. 


T2.13 No, these data do not seem to follow a Normal distribution. First, there is a large difference 
between the mean and the median. In a Normal distribution, the mean and median are the same, but in this 
distribution the mean is 48.25 and the median is 37.80. Second, the distance between the minimum and 
the median is 37.80 — 2 = 35.80, but the distance between the median and the maximum is 204.90 — 37.80 


= 167.10. Ina Normal distribution, these distances should be the same. Because the mean is larger than 
the median and the distance from the median to the maximum is larger than the distance from the 
minimum to the median, the distribution of oil recovered appears to be skewed to the right. 


Chapter 3 

Section 3.1 

Answers to Check Your Understanding 

page 159: 1. The explanatory variable is the amount of sugar (in grams). The response variable is the 
number of calories. The amount of sugar helps to explain, or predict, the number of calories in movie- 
theater candy. 


2. A scatterplot is shown here. 


Number of calories 


0 50 100 150 
Amount of sugar (g) 
3. There is a moderately strong, positive, linear relationship between the amount of sugar contained in 
movie-theater candy and the number of calories in the candy. The point for peanut M&M’S® is a potential 
outlier with 79 grams of sugar and 790 calories. 
page 162: The correlation of r = —0.838 confirms that the linear association between dash time and long- 


jump distance is strong and negative. 


Answers to Odd-Numbered Section 3.1 Exercises 

3.1 (a) Water temperature is the explanatory variable and weight gain is the response variable. Water 
temperature may help predict or explain changes in the response variable, weight gain. Weight gain 
measures the outcome of the study. (b) Either variable could be the explanatory variable because each one 


could be used to predict or explain the other. 


3.3 A scatterplot is shown here. 


Backpack weight (Ib) 
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Body weight (Ib) 


3.5 Other than several athletes who weigh much more than other athletes of the same height, there is a 


moderately strong, positive, linear relationship between height and weight for these athletes. 


3.7 There is a moderately strong, positive, linear association between backpack weight and body weight 


for these students. There is one possible outlier in the graph—the hiker with body weight 187 pounds and 


pack weight 30 pounds. This hiker makes the form appear to be nonlinear for weights above 140 pounds. 


3.9 (a) A scatterplot with speed as the explanatory variable is shown. 
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(b) There is a strong, nonlinear relationship between speed and amount of fuel used. The relationship is 
negative for speeds up to 60 km/h and positive for speeds beyond 60 km/h. There are no clear outliers. 


3.11 For both groups of athletes, there is a moderately strong, positive, linear association between height 
and weight; however, athletes who participate in the shot put, discus throw, and hammer throw tend to 
weigh more than other track and field athletes of the same height. 


3.13 The relationship is positive, so r > 0. Also, r is closer to 1 than to 0 because the relationship is strong. 


3.15 The correlation of 0.92 indicates that the linear relationship between number of turnovers and 
number of points scored for players in the recent NBA season is strong and positive. 


3.17 Probably not. Although there is a strong, positive association, an increase in turnovers is not likely to 
cause an increase in points for NBA players. It is likely that both of these variables are changing due to 
several other variables, such as time played. 

3.19 (a) The correlation of 0.87 indicates that the linear relationship between amount of sodium and 
number of calories is strong and positive. (b) The hot dog with the lowest calorie content increases the 
correlation. It falls in the linear pattern of the rest of the data. 


3.21 (a) The scatterplot shows a strong, positive, linear relationship between the femur lengths and 
humerus lengths. It appears that all 5 specimens come from the same species. 
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(b) The femur measurements have mean of 58.2 cm and a standard deviation of 13.2 cm. The humerus 
measurements have a mean of 66 cm and a standard deviation of 15.89 cm. The table shows the 


standardized measurements (labeled zfemur and zhumerus) and the product (zfemur x zhumerus) of the 


standardized measurements. 


Femur Humerus z femur z humerus Product 
38 41 =1,53030 =1,57332 2.40765 
56 63 —1.16667 —0.18880 0.03147 
59 70 0.06061 0.25173 0.01526 
64 72 0.43939 0.37760 0.16591 
74 84 1.19697 1.13279 1.35591 


The sum of the products is 3.97620, so the correlation coefficient is r=14(3.97620)=0.9941. 


p — 1Y¢: ; = f/O6 
ey (3.97620) = 0.9941. The very high value of the correlation confirms the strong, positive linear 


association between femur length and humerus length in the scatterplot from part (a). 

3.23 (a) Correlation is unitless. (b) The correlation would stay the same. Correlation makes no distinction 
between explanatory and response variables. (c) If sodium was measured in grams instead of milligrams, 
the correlation would still be 0.87. Because r uses the standardized values of the observations, r does not 
change when we change the units of measurement of x, or y, or both. 

3.25 We would expect the height of women at age 4 and their height as women at age 18 to be the highest 
correlation because it is reasonable to expect taller children to become taller adults and shorter children to 
become shorter adults. The next highest would be the correlation between the heights of male parents and 
their adult children because they share genes. Tall fathers tend to have relatively tall sons, and short 
fathers tend to have relatively short sons. The lowest correlation would be between husbands and their 
wives. Some tall men may prefer to marry tall women, but this isn’t always the case. 


3.27 Answers will vary. Here is one possibility. 


Rob’s score 


50 ] T T T T 
50 60 70 80 90 100 


Mare’s score 


3.29 a. 
Sold 
3.33 b 


3.35 One possible histogram is shown here. 


Frequency 


0 8 16 24 32 
Weight (mg) 


The distribution of weight is skewed to the right with several possible high outliers. The median weight is 
5.4 mg and the JQR is 5.5 mg. 


Section 3.2 

Answers to Check Your Understanding 

page 181: 1. y§=100+40(16)=740 grams¥ = 100 + 40(16) = 740 grams 

2. The residual = actual y — predicted y = 700 — 740 = —40. Interpretation: This rat weighed 40 grams less 
than the weight predicted by the regression line with x = 16 weeks old. 

3. The time is measured in weeks for this equation, so 2 years converts to 104 weeks. We then predict the 
rat’s weight to be y\=100+40(104)=4260 srams,Y = 100 + 40(104) = 4260 grams, \ nich is equivalent 
to 9.4 pounds (about the weight of a large newborn human). This is unreasonable and is the result of 
extrapolation. 

page 182: 1. The slope is 40. Interpretation: The predicted weight goes up by 40 grams for each increase 
of 1 week in the rat’s age. 2. Yes, the y intercept has meaning in this context. The y intercept, 100, is the 
predicted weight (in grams) for a rat at birth (x = 0 weeks). 

page 188: 1. The equation of the least-squares regression line is y\=16.2649+0.0908x, 

y = 16.2649 + 0.09082, where x = body weight and yy = the predicted backpack weight. 


2. A residual plot is given. 
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Body weight (Ib) 
3. Because there appears to be a negative—positive—negative pattern in the residual plot, a linear model is 
not appropriate for these data. 
page 194: 1. y\=33.347+13.2854x,Y = 33.347 + 13.28542, where x = duration of the most recent 
eruption (min) and y/¥ = predicted interval of time until the next eruption (min). 


2. The predicted interval of time until the next eruption goes up by 13.2854 minutes for each increase of 1 


minute in the duration of the most recent eruption. 


3. The actual interval of time until the next eruption (min) is typically about 6.49 minutes away from the 
time predicted by the least-squares regression line with x = duration of the most recent eruption (min). 


4. r2 = 85.4% of the variability in interval is accounted for by the least-squares regression line with x = 


duration. 


Answers to Odd-Numbered Section 3.2 Exercises 


3.37 (a) yA=60.7+0.139(200)=88.59 = 80.7 + 0.139(200) = 88.5 (H) ya=60.7+0.139(400)=116.3(c) 


y = 60.7 + 0.139(400) = 116.3(c) (c) 1am more confident in the prediction in part (a) than the 


prediction in part (b). The payroll for the teams varied from about $75 million to about $275 million. $200 
million is in this interval of payrolls, but $400 million is not, which makes the prediction in part (b) an 
extrapolation. 

3.39 The predicted number of wins for the Chicago Cubs, who spent $182 million on payroll, is 
y\=60.7+0.139(182)=85.998 wins.Y = 60.7 + 0.139(182) = 85.998 wins. The residual = actual y — 
predicted y = 103 — 85.998 = 17.002. Interpretation: The Chicago Cubs won 17.002 more games than the 
number of games predicted by the regression line with x = $182 million. 

3.41 (a) The slope is 0.139. Interpretation: The predicted number of wins goes up by 0.139 for each 
increase of $1 million in payroll. (b) The y intercept does not have meaning in this context. It is not 
reasonable for a team to have a payroll of $0. 

3.43 (a) The predicted number of steps for Kiana, who is 67 inches tall, is 

y\=113.6-0.921(67)=51.893 steps.J = 113.6 — 0.921(67) = 51.893 steps. The residual = actual y — 
predicted y = 49 — 51.893 = —2.893. Interpretation: Kiana took about 2.893 fewer steps than the number 
of steps predicted by the regression line with x = 67 inches. (b) Since Matthew is 10 inches taller than 
Samantha, I expect Matthew to take (10)(—0.921) = 9.21 fewer steps than Samantha. 

3.45 (a) The regression lines are nearly parallel, but the y intercept is much greater for the throwers. (b) 
Discus thrower: y\=—115+5.13(72)=254.36 pounds;¥ = —115 + 5.13(72) = 254.36 pounds; 79 inch 


sprinter: y\=—297+6.41(72)=164.52 pounds.Y = —297 + 6.41(72) = 164.52 pounds. paced on the 
least-squares regression lines computed from the data, we expect a 72-inch discus thrower to weigh about 
89.84 pounds more than a 72-inch sprinter. 


3.47 No; there is an obvious negative—positive-negative pattern in the residual plot so a linear model is 
not appropriate for these data. A curved model would be better. 


3.49 The predicted mean weight of infants in Nahya who are 1 month old is yA=4.88+0.267(1)=5.147 kg. 


y = 4.88 + 0.267(1) = 5.147 kg. From the residual plot, the mean weight of 1-month-old infants is 


about 0.85 kg less than predicted. So the actual mean weight of the infants when they were 1 month old is 
about 5.147 — 0.85 = 4.297 kg. 

3.51 (a) See part (b). (b) y\=300.04+2.829x,Y = 300.04 + 2.8292, where yy = the predicted number of 
calories and x = the amount of sugar (in grams). 
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(c) The line calculated in part (b) is called the “least-squares” regression line because this is the line that 
makes the sum of the squares of the residuals as small as possible. 


3.53 A residual plot is shown. 
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The linear model relating the amount of sugar to the number of calories is appropriate because there is no 
leftover pattern in the residual plot. The residuals look randomly scattered around the residual = 0 line. 
3.55 (a) The actual number of strides required to walk the length of a school hallway is typically about 
3.50 away from the number predicted by the least-squares regression line with x = height of a student (in 
inches). (b) About 39.9% of the variability in number of steps required to walk the length of a school 
hallway is accounted for by the least-squares regression line with x = height of a student (in inches). 

3.57 (a) The predicted free skate score is y\=—16.2+2.07(78.5)=146.295. 

y = —16.2 + 2.07(78.5) = 146.295. The residual is y-y\=150.06-146.295=3.765. 

y— y = 150.06 — 146.295 = 3.765. Interpretation: Yu-Na Kim’s free skate score was 3.765 points 
higher than predicted based on her short program score. (b) The slope is 2.07. Interpretation: The 
predicted free skate score increases by 2.07 points for each additional 1-point increase in the short 
program score. (c) The actual free skate score is typically about 10.2 points away from the score predicted 
by the least-squares regression line with x = short program score. (d) About 73.6% of the variability in 
free skate score is accounted for by the least-squares regression line with x = short program score. 

3.59 (a) Yes; there is no leftover pattern in the residual plot, so a linear model is appropriate for these 
data. (b) r* = 60.21% and the slope is positive, so the correlation is r=0.6021=0.776. 

r= ¥0.6021 = 0.776. (@) y\=1.0021+0.0708x,Y = 1.0021 + 0.07082, where x is the number of 
Mentos and yy is the predicted amount expelled. (d) The value of s = 0.067 ml. Interpretation: The 
actual amount expelled is typically about 0.067 ml away from the amount predicted by the least-squares 
regression line with x = number of Mentos. The value of r* is 60.21%. Interpretation: About 60.21% of 
the variability in amount expelled is accounted for by the least-squares regression line with x = number of 


Mentos. 


3.61 (a) yA=11.898-0.041(42)=10.176 mph.¥ = 11-898 — 0.041(42) = 10.176 mph. The residual = 
actual y — predicted y = 2.2 — 10.176 = —7.976 mph. Interpretation: The actual average wind speed was 
7.976 mph less than the average wind speed predicted by the regression line with x = 42°F. (b) The slope 
is -0.041. Interpretation: The predicted average wind speed decreases by 0.041 mph for each additional 
1-degree increase in average temperature (in degrees Fahrenheit). (c) The actual average wind speeds 
typically vary s = 3.66 mph from the values predicted by the least-squares regression line using x = 
average temperature. (d) r* = 4.8% of the variability in average wind speed is accounted for by the least- 


squares regression line using x = average temperature. 


. b = 0.5 (32) = 0.54. . . 
3.63 (a) The slope is b1=0.5(2.72.5)=0.54. = The y intercept is bp = 68.5 — 
0.54(64.5) = 33.67. So the equation for predicting y = husband’s height from x = wife’s height is 
y\=33.67+0.5x,Y = 33.67 + 0.52, (b) If the value of x is one standard deviation below x @, the 
predicted value of y will be r standard deviations of y below y. So, the predicted value for the husband is 
68.5 — 0.5(2.7) = 67.15 inches. 


3.65 (a) yA=x,Y = ©, where yw = predicted grade on final and x = grade on midterm. (b) A student with 


a score of 50 on the midterm is predicted to score y\=46.6+0.41(50)=67.19 = 46.6 + 0.41(50) = 67.1 


on the final. A student with a score of 100 on the midterm is predicted to score y\=46.6+0.41(100)=87.6 


y = 46.6 + 0.41(100) = 87.6 on the final. (c) These predictions illustrate regression to the mean 


because the student who did poorly on the midterm (50) is predicted to do better on the final (closer to the 
mean), whereas the student who did very well on the midterm (100) is predicted to do worse on the final 
(closer to the mean). 

3.67 (a) Because Jacob has an above-average height and an above-average vertical jump, his point 
increases the positive slope of the least-squares regression line and decreases the y intercept. (b) Jacob’s 
vertical jump is farther from the least-squares regression line than the other students’ vertical jumps. 
Because Jacob’s point has such a large residual, it increases the standard deviation of the residuals. Also, 
because Jacob’s vertical jump is farther from the least-squares regression line than the other students’ 
vertical jumps, the value of r* decreases. The linear association is weaker because of the presence of this 
point. 

3.69 (a) A scatterplot of this relationship is shown here. There is a moderate, positive, linear association 
between HbA and FBG. There are possible outliers to the far right (subject 18) and near the top of the plot 
(subject 15). 
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(b) Because the point for subject 18 is in the positive, linear pattern formed by most of the data values, it 


will make the correlation closer to 1. Also, because the point is likely to be below the least-squares 
regression line, it will “pull down” the line on the right side, making the slope closer to 0. Without the 
outlier, the correlation decreases from r = 0.4819 to r = 0.3837, as expected. Likewise, without the outlier, 
the equation of the line changes from y“=66.4+10.4x to yA=52.3+12.1x. 

y = 66.4 + 10.42 toy = 52.3 + 12.12. (c) The point for subject 15 makes the correlation closer to 0 
because it decreases the strength of what would otherwise be a moderately strong positive association. 
Because this point’s x coordinate is very close to x “, it won’t influence the slope very much. However, 
it will make the y intercept increase because its y coordinate is so large compared to the rest of the values. 
Without the outlier, the correlation increases from r = 0.4819 to r = 0.5684, the slope changes from 10.4 
to 8.92, and the y intercept increases from 66.4 to 69.5. 


3.71a 
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__ 25-18.7 ; 
3.79 (a) (i) 2=25-18.74.3=1.47;, 43 * 0.9292. (ii) normalcdf(lower: —-1000, upper: 25, 
mean: 18.7, SD: 4.3) = 0.9286. About 93% percent of vehicles get worse combined mileage than the 
Chevrolet Malibu. (b) Table A: Look in the body of ‘Table A for the value closest to 0.90. A z-score of 


_ z-18.7 
1.28 gives the closest value (0.8997). Solving 1.28=x-18.74.3""78 43° gives x = 24.2. Tech: 


invNorm(area: 0.9, mean: 18.7, SD: 4.3) = 24.2. The top 10% of all vehicles get at least 24.2 mpg. 


= 1.47 


Answers to Chapter 3 Review Exercises 


R3.1 (a) There is a moderate, positive, linear association between gestation and life span. Without the 
outliers at the top and in the upper right, the association appears moderately strong, positive, and curved. 
(b) The hippopotamus makes the correlation closer to 0 because it decreases the strength of what would 
otherwise be a moderately strong positive association. Because this point’s x coordinate is very close to 
x £, it won’t influence the slope very much. However, it makes the y intercept higher because its y 
coordinate is so large compared to the rest of the values. Because it has such a large residual, it increases 
the standard deviation of the residuals. (c) Because the Asian elephant is in the positive, linear pattern 
formed by most of the data values, it will make the correlation closer to 1. Also, because the point is 
likely to be above the least-squares regression line, it will “pull up” the line on the right side, making the 
slope larger and the y intercept smaller. Because this point is likely to have a small residual, it decreases 
the standard deviation of the residuals. 

R3.2 (a) A positive relationship means that dives with larger values of depth also tend to have larger 
values of duration. (b) A linear relationship means that when depth increases by 1 meter, dive duration 
tends to change by a constant amount, on average. (c) A strong relationship means that the (dive depth, 
dive duration) data points fall close to a line. (d) If the variables are reversed, the correlation will remain 
the same. However, the slope and y intercept will be different. 


R3.3 (a) A linear model is appropriate for these data because there is no leftover pattern in the residual 
plot. (b) Because r* = 0.837 and the slope is positive, the correlation r=+0.837=0.915. 


r= +70.837 = 0.915. Interpretation: The correlation of r = 0.915 confirms that the linear association 


between the age of cars and their mileage is strong and positive. (c) y\=3704+12,188x, 

y = 3704 + 12, 1882, where x represents the age and yA represents the predicted mileage of the cars. 
(d) For a 6-year-old car, the predicted mileage is y\=3704+12,188(6)=76,832. 

y = 3704 + 12, 188(6) = 76, 832. The residual for this particular car is y-y\=65,000-76,832=—11,832. 
y— y = 65,000 — 76,832 = —11,832. Interpretation: The actual number of miles this teacher has 
driven was 11,832 less than the number of miles predicted by the regression line with x = 6 years. (e) The 
value of s = 20,870.5 miles. Interpretation: The actual number of miles is typically about 20,870.5 miles 
away from the number of miles predicted by the least-squares regression line with x = age (in years). The 
value of r* = 83.7%. Interpretation: About 83.7% of the variability in mileage is accounted for by the 
least-squares regression line with x = age (in years). 

R3.4 (a) Average March temperature is the explanatory variable because changes in March temperature 
probably have an effect on the date of first blossom. Also, we are predicting the date of first blossom from 


temperature. 


Days in April to first blossom 


Temperature in Celsius 


(b) The correlation is r = —0.85; y\=33.12-4.69x,Y = 33.12 — 4.692, where x represents the average 
March temperature and yA represents the predicted number of days. The slope is —4.69. Interpretation: 
The predicted number of days in April to the first blossom decreases by 4.69 days for each additional 1- 
degree increase in average March temperature (in degrees Celsius). The y intercept tells us that if the 
average March temperature was 0 degrees Celsius, the predicted number of days in April to first blossom 
is 33.12 (May 3). However, x = 0 is outside of the range of data, so this prediction is an extrapolation and 
may not be trustworthy. (c) No, x = 8.2 is well beyond the values of x we have in the data set (1.5 to 6.2). 
This prediction would be an extrapolation. (d) The predicted number of days until first blossom when the 
average March temperature was 4.5°C is y\=33.12-4.69(4.5)=12.015.4 = 33.12 — 4.69(4.5) = 12.015. 
The residual is y-y4=10-12.015=—2.015.Y y = 10 —12.015 = —2.015. Interpretation: The actual 
number of days until first blossom was 2.015 days less than the number of days predicted by the 
regression line with x = 4.5°C. 
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There is no leftover pattern in the residuals, indicating that a linear model is appropriate. 
R3.5 (a) b1=0.6(830)=0.16; bO=75-0.16(280)=30.2; yA=30.2+0.16x, 


b, = 0.6 (3) ~ 0.16; by = 75 — 0.16(280) = 30.2; § — 30.2 + 0.162, : 
, where y/¥ = the predicted 


final exam score and x = total score before the final examination. (b) y4=30.2+0.16(300)=78.2 
y = 30.2 + 0.16(300) = 78.2 (c) The least-squares regression line is the line that minimizes the sum of 
squared distances between the actual exam scores and predicted exam scores. (d) Because r = 0.36, only 
36% of the variability in the final exam scores is accounted for by the linear model relating final exam 
scores to total score before the final exam. More than half (64%) of the variation in final exam scores is 
not accounted for by the least squares regression line, so Julie has a good reason to think this is not a good 
estimate. 


R3.6 Even though there is a high correlation between number of calculators and math achievement, we 
shouldn’t conclude that increasing the number of calculators will cause an increase in math achievement. 
It is possible that students who are more serious about school have better math achievement and also have 
more calculators. 


Answers to Chapter 3 AP® Practice Test 
T3.1e 
T.2d 
T3.3e 
T3.4a 
I3.a¢ 
T3.6 b 
T3.7e 
T3.8 b 
T3.9 e 
T3.10d 


T3.11 (a) There is a strong, positive linear association between Sarah’s age and her height. 


Height (cm) 
8 


Age (months) 


(b) The regression line for predicting y = height from x = age is yA=71.95+0.3833x. 

y = 71.95 + 0.38332. (c) At age 48 months, we predict Sarah’s height to be 
y\=71.95+0.3833(48)=90.348 cm. = 71-95 + 0.3833(48) = 90.348 cm. the residual for Sarah is 
y-y=90-90.348=—0.348.Y y = 90 — 90.348 = —0.348. Interpretation: Sarah’s actual height was 
0.348 cm less than the height predicted by the regression line with x = 48 months. (d) No; obviously, the 
linear trend will not continue until she is 40 years old. Our data were based only on the first 5 years of life 
and predictions should only be made for ages 0-5. 


T3.12 (a) The unusual point is the one in the upper righthand corner with isotope value about —19.3 and 
silicon value about 345. This point is unusual in that it has such a high silicon value for the given isotope 
value. (b) (i) If the point were removed, the correlation would get closer to —1, because it does not follow 
the linear pattern of the other points. (ii) Because this point is “pulling up” the line on the right side of the 
plot, removing it will make the slope steeper (more negative) and make the y intercept smaller. Note that 
the y-axis is to the right of the points in the scatterplot. (iii) Because this point has a large residual, 
removing it will make the size of the typical residual (s) a little smaller. 

T3.13 (a) Yes; because there is no obvious leftover pattern in the residual plot, a linear model is 
appropriate for describing the relationship between wildebeest abundance and percent of grass area 
burned. (b) y\=92.29-0.05762x,Y = 92.29 — 0.057622, where x represents the number of wildebeest 
and yy represents the predicted percent of the grass burned. (c) The slope = —0.05762. Interpretation: 
The predicted percent of grassy area burned decreases by about 0.058 percent for each additional 1000 
wildebeest. The y intercept does not have meaning in this context, as making a prediction for 0 wildebeest 
is a big extrapolation. It is impossible to know what would happen going from some wildebeest to no 
wildebeest. (d) The value of s = 15.988%. Interpretation: The actual percentage of burned area is 
typically about 15.988% away from the percent predicted by the least-squares regression line with x = 
number of wildebeest (1000s). The value of r? = 64.6%. Interpretation: About 64.6% of the variability in 
percentage of burned area is accounted for by the least-squares regression line with x = number of 
wildebeest (1000s). 


Chapter 4 
Section 4.1 
Answers to Check Your Understanding 


page 225: 1. Convenience sample; this could lead him to overestimate the quality if the farmer puts the 
best oranges on top or if the oranges at the bottom of the crate are damaged from the weight on top of 
them. 


2. Voluntary response sample; those who are happy that the United Nations has its headquarters in the 


U.S. already have what they want and so are less likely to worry about responding to the question. This 
means that the proportion who answered “No” in the sample is likely to be higher than the true proportion 
in the U.S. who would answer “No.” 


page 228: 1. Number the pieces of wood from 1 to 1000. Use the command randInt(1,1000) to select 5 
different integers from 1 to 1000. Inspect the corresponding 5 pieces of wood. 

2. Number the pieces of wood from 000 to 999. Move along a line of random digits from left to right, 
reading three-digit numbers, until 5 different numbers between 000 and 999 have been selected. Inspect 
the corresponding 5 pieces of wood. 


page 232: 1. Because the quality of the pencils might be the same within each shift, but differ across 
shifts, use shifts as strata. At the end of each 8-hour shift, label all the pencils produced during that shift 
from 1 to N, where N is the total number of pencils produced on that shift. Generate 100 different random 
integers from 1 to N and select those pencils for inspection. 

2. The boxes of pencils could be used as a cluster because it would be relatively easy to select boxes. At 
the end of the day, label all the boxes of pencils 1 to N, where N is the number of boxes produced that 
day. Generate 30 different random integers from 1 to N and inspect all the pencils in the selected boxes. 


3. Stratified: We are guaranteed to inspect 100 pencils from each of the three shifts. This will lead to a 
more precise estimate of overall quality if quality is consistent within each shift but differs across the 
three shifts. Cluster: Simplifies the sampling process. Rather than having to label every pencil produced, 
only the boxes of 10 pencils would need to be labeled. 


page 235: 1. (a) Undercoverage (b) Nonresponse (c) Convenience 

2. The estimate of 84% is greater than the percent of all people in the population who would oppose 
banning disposable diapers. By making it sound as if they are not a problem in the landfill, this question 
will result in fewer people suggesting that we should ban disposable diapers. 


Answers to Odd-Numbered Section 4.1 Exercises 


4.1 Population: The 1000 envelopes stuffed during a given hour. Sample: The 40 randomly selected 
envelopes. 


4.3 Population: All local businesses. Sample: The 73 businesses that return the questionnaire. 


4.5 (a) A convenience sample. (b) The estimate of 7.2 hours is probably less than the true average because 
students who arrive first to school had to wake up earlier and may have gotten less sleep than those 
students who are able to sleep in. 


4.7 Voluntary response sample; it is likely that those customers who volunteered to leave reviews feel 
strongly about the hotel, often due to a negative experience. As a result, the 26% from the sample is likely 
greater than the true percentage of all the hotel’s customers who would give the hotel 1 star. 

4.9 Voluntary response sample; it is likely that the true proportion of constituents who oppose the bill is 
less than 871/1128. 


4.11 (a) Number the 40 students from 01 to 40 alphabetically. Moving left to right along a line from the 
random digit table, record two-digit numbers, skipping any numbers that are not between 01 and 40 and 
any repeated numbers, until you have 5 different numbers between 01 and 40. Select the corresponding 5 
students. (b) Johnson (20), Drasin (11), Washburn (38), Rider (31), and Calloway (07). 


4.13 (a) Number the plots from 1 to 1410. Use the command randInt(1,1410) to select 141 different 
integers from 1 to 1410. Select the corresponding 141 plots. (b) Answers will vary. 


4.15 (a) It is not practical to identify and number every tree in the park. (b) Trees along the main road are 
more likely to be damaged by cars and people and may be more susceptible to infestation, leading to an 
overestimate. (c) The percentage is unlikely to be exactly 35% because of sampling variability. 

4.17 Stratified random sampling might be preferred in this context because the employees’ opinions might 
be the same within each type of employee (servers, kitchen staff), but differ across employee types. Using 
employee type as the strata will help provide a more precise estimate of the overall proportion who 
approve of the no-tipping policy. Select an SRS of 15 employees who are servers and 15 employees who 
work in the kitchen to form the overall sample. 


4.19 No; in an SRS, each possible sample of 250 engineers is equally likely to be selected. However, the 
method described restricts the sample to having exactly 200 males and 50 females. 

4.21 (a) Because satisfaction with the property is likely to vary depending on the location of the room, we 
should stratify by floor and view. From each floor, randomly select 2 rooms with each view. Using a 
stratified random sample would assure that the manager got opinions from each type of room and would 
provide a more precise estimate of customer satisfaction. (b) Using floors as clusters, survey the 
registered guest in every room on each of 3 randomly selected floors. This would be a simpler option 
because the manager would need to survey guests on only three floors instead of all over the hotel. 


4.23 (a) Cluster sampling (b) In an SRS, the company would have to visit individual homes all over the 
rural subdivision. With the cluster sampling method, the company has to visit only 5 locations, saving 


time and money. 


4.25 Students who do not live in the dorms cannot be part of the sample. Some would live off campus and 
therefore be less likely to eat on campus than those students who live in the dorm, so the director’s 
estimate for the percent of students who eat regularly on campus will likely be too high. 

4.27 People who did not lose much weight (or who gained weight) after participating in the course may be 
less likely to respond to the survey. This will likely produce an estimated weight loss that is too large, as 
the people who responded to the survey probably lost more weight than those who did not respond. 


4.29 We would not expect many people to claim they have run red lights when they have not, but some 
people will deny running red lights when they have. Thus, the proportion of drivers obtained in the 
sample who admitted to running a red light is likely to be less than the proportion who have actually run a 
red light. 

4.31 When asked in person, the boys may claim that they have never cried during a movie (when in 
reality, they have) because they are embarrassed or ashamed to admit it to the girls asking the survey 
question. Boys who were given an anonymous survey are more likely to be honest about their 


experiences. 


4.33 (a) The wording is clear, but the question is slanted in favor of warning labels because the first 
sentence states that some cell-phone users have developed brain cancer. (b) The question is clear, but it is 
slanted in favor of national health insurance by asserting it would reduce administrative costs and not 
providing any counter-arguments. The phrase “do you agree” also pushes respondents toward the desired 
response. (c) Not clear; for those who do understand the question, it is slanted because it suggests reasons 
why one should support recycling. It could be rewritten as: “Do you support economic incentives to 


promote recycling?” 
4.35 c 
4.37d 
4.39 d 


4.41 (a) The predicted number of points scored decreases by 4.084 points for each additional turnover. (b) 
The actual number of points scored is typically about 57.3 points away from the number of points 
predicted by the least-squares regression line with x = number of turnovers. (c) 
y\=460.2-4.084(17)=390.772,9 = 460.2 — 4.084(17) = 390.772, «othe residual is 
y—-y=238-390.772=-152.772.Y y = 238 — 390.772 = —152.772. The number of points scored by 
the San Francisco 49ers was 152.772 points less than predicted based on their number of turnovers. (d) 
Because their point falls below the least-squares regression line and is to the left of the mean number of 
turnovers, their point decreases the y-intercept and increases the slope, making the slope closer to 0 (i.e., 
less negative). Because the 49ers’ point is farther from the line than the rest of the points in the data set, 
this point increases the standard deviation of the residuals. 


Section 4.2 

Answers to Check Your Understanding 

page 245: 1. Experiment because a treatment (brightness of screen) was imposed on the laptops. 

2. Observational; students were not assigned to eat a particular number of meals with their family per 
week. 

3. Explanatory: The number of meals per week teens ate with their families. Response: GPA (or some 
other measure of their grades). 


4. Students who have part-time jobs may not be able to eat many meals with their families and may not 
have much time to study, leading to lower grades. So we can’t conclude that not eating with their family 
is the cause. 

page 250: 1. The patients and the researchers know who is receiving which treatment. This knowledge 
could motivate some people to take other measures (e.g., exercising more or eating better in general) that 
would also influence their heart health. 


2. Use a double-blind experiment; the patients would not know which treatment they received, nor would 
the researchers know what treatment each patient received. 


page 255: 1. A control group would show how much electricity customers tend to use naturally. This 
would serve as a baseline to determine how much less electricity is used in each of the treatment groups. 
2. Number the houses from 1 to 60. Write the numbers 1 to 60 on slips of paper. Shuffle well. Draw out 
20 slips of paper (without replacement). Those households will receive a display. Draw out another 20 
slips of paper (without replacement). Those households will receive a chart. The remaining 20 households 


will receive only information about energy consumption. 


3. To create groups of households that are roughly equivalent at the beginning of the experiment; this will 
ensure that the effects of other variables (e.g., the thrifty inclination of some households) are spread 
evenly among the three groups. 


page 261: 1. Number the volunteers 1 to 300. Use a random number generator to produce 100 different 


random integers from 1 to 300 and show the first advertisement to the volunteers with those numbers. 
Generate 100 additional different random integers from 1 to 300 and show the second advertisement to 
the volunteers with those numbers. The remaining 100 volunteers will view the third advertisement. 
Compare the effectiveness of the advertisements for the three groups. 


2. Because the effectiveness of the ads may depend on a volunteer’s familiarity with Jane Austen, block 
by whether or not the individuals are familiar with the works of Jane Austen. The volunteers in each block 
would be numbered and randomly assigned to one of three treatment groups: One-third of the volunteers 
in each block would view the first advertisement, one-third would view the second advertisement, and 
one-third would view the third advertisement. After viewing the advertisements, the researchers would 
gauge the effectiveness of the advertisements. 

3. A randomized block design accounts for the variability in effectiveness that is due to subjects’ 
familiarity with the works of Jane Austen. This makes it easier to determine the effectiveness of the three 
different advertisements. 


Answers to Odd-Numbered Section 4.2 Exercises 

4.43 Although eating seafood may decrease the risk of colon cancer, it is possible that the physicians who 
ate seafood were also more likely to exercise. Because exercise might decrease colon cancer risk, perhaps 
the exercise caused the decrease in colon cancer risk, not eating seafood. 

4.45 Explanatory: Type of program the people watched. Response: Number of calories consumed. This 
was an experiment because the treatments (20 minutes of The Island, 20 minutes of The Island without 
sound, and 20 minutes of Charlie Rose) were deliberately imposed on the students. 


4.47 (a) Explanatory: Amount of time in child care from birth to age 41%. Response: Adult ratings of their 
behavior. (b) This was an observational study because children were not assigned to attend child care for 
a prescribed amount of time. (c) No, this is an observational study, so we cannot make a cause-and-effect 
conclusion. For example, children who spend more time in child care probably have less time with their 
parents and get less instruction about proper behavior. 

4.49 Experimental units: Pine seedlings; Treatments: Full light, 25% light, and 5% light. 

4.51 (a) The factors are (1) information provided by interviewer, which had 3 levels, and (2) whether the 
caller offered survey results, which had 2 levels. (b) 6 treatments (c) Answers may vary. Here are two 
treatments: (1) giving name/no survey results; (2) identifying university/no survey results. 


4.53 Explanatory: Type of snack that was eaten (berries or candy). Response: Amount of pasta consumed 
(measured by the number of calories consumed). Experimental units: Women; Treatments: 65 calories of 
berries or 65 calories of candy. 

4.55 The control group can be used to show how changes in pain and joint stiffness progress over the 
three years naturally. If a control group was not used, the treatment could possibly be deemed ineffective 
at improving the symptoms, when in reality it may be effective at preventing the symptoms from 


worsening. 


4.57 There was no control group. We do not know if this was a placebo effect or if the flavonols actually 
affected the blood flow. To make a cause-and-effect conclusion possible, we need to randomly assign 
some subjects to get flavonols and others to get a placebo. 


4.59 Yes; if the treatment (ASU or placebo) assigned to a subject was unknown to both the subject and 


those responsible for assessing the effectiveness of that treatment. If subjects knew they were receiving 
the placebo, their expectations would differ from those who received the ASU. Then it would be 
impossible to know if a decrease in pain was due to the difference in expectations or the ASU. It is 
important for the experimenters to be blind so that they will be unbiased in the way that they interact and 
assess the subjects. 


4.61 Because the experimenter knew which subjects had learned the meditation techniques, he or she is 
not blind. If the experimenter believed that meditation was beneficial, he or she may subconsciously rate 
subjects in the meditation group as being less anxious. 

4.63 (a) Write each name on a slip of paper, put them in a container and mix thoroughly. Pull out 40 slips 
of paper and assign these subjects to Treatment 1. Then pull out 40 more slips of paper and assign these 
subject to Treatment 2. The remaining 40 subjects are assigned to Treatment 3. (b) Assign the students 
numbers from 1 to 120. Using the command RandInt(1,120), generate 40 unique integers from 1 to 120, 
and assign the corresponding students to Treatment 1. Then generate an additional 40 unique integers 
from 1 to 120 and assign the corresponding students to Treatment 2. The remaining 40 students are 
assigned to Treatment 3. (c) Assign the students numbers from 001 to 120. Pick a spot on Table D and 
read off the first 40 unique numbers between 001 and 120. The students corresponding to these numbers 
are assigned to Treatment 1. The students corresponding to the next 40 unique numbers between 001 and 
120 are assigned to Treatment 2. The remaining 40 students are assigned to Treatment 3. 


4.65 The coach’s plan does not include random assignment. Perhaps the more motivated players will 
choose the new method. If they improve more by the end of the study, the coach cannot be sure if it was 
the exercise program or player motivation that caused the improvement. 

4.67 (a) Researchers used a design that compared infants who were assigned to one of three treatments. 
(b) Random assignment helps to create three groups of infants who are roughly equivalent at the 
beginning of the study. This ensures that the effects of other variables (e.g., genetics) are spread evenly 
among the three groups of infants. (c) Birthweight and whether or not the baby was born prematurely; it is 
beneficial to control these variables, because otherwise they would provide additional sources of 
variability that would make it harder to determine the effectiveness of the treatments. (d) Having about 17 
infants in each group makes it easier to rule out the chance variation in random assignment as a possible 
explanation for the differences observed in intelligence, language, and memory by age 4. 


4.69 (a) Expense and condition of the patient; if a patient is in very poor health, a doctor might choose not 
to recommend surgery because of the added complications. Then, if the non-surgery treatment has a 
higher death rate, we will not know if it is because of the treatment or because the initial health of the 
subjects was worse. (b) Write the names of all 300 patients on identical slips of paper, put them in a hat, 
and mix them well. Draw out 150 slips and assign the corresponding subjects to receive surgery. The 
remaining 150 subjects receive the new method. At the end of the study, count how many patients 
survived in each group. 


4.71 (a) Because smaller trees are likely to have fewer oranges than medium or large trees, form blocks 
based on the size of the trees (small, medium, large). Then randomly assign one-third of the small trees to 
fertilizer A, one-third to fertilizer B, and one-third to fertilizer C. Do the same for the other two blocks. In 
the end, measure each tree’s orange production. (b) In a completely randomized design, the differences in 
tree size will increase the amount of variability in number of oranges for each treatment. However, a 


randomized block design would help us account for the variability in number of oranges that is due to the 
differences in tree size. This will make it easier to determine if one fertilizer is better than the others. 


4.73 (a) The blocks are the different diagnoses (e.g., asthma) because the treatments (doctor or nurse- 
practitioner) were assigned to patients within each diagnosis. (b) Using a randomized block design allows 
us to account for the variability due to differences in diagnosis by comparing the results within each 
block. In a completely randomized design, the variability due to differences in diagnoses will be 
unaccounted for and will make it harder to determine if there is a difference in health and satisfaction due 
to the difference between doctors and nurse-practitioners. (c) Advantage: There would be no variability in 
the response variables introduced by differences in diagnosis. Disadvantage: We would only be able to 
make conclusions about health and satisfaction with their medical care for diabetic patients like the ones 
in the experiment. 

4.75 (a) If all rats from litter 1 were fed Diet A and if these rats gained more weight, we would not know 
if this was because of the diet or because of genetics and initial health. (b) For each litter, randomly assign 
half of the rats to receive Diet A and the other half to receive Diet B. Number the rats in the first litter 
from 1 to 10. Use the command randInt(1,10) to select 5 different integers from 1 to 10. Select the 
corresponding 5 rats and assign them to Diet A. The other 5 rats in this litter will be assigned to Diet B. 
Repeat the same process for the second litter of rats. (c) Answers will vary. 


4.77 (a) Write the names of all 30 students on identical slips of paper, put them in a hat, and mix them 
well. Draw out 15 slips and assign the corresponding students to take the online SAT preparation 
program. The remaining 15 students will take the in-person SAT preparation class. At the end of the 
study, compare the average improvement in SAT score for each group. (b) Create pairs of similar students 
based on their previous SAT score (the two students with the highest SAT scores are paired together, and 
so on). Within each pair, one student is randomly assigned to the online preparation class and the other is 
assigned to the in-person preparation class. For each student, record the SAT improvement after receiving 
the treatment. (c) The matched pairs design is preferred. By matching students by initial score, we can 
account for the variability in improvement that is introduced by the variability in student ability, making it 
easier to determine which program is more effective. 

4.79 (a) This is a matched pairs design because each subject was assigned both treatments. (b) Some 
students are more distractible than others. In a completely randomized design, the differences between the 
students will add variability to the response variable, making it harder to detect if there is a difference 
caused by the treatments. In a matched pairs design, each student is compared with himself (or herself), so 
the differences between students are accounted for. (c) If all the students used the hands-free phone during 
the first session and performed worse, we would not know if the better performance during the second 
session was due to the lack of phone or to learning from their mistakes. By randomizing the order, some 
students will use the hands-free phone during the first session and others will use it during the second 
session. (d) The simulator, route, driving conditions, and traffic flow were all kept the same for both 
sessions. That way, the researchers are preventing these variables from adding variability to the response 
variable. 


4.81 (a) If the students find a difference between the two groups, they will not know if it is due to gender 
or due to the deodorant. (b) Use a matched pairs design. In this case, each student would have one armpit 
randomly assigned to receive deodorant A and the other to receive deodorant B. Because each person uses 
both deodorants, there is no longer any confounding between gender and deodorant. Also, the paired 


design accounts for the variability between individuals, making it easier to see any difference in the 
effectiveness of the two deodorants. 


4.83 c 

4.85 b 

4.87 c 

4.89 b 

4.91 (a) (i) z = —0.23; 1 — 0.4090 = 0.5910 (ii) normalcdf(lower: 500, upper: 10000, mean: 525, SD: 110) 
= 0.5899 (b) (i) Solving ~1,28=x-525110 +28 its os gives x = 384.2. (ii) invNorm(area: 0.10, mean: 


525, SD: 110) = 384.0 


Section 4.3 

Answers to Check Your Understanding 

page 277: Because the individuals were not randomly selected and this is an observational study and not 
an experiment, we can only conclude that, for the athletes in the study, those who were removed from 
play immediately recovered more quickly, on average, than athletes who continued to play. 


Answers to Odd-Numbered Section 4.3 Exercises 

4.93 (a) Because different random samples will include different students and produce different estimates, 
it is unlikely that the sample result will be the same as the proportion of all students at the school who use 
Twitter. (b) An SRS of 100 students; estimates tend to be closer to the truth when the sample size is 
larger. 


4.95 (a) Yes; it is plausible that the true proportion could be as small as 0.37 — 0.031 = 0.339 or as large 
as 0.37 + 0.031 = 0.401 and 0.50 is not in this interval. (b) Increase the number of adults in the sample. 
4.97 (a) If we repeatedly take random samples of size 124 from a population of couples that have no 
preference for which way they kiss, the number of couples who kiss the “right way” in a sample varies 
from about 45 to 76. (b) Yes; in the study, 83 couples kissed the “right way”—much higher than what we 
would expect to happen by chance alone. In the simulations, the largest number of couples kissing the 
“right way” was 76. 


4.99 (a) 4.68 — 4.21 = 0.47 days (b) Assuming the type of clipboard does not matter, there was one 
simulated random assignment where the difference (A — B) in the mean number of days for the two 
groups was 0.72. (c) Because a difference of means of 0.47 or higher occurred 16 out of 100 times in the 
simulation, the difference is not statistically significant. It is quite plausible to get a difference this big 


simply due to chance variation in the random assignment. 


4.101 (a) To make sure that the two groups were as similar as possible before the treatments were 
administered. (b) The difference in the percent of women who received acupuncture and became pregnant 
and those who lay still and became pregnant was large enough to conclude that the difference was not 
likely due to the chance variation created by the random assignment to treatments. (c) Because the women 
were aware of which treatment they received, we do not know if their expectations or the treatment was 
the cause of the increase in pregnancy rates. 


4.103 Because this study involved random assignment to the treatments (foster care or institutional care), 


we can infer that the difference in being in foster care or institutional care caused the difference in 
response. However, the results should only be applied to children like the ones in the study, because these 
136 children were not randomly selected from a larger population. 


4.105 Because the subjects were not randomly assigned to attend religious services (or not), we cannot 
infer cause and effect. However, this study involved a random sample of adults so we can make an 
inference about the population of adults. It appears that adults who attend religious services regularly have 
a lower risk of dying, but we do not know that attending religious services is the cause of the lower risk. 
4.107 Because this study does not involve random assignment to the treatments (amount of anthocyanins 
consumed), we cannot infer that the difference in blueberry and strawberry intake caused the difference in 
heart attack risk. In addition, we should only apply the results to women like the ones in this study 
because these 93,600 women were not randomly selected from a larger population. 


4.109 Answers will vary. 

4.111 Those Facebook users involved in the study did not know that they were going to be subjected to 
treatments, and they did not provide informed consent before the study was conducted. 

4.113 The responses to the GSS are confidential. The person giving the survey knows who is answering 
the questions because s/he is at that person’s home, but will not share the results with anyone else. 
4.115 In this case, the subjects were not able to give informed consent. They did not know what was 
happening to them and they were not old enough to understand the ramifications in any event. 

4.117d 


4.119 (a) 
100 =——_ 
fa Strongly 
80 + disagree 
> és fd Disagree 
Ps 7 CJ Neither 
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|__| Strongly 
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(b) Yes, there is an association between gender and opinion. Men are more likely to view animal testing 
as justified if it might save human lives: over two-thirds of men agree or strongly agree with this 
statement, compared to slightly less than half of the women. The percentages who disagree or strongly 
disagree tell a similar story: 16% of men versus 30% of women. 


Answers to Chapter 4 Review Exercises 

R4.1 (a) Population: All adult U.S. residents. Sample: The 805 adult U.S. residents interviewed. (b) Even 
though the sample size is very large, it is unlikely that the percentage in the entire population would be 
exactly the same as the percentage in the sample because of sampling variability. (c) A larger random 
sample is more likely to get a sample result close to the true population value. 

R4.2 (a) One possible answer: Announce in daily bulletin that there is a survey concerning student 
parking available in the main office for students who want to respond. Because generally only those who 


feel strongly about the issue respond to a voluntary response survey, the opinions of respondents may 
differ from the population as a whole, resulting in an inaccurate estimate. (b) One possible answer: 
Personally interview a group of students as they come in from the parking lot. Because these students use 
the parking lot, their opinions may differ from the population as a whole, resulting in an inaccurate 
estimate. (c) Write the names of all 1800 students on identical pieces of paper. Place the slips of paper 
into a hat. Shuffle well. Draw out 50 names. Those students will form the SRS of 50 students from the 
school. (d) An SRS reduces bias by selecting the sample in such a way that every group of 50 individuals 
in the population has an equal chance to be selected as the sample. 


R4.3 (a) You would have to identify 10% of the seats, go to those seats in the arena, and find the people 
who are sitting there. (b) It would be better to use the lettered rows as the strata because each lettered row 
is the same distance from the court and so would contain only seats with the same (or nearly the same) 
ticket price. Within sections, ticket prices vary quite a bit. (c) Survey all fans in several randomly selected 
sections (clusters); people in a particular numbered section are in roughly the same location, making it 
easy to administer the survey. Furthermore, the people in each cluster reflect the variability found in the 
population, which is ideal. 

R4.4 (a) When the interviewer provides the additional information that “box-office revenues are at an all- 
time high,” the listeners may believe that they contributed to this fact and be more likely to overestimate 
the number of movies they have seen in the past 12 months. Eliminate this sentence. (b) A sample that 
uses only residential phone numbers is likely to underrepresent younger adults who use only cell phones. 
If younger adults go to movies more often than older adults, the estimated mean will be too small. (c) 
People who do not go to the movies very often might be more likely to respond to the poll because they 
are at home. Because the frequent moviegoers will not be at home to respond, the estimated mean will be 
too small. 


R4.5 (a) The data were collected after the anesthesia was administered. Hospital records were used to 
“observe” the death rates, rather than imposing different anesthetics on the subjects. (b) Explanatory: 
Type of anesthetic. Response: Whether or not a patient died. (c) One variable that might be confounded 
with choice of anesthetic is type of surgery. If anesthesia C is used more often with a type of surgery that 
has a higher death rate, we would not know if the death rate was higher because of the anesthesia or the 
type of surgery. 

R4.6 (a) Experimental units: Potatoes. The factors are the storage method (3 levels) and time from slicing 
until cooking (2 levels). There are six treatments: (1) freshly picked and cooked immediately, (2) freshly 
picked and cooked after an hour, (3) stored at room temperature and cooked immediately, (4) stored at 
room temperature and cooked after an hour, (5) stored in refrigerator and cooked immediately, (6) stored 
in refrigerator and cooked after an hour. (b) Using 300 identical slips of paper, write “1” on 50 of them, 
write “2” on 50 of them, and so on. Put the papers in a hat and mix well. Then select a potato and 
randomly select a slip from the hat to determine which treatment that potato will receive. Repeat this 
process for the remaining 299 potatoes, making sure not to replace the slips of paper into the hat. (c) 
Benefit: The quality of the potatoes should be fairly consistent, reducing a source of variability. 
Drawback: The results of the experiment could then only be applied to potatoes that come from that one 
supplier rather than to potatoes in general. (d) Use a randomized block design with the suppliers as the 
blocks. For each supplier, randomly assign potatoes to the 6 treatments. Doing so would allow the 
researchers to account for the variability in color and flavor due to differences in the initial quality of the 


potatoes from different suppliers, making it easier to estimate how the treatments affect color and flavor 
of the French fries. 


R4.7 (a) No; the 1000 students were not randomly selected from any larger population, so we should 
apply the results only to students like those in the study. (b) Yes; the students were randomly assigned to 
the three treatments, so we can conclude that the reduction in cold symptoms was caused by the masks. 
R4.8 (a) If all of the patients received the St. John’s wort, the researchers would not know if any 
improvement was due to the St. John’s wort or to the expectations of the subjects (the placebo effect). By 
giving some patients a treatment that should have no effect at all but that looks, tastes, and feels like the 
St. John’s wort, the researchers can account for the placebo effect by comparing the results for the two 
groups. (b) To create two groups of subjects that are roughly equivalent at the beginning of the 
experiment. (c) The subjects should not know which treatment they are getting so that the researchers can 
account for the placebo effect. Also, the researchers should be unaware of which subjects received which 
treatment so they cannot consciously (or subconsciously) influence how the results are measured. (d) In 
this context, “not statistically significant” means that the difference in improvement between the St. 
John’s wort and placebo groups was not large enough to rule out the variability caused by the random 
assignment as the explanation. 


R4.9 (a) Use 30 identical slips of paper and write the name of each subject on a slip. Mix the slips in a hat 
and select 15 of them at random. These subjects will be assigned to Group 1. The remaining 15 will be 
assigned to Group 2. After the experiment, compare the time estimates of Group 1 with those of Group 2. 
(b) Each student does the activity twice, once with the easy mazes, and once with the hard mazes. For 
each student, randomly determine which set of mazes is used first. To do this, flip a coin for each subject. 
If it’s heads, the subject will do the easy mazes followed by the hard mazes. If it’s tails, the subject will do 
the hard mazes followed by the easy mazes. After the experiment, compare each student’s “easy” and 
“hard” time estimate. (c) The matched pairs design would be more likely to detect a difference because it 
accounts for the variability between subjects. 


R4.10 (a) No, because the subjects did not know the nature of the experiment before they agreed to 
participate. (b) All individual data should be kept confidential and the experiment should go before an 
institutional review board before being implemented. 


Answers to Chapter 4 AP® Practice Test 
T4.1¢ 
T4.2 e 
T4.3d 
T4.4c 
T4.5 d 
T4.6e 
T4.7 a 
T4.8d 
T4.9d 
T4.10 e 


T4.11 d 

[4.12 (a) Experimental units: Acacia trees. (b) This allows the researchers to measure the effect of active 
hives and empty hives on tree damage compared to no hives at all. (c) Assign the trees numbers from 01 
to 72 and use a random number table to pick 24 different two-digit numbers between 01 and 72. Those 
trees will get the active beehives. The trees associated with the next 24 different two-digit numbers will 
get the empty beehives and the remaining 24 trees will remain empty. Compare the damage caused by 
elephants to the trees with active beehives, those with empty beehives, and those with no beehives. 

T4.13 (a) Population: All U.S. students in grades 7-12. Sample: The 1673 U.S. students in grades 7-12 
surveyed. (b) Random selection reduces the effects of bias due to self-selection and also allows the results 
to be inferred to a larger population. In this case, students who respond to an online survey might be more 
interested in computer science, leading to an estimate that is too high. (c) No, because of sampling 
variability. (d) The estimate of 54% because it was based on a sample size of 1673, whereas the other two 
results were based on smaller sample sizes. Larger sample sizes should yield results that are closer to the 
truth about the population. 

[4.14 (a) Explanatory: The treatment given (caffeine capsule or placebo). Response: The time it takes to 
complete the tapping task. (b) Each of the 11 individuals will be a block in a matched pairs design. Each 
participant will take the caffeine tablets on one of the two-day sessions and the placebo on the other. The 
blocking was done to account for individual differences in dexterity. (c) After the first trial, subjects might 
practice the tapping task and do better the second time. If all the subjects got caffeine the second time, the 
researchers would not know if the increase was due to the practice or the caffeine. (d) Yes, if neither the 
subjects nor the people who come in contact with them during the experiment (including those who record 
the number of taps) have knowledge of the order in which the caffeine or placebo was administered. 


Answers to Cumulative AP® Practice Test 1 
AP1.1d 
AP1.2e 
AP1.3b 
AP1.4¢c 
AP1.5e 
AP1.6c 
AP1.7e 
AP1.8e 
AP1.9d 
AP1.10d 
AP1.11d 
AP1.12b 
AP1.13 a 
AP1.14a 


AP1.15 (a) The distribution of gains for subjects using Machine A is roughly symmetric, while that for 
subjects using Machine B is skewed to the left (toward the smaller values). Neither distribution appears to 


contain any outliers. The center of the distribution of gains for subjects using Machine B (median = 38) is 
greater than that for subjects using Machine A (median = 28). The distribution of gains for subjects using 
Machine B (range = 57, IQR = 22) is more variable than that for subjects using Machine A (range = 32, 
IQR = 15). (b) Machine B because its median gain (38) is greater than it is for Machine A (28), as is the 
mean (x B=35.4 versus x A=28.9% 8 = 35.4 versus ¥ 4 = 28.9) (©) Machine A because it exhibits less 
variation in gains than does Machine B. The IQR for Machine A (15) is less than the IQR for Machine B 
(22). Additionally, the standard deviation for Machine A (9.38) is less than the standard deviation for 
Machine B (16.19). (d) The experiment was conducted at only one fitness center. Results may vary at 
other fitness centers in this city and in other cities. If the company wants to broaden their scope of 
inference, they should randomly select people from the population they would like to draw an inference 
about. 


AP 1.16 (a) Randomly assign 30 retail sales districts to the monetary incentives treatment and the 
remaining 30 retail sales districts to the tangible incentives treatment [see part (b) for method]. After a 
specified period of time, record the change in sales for each district and compare the mean change for 
each of the treatment groups. (b) Number the 60 retail sales districts with a two-digit number from 01 to 
60. Using a table of random digits, read two-digit numbers until 30 unique numbers from 01 to 60 have 
been selected. These 30 districts are assigned to the monetary incentives group and the remaining 30 to 
the tangibles incentives group. Using the digits provided, the districts labeled 07, 51, and 18 are the first 
three to be assigned to the monetary incentives group. (c) Matching the districts based on their size 
accounts for the variation among the experimental units due to their size on the response variable—sales 
volume. Pair the two largest districts in size, the next two largest, down to the two smallest districts. For 
each pair, pick one of the districts and flip a coin. If it’s heads, this district is assigned to the monetary 
incentives group. If it’s tails, this district is assigned to the tangible incentives group. The other district in 
the pair is assigned to the other group. After a specified period of time, record the change in sales for each 
district and compare within each pair. 

AP 1.17 (a) There is a very strong, positive linear association between sales and shelf length. (b) 
y\=317.94+152.68x,Y = 317.94 + 152.682, where yy = predicted weekly sales (in dollars) and x = 


shelf length (in feet). (c) y§=317.94+152.68(5)=1081.34Y = 317-94 + 152.68 (5) = 1081.34 (4) The 
actual weekly sales (in dollars) is typically about $22.9212 away from the weekly sales predicted by the 
least-squares regression line with x = shelf length (in feet). (e) About 98.2% of the variation in weekly 


sales revenue can be accounted for by the least-squares regression line with x = shelf length (in feet). 
—5—0 5—0 


AP1.18 (a) (i) 2=-5-022.92=-0.22 and 2=5-022.92=0.22;7 ~ asz — —9-22 and z= s555 = 0.22; 
0.5871 — 0.4219 = 0.1652 (ii) normalcdf(lower: —5, upper: 5, mean: 0, SD: 22.92) = 0.1727 (b) (i) 

7 -196=2° © - 196= 22 | 
Solving —1.96=x-022.92 22.92 gives x = —$44.92. Solving 1.96=x-022.92 22.92 gives x 


= $44.92. (ii) invNorm(area: 0.025, mean: 0, SD: 22.92) = -$44.92 and invNorm(area: 0.975, mean: 0, 
SD: 22.92) = $44.92 


The middle 95% of residuals should be between —$44.92 and $44.92. 


If 5 linear feet are allocated to the store’s brand of men’s grooming products, the weekly sales revenue 
can be expected to be between $1036.08 and $1125.92 (1081 + 177; 44.92). 


Chapter 5 
Section 5.1 


Answers to Check Your Understanding 

page 304: 1. Interpretation: If you take a very large random sample of Pedro’s commutes, about 55% of 
the time the light will be red when Pedro reaches the light. 

2. (a) This probability is 0. If an outcome can never occur, then it will occur in 0% of the trials. (b) This 
probability is 1. If an outcome will occur on every trial, then it will occur in 100% of the trials. (c) This 
probability is 0.001. An outcome that occurs in 0.1% of the trials is very unlikely, but will occur every 
once in a while. (d) This probability is 0.6. An outcome that occurs in 60% of the trials will happen more 
than half the time. Also, 0.6 is a better choice than 0.99 because the wording suggests that the event 
occurs often but not nearly every time. 


3. The doctor is wrong because the sex of the next baby born is a random phenomenon that is 
unpredictable in the short run, even though it has a regular predictable pattern in the long run. So while 
approximately 50% of all babies born will be male, even after a couple has seven girls in a row, the 
probability of the eighth child being a girl is still 50%. 

page 307: 1. To carry out a simulation to estimate the probability that a 50% shooter who takes 30 shots 
in a game would have a streak of 10 or more shots, begin by assigning digits to the outcomes. Let 1 = 
make a shot and let 2 = miss a shot. Generate 30 random integers from 1 to 2 to simulate taking 30 shots. 
Record whether or not there are a series of at least 10 “makes” in a row among the 30 shots. Perform 
many trials of this simulation. See what percent of the time there is a streak of at least 10 “makes” among 
the 30 shots. 


2. There were two trials that resulted in 9 consecutive “makes” among the 30 shots. 


3. Based on the dotplot, there was only 1 trial out of 50 simulated games in which the player had at least 
10 “makes” in a row among the 30 shots. If the player’s shot percentage truly is 50% and if the result of a 
shot truly does not depend on previous shots, then in 50 trials we would expect this player to have a streak 
of 10 or more makes only about 2% of the time (1 out of 50). Because this phenomenon is unlikely to 
happen strictly due to chance alone, and the announcer observed the player making 10 shots in a row ina 
recent game, the announcer is justified in claiming that the player is streaky. 


Answers to Odd-Numbered Section 5.1 Exercises 

5.1 (a) If you take a very large random sample of times that Aaron tunes into his favorite radio station, 
about 20% of the time a commercial will be playing. (b) No, chance behavior is unpredictable in the short 
run, but has a regular and predictable pattern in the long run. The value 0.20 describes the proportion of 
times that a commercial will be playing when Aaron tunes in to the station in a very long series of trials. 


5.3 (a) If you take a very large random sample of women with breast cancer, about 10% of the time the 
mammogram will indicate that the woman does not have breast cancer when, in fact, she does have breast 
cancer. (b) A false negative is a more serious error because a woman with breast cancer will not get 
potentially life-saving treatment. A false positive would result in temporary stress until a more thorough 


examination is performed. 


5.5 In the short run, there was quite a bit of variability in the percentage of made 3-point shots. However, 
the percentage of made 3-point shots became less variable and approached 0.30 as the number of shots 


increased. 


5.7 (a) The wheel is not affected by its past outcomes—it has no memory. So on any one spin, black and 
red remain equally likely. (b) The gambler is wrong again. Removing a card changes the composition of 
the remaining deck. If you hold 5 red cards, the deck now contains 5 fewer red cards, so the probability of 
being dealt another red card decreases. 

5.9 (a) To carry out a simulation to estimate the score that Luke will earn on the quiz, begin by assigning 
digits to the outcomes. For each question, let 1 = guessed correctly and let 2, 3, and 4 = guessed 
incorrectly. Generate 10 random integers from 1 to 4 to simulate the result of Luke’s guess for each of the 
10 questions. Record the number of correct guesses (1’s). Repeat this three times and record the highest of 
the three scores. (b) There was one trial that resulted in a highest quiz score of 1 correct response out of 
10. (c) Based on the simulation, the probability that Luke passes the quiz (scores at least a 6 out of 10) is 
5/50 = 10%. Five of the 50 trials show a score of at least 6. (d) Yes; it is unlikely that Doug would score 
higher than 6 out of 10 based on randomly guessing the answers. Since Doug scored 8 points out of 10, 
there is reason to believe that Doug does understand some of the material. 


5.11 (a) To use a table of random digits to carry out the simulation, number the first class passengers 01— 
12 and the other passengers 13-76. Ignore all other numbers. Moving left to right across a row, look at 
pairs of digits until you have 10 unique numbers (no repetitions, because you do not want to select the 
same person twice). Count the number of two-digit numbers between 01 and 12. Perform many trials of 
this simulation. Determine what percent of the trials showed no first class passengers selected for 
screening. (b) The numbers, read in pairs, are: 71 48 70 99 84 29 07 71 48 63 61 68 34 70 52. The bold 
numbers indicate people who have been selected. The other numbers are either too large (over 76) or have 
already been selected. There is one person among the 10 selected who is in first class in this sample 
(underlined). (c) No, there is not convincing evidence that the TSA officers did not carry out a truly 
random selection. If the selection is truly random, there is about a 15% chance that no one in first class 
will be selected. So it is not surprising that a single random selection would contain no first class 
passengers. 

5.13 (a) To carry out this simulation using slips of paper, label 10 identically sized pieces of paper 0-9. 
Let 0—5 represent hitting the center of the target and let 6—9 represent not hitting the center of the target. 
Place the slips of paper into a hat and mix well. Draw out one slip of paper at a time and record the digit. 
Replace the paper and mix well. Repeat this process until Quinn misses the center of the target. Determine 
whether she remained in the competition for at least 10 shots. (b) To carry out this simulation using a 
random digits table, let 0-5 represent hitting the center of the target and let 6—9 represent not hitting the 
center of the target. Moving left to right across a row, read single digits and record whether she hit the 
center of the target or not. Repeat this process until Quinn misses the center of the target. Determine 
whether she remained in the competition for at least 10 shots. (c) To carry out this simulation using a 
random number generator, let 0O—5 represent hitting the center of the target and let 6—9 represent not 
hitting the center of the target. Generate a random integer from 0 to 9 and record whether she hit the 
center of the target or not. Repeat this process until Quinn misses the center of the target. Determine 
whether she remained in the competition for at least 10 shots. 


5.15 This is a legitimate simulation. By letting 1 = feel addicted and 2 = doesn’t feel addicted, the 


simulation would model the claim that 50% of U.S. teens with smartphones feel addicted to their devices. 


5.17 This is not a legitimate simulation. When simulating the selection of homework assignments, the 
selection process would occur without replacement. The slips of paper should not be placed back in the 
hat. 

5.19 To conduct a simulation to estimate the probability that we would have to randomly select 20 or 
more U.S. adult males to find one who is red-green color-blind, let 0-6 = color-blind and let 7-99 = not 
color-blind. Use technology to pick an integer from 0 to 99. Continue picking integers until we get a 
number between 0 and 6. Count how many numbers there are in the sample. Repeat this process many 
times. Determine the percent of the time we would have to randomly select 20 or more U.S. adult males to 
find one who is red—green color-blind. 


5.21 (a) In the simulation, 43 of the 200 samples yielded a sample proportion of at least 0.55. Obtaining a 
sample proportion of 0.55 or higher is not particularly unusual when 50% of all students recycle. (b) Only 
1 of the 200 samples yielded a sample proportion of at least 0.63. This means that if 50% of all students 
recycle, we would see a sample proportion of at least 0.63 in only about 0.5% of samples. Because getting 
a sample proportion of at least 0.63 is very unlikely, we have convincing evidence that the percentage of 
all students who recycle is larger than 50%. 
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5.29 (a) This survey displays undercoverage because people who are younger than 50 years old did not 
have a chance to participate in the survey. Younger people generally don’t have as many prescription 
drugs and might be less willing than older people to support a program like this. The 75% who answered 
“Yes” is likely an overestimate of the true proportion of Americans who support the program. (b) 
Including the additional information of “can be helped” might have encouraged respondents to say “Yes” 
to the program because they liked the idea of helping people. This would cause the 75% estimate to be 
higher than if the question were worded more neutrally. 


Section 5.2 
Answers to Check Your Understanding 


page 318: 1. Events A and B are mutually exclusive because a person cannot have a cholesterol level of 
both 240 or above and between 200 and less than 240 at the same time. 


2. The event “A or B” is a person who has either a cholesterol level of 240 or above or they have a 
cholesterol level between 200 and less than 240. P(A or B) = P(A) + P(B) = 0.16 + 0.29 = 0.45 


3. P(C) = 1 - P(A or B) = 1 - 0.45 = 0.55 

page 321: 1. The probability that the person is an environmental club member is (212 + 77 + 16)/1526 = 
305/1526 = 0.1999 = 19.99%. 

2. P(not a snowmobile renter) = (445 + 212 + 297 + 16)/1526 = 970/1526 = 0.6356 = 63.56%. 


3. P(environmental club member and not a snowmobile renter) = (212 + 16)/1526 = 228/1526 = 0.1494 = 
14.94%. 


4. The probability that the person is not an environmental club member or is a snowmobile renter is 


85.06%, which is (445 + 497 + 279 + 77)/1526 = 1298/1526 = 0.8506 = 85.06%. 


Answers to Odd-Numbered Section 5.2 Exercises 


5.31 (a) The following table shows the possible outcomes in the sample space. Each of the 16 outcomes 


I 
will be equally likely and have probability 116. 16° 


First roll 


1 (1,1) (2,1) (3,1) (4,1) 

Second 2 (1,2) (2,2) (3,2) (4,2) 
roll 3 (1,3) (2,3) (3,3) (4,3) 

4 (1,4) (2,4) (3,4) (4,4) 


(b) P(A) = 0.25. There are four ways to get a sum of 5 from these two dice: (1, 4), (2, 3), (3, 2), (4, 1). The 


a 
probability of getting a sum of 5 is p(a)=416=0.25 7 4) = ae ae 


5.33 (a) The sample space is: Connor/Declan, Connor/Lucas, Connor/Piper, Connor/Sedona, 
Connor/Zayne, Declan/Lucas, Declan/Piper, Declan/Sedona, Declan/Zayne, Lucas/Piper, Lucas/Sedona, 
Lucas/Zayne, Piper/Sedona, Piper/Zayne, and Sedona/Zayne. Each of these 15 outcomes will be equally 
likely and will have probability 1/15. (b) There are 9 outcomes in which Piper or Sedona (or both) get to 
go to the show: Connor/Piper, Connor/Sedona, Declan/Piper, Declan/Sedona, Lucas/Piper, Lucas/Sedona, 
Piper/Sedona, Piper/Zayne, Sedona/Zayne. Define Event A as Piper or Sedona (or both) get to go to the 
show. Then P(A) = 9/15 = 0.60. 


5.35 (a) This is a valid probability model because each probability is between 0 and 1 and the 
probabilities sum to 1. (b) P(student won’t win extra homework) = 1 — 0.05 = 0.95. There is a 0.95 
probability that a student will not win extra homework. (c) P(candy or homework pass) = 0.25 + 0.15 = 
0.40. There is a 0.40 probability that a student wins candy or a homework pass. 

5.37 (a) The probabilities of all the possible outcomes must add to 1. The given probabilities add up to 
0.25 + 0.32 + 0.07 + 0.03 + 0.01 = 0.68. This leaves a probability of 1 — 0.68 = 0.32 for P(3 or 4). 
Because the probability of finding 3 people in a household is the same as the probability of finding 4 
people (and they are mutually exclusive), each probability must be 0.32/2 = 0.16. (b) P(more than 2 
people) = 1 — P(1 or 2 people) = 1 — (0.25 + 0.32) = 0.43. This could also be found using the addition rule 
for mutually exclusive events. P(more than 2 people) = P(3 or 4 or 5 or 6 or 7+) = 0.16 + 0.16 + 0.07 + 
0.03 +0.01 = 0.43. 


5.39 (a) The given probabilities have a sum of 0.72 and the sum of all probabilities should be 1. Thus, the 
probability that a randomly chosen young adult has some education beyond high school but does not have 
a bachelor’s degree is 1 — 0.72 = 0.28. There is a 0.28 probability that a young adult has some education 
beyond high school but does not have a bachelor’s degree. (b) Using the complement rule, P(at least a 
high school education) = 1 — P(has not finished high school) = 1 — 0.13 = 0.87. There is a 0.87 probability 
that a young adult has completed high school. (c) P(young adult has further education beyond high 
school) = P(young adult has some education beyond high school but does not have a bachelor’s degree) + 
P(young adult has at least a bachelor’s degree) = 0.28 + 0.30 = 0.58. The probability rule used is the 
addition rule for mutually exclusive events. 


205 
5.41 (a) P(B®) = P(student does not eat breakfast regularly) = 295595=0.496 595 — hee (b) P(F and 


165 _ 
BC) = P(student is a female and does not eat breakfast regularly) = 165595=0.277. 595 0.27. 


Interpretation: If we select a student from the school at random, the probability that the student will be 


female and does not eat breakfast regularly is 0.277. (c) P(F and B°) = P(student is a female or does not 
110+165+130 _ 9 egy 
eat breakfast regularly) = 110+165+130595=0.681 595 — 


5.43 (a) P(not age 18 to 34 and not own an iPhone) = 189+100+277+6432024=12092024=0.597 


189+100+277+643 1209 
—a = 2004 — 9-597 
(b) P(age 18 to 34 or owns iPhone) = 169+214+134+171+1272024=8152024=0.403 
169+214+134+171+127 815 0.403 

2024 ai) eats 
5.45 (a) 

Black Not black Total 

Even 10 10 20 
Not even 8 10 18 
Total 18 20 38 


= 3S 4: a 
(b) P(B)=1838=0.474:P(E)=2038-0.526" (B) ~ 38 0.474; P(E) 38 280 (c) The event “B and 


E” would be that the ball lands in a spot that is black and even. P(B and E)=1038=0.263 


_ 10 _ 9 9¢: 
PIBand B) = sy = 0.265 (d) The probability of the event “B or E” means the probability of landing in 


a spot that is either black, even, or both. If we add the probabilities of landing in a black spot and landing 
in an even spot, the spots that are black and even will be double counted because events B and E are not 
mutually exclusive. P(B and E)=1838+2038-1038=2838=0.737. 

P(BandE) = = + 2 - 3 = = = 0.737. 

5.47 P(own a dog or a cat) = P(own a dog) + P(own a cat) — P(own a dog and a cat) = 0.40 + 0.32 - 0.18 
= 0.54 

5.49 (a) We are given the following information: P(M) = 0.6, P(GX = 0.67, P(G and M) = 0.23. 
Therefore, P(G) = 1 — 0.67 = 0.33. P(G U M) = P(G) + P(M) - P(G n M) = 0.33 + 0.6 — 0.23 = 0.70. 
Interpretation: The probability of randomly selecting a student from among those who were part of the 
census that is a graduate student or uses a Mac is 0.70. (b) P(GS al M°) =1-P(GUM)=1-0.7=0.3 


5.51 (a) A Venn diagram is shown here. 
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(b) P(D n C°) = 0.22 
5.53 The general addition rule states that P(A U B) = P(A) + P(B) - P(A n B). We know that P(A) = 0.3, 
P(B) = 0.4, and P(A U B) = 0.58. Therefore, 0.58 = 0.3 + 0.4 - P(A n B). Solving for P(A n B) gives 


P(A n B) = 0.12. 
5.55e 
5.57e 


5.59 (a) The scatterplot for the average crawling age and average temperature is given here. 
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In this scatterplot, there appears to be a moderately strong, negative linear relationship between average 
temperature and average crawling age. (b) The equation for the least-squares regression line is 
yA=35.68-0.0777x,Y = 35.68 — 0.07772, where yy is the predicted average crawling age and x is the 
average temperature in degrees Fahrenheit. (c) The slope = —0.0777. Interpretation: The predicted 
average crawling age decreases by 0.0777 weeks for each additional 1-degree increase in the average 
temperature in degrees Fahrenheit. (d) We cannot conclude that warmer temperatures six months after 
babies are born causes them to crawl sooner because this was an observational study and not an 
experiment. We cannot draw conclusions of cause and effect from observational studies. 


Section 5.3 
Answers to Check Your Understanding 


P(N|E) = 22 = 0.695. 


page 334: 1. P(N|E)=212305=0.695. 305 Given that a survey respondent belongs to 


an environmental organization, there is about a 69.5% chance that he or she never used a snowmobile. 
212+-77 : 89 © 
P(E|S°) = 22+ — 28 — 9.235 


657+-57- 1231 


2. P(E|SC)=212+77657+574=2891231=0.235 
3. P(EC|N)=445657=0.677. P(EC|renter)=497574=0.866. P(EC|owner)=279295=0.946. 


Cc — So he C — (FF 7 Bin. = : 
P(E” |N) = = 0.677. P(E” |renter) = en 0.866. P(E* |owner) aie 0.946. The chosen 


person is more likely to not be an environmental club member if he or she owns a snowmobile. 

page 337: 1. Events A and B are independent. Because we are putting the first card back and shuffling the 
cards before drawing the second card, knowing what the first card was will not help us predict what the 
second card will be. 

2. Events A and B are not independent. Once we know the suit of the first card, then the probability of 
getting a heart on the second card will change depending on what the first card was. 

3. The two events, “Female” and “Got an A,” are independent. Once we know that the chosen person is 
female, this does not help us predict if she got an A or not. Overall, 4/28 or 1/7 of the students got an A on 
the quiz. And, among the females, 3/21 or 1/7 got an A. So P(got an A) = P(got an A | female). 


page 343: 1. 


ee Desktop 
California <—0 Lapto 
gp, California aptop 
Y ™~ 
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Computers < 

\ RAS Desktop 


~ 
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2. P(tablet) = P(California n tablet) + P(Texas n tablet) = (0.40)(0.45) + (0.60)(0.70) = 0.60. The 
probability of selecting a tablet is 0.60. 


3. P(California|tablet)=P(Californiantablet)P(tablet)=(0.4)(0.45)0.60=0. 180.60=0.30. 


: a _ P(California’ tablet) _ (0,4)(0.45) _ 0.18 
P(California|tablet) = a oe er 0.30 


’ Given that a tablet was selected, 
there is a 0.30 probability that it was made in California. 

page 347: 1. P(train arrives late on Monday and on time on Tuesday) = P(Arrives late on Monday) - 
P(Arrives on time on Tuesday) = (0.10)(0.90) = 0.09 

2. P(train arrives late at least once in a 5-day week) = 1 — P(train never arrives late in a 5-day week) = 1 - 
(0.90)? = 0.41 

3. P(full-time college student and age 65 or older) # (0.08)(0.15) because “Full-time college student” and 
“Adults age 65 or older” are not independent events. Knowing that a person is a full-time college student 
decreases the likelihood that he or she is an adult age 65 or older. 


Answers to Odd-Numbered Section 5.3 Exercises 


= “A — 0,92 
5.61 (a) P(TIE)=44200=0.22P (TIE) a2 Given that the child is from England, there is a 0.22 


probability that the student selected the superpower of telepathy. 


(b) P(E|SC)=54+52+30+4499+96+67+110=180372=0.484 99+96+67+110 372 


Given that the child did not choose superstrength, there is a 0.484 probability that the child is from 
England. 


P(female|about right) = ae = 0.655. eu 


5.63 (a) P(female|about right)=560560+295=0.655. 
that the person perceived his or her body image as about right, there is a 0.655 probability that the person 
is a female. (b) P(not overweight| F)=560+37560+163+37=597760=0.786. 


‘ 560+37 597 
P(not overweight! F) = a = — = 0.786. _. : ; 
( ght| F) 560+163+37 760 Given that the person selected is female, there is a 


0.786 probability that she did not perceive her body image as overweight. 
5.65 (a) P(is studying other than English) = 1 — P(none) = 1 — 0.59 = 0.41. There is a 0.41 probability that 
the student is studying a language other than English. 


P(Spanishother than English) = a = 0.6341. 


(b) P(Spanishother than English)=0.260.41=0.6341. 
Given that the student is studying some language other than English, there is a 0.6341 probability that he 
or she is studying Spanish. 


5.67 P(B) < P(B | T) < P(T) < P(T | B). There are very few professional basketball players, so P(B) 


should be the smallest probability. If you are a professional basketball player, it is quite likely that you are 
tall, so P(T | B) should be the largest probability. Finally, it’s much more likely to be over 6 feet tall (B) 


than it is to be a professional basketball player if you’re over 6 feet tall (B | T). 
_ P(DNC) _ 0.18 


P(D|C) = 5 = 37 — 0.563. 
5.69 Method 1: P(D|C)=P(DnC)P(C)=0.180.32=0.563. — PO) we Given that a 
household owns a cat, there is a 0.563 probability that the household owns a dog. Method 2: Use a Venn 
P(D|C) = rer = oo = 0.563 


diagram. P(D|C)=0.180.18+0.14=0.180.32=0.563 


Dior Ses Se & 
Tr fs * 

/ / \ 

{ | \ \ 


| 0.22 | 0.18 0.14 | 


i \ a ; Pi 


0.46 


5.71 P(homeowner) = 340/500 = 0.68; P(homeowner | high school graduate) = 221/310 = 0.713. Because 
these probabilities are not equal, the events “Homeowner” and “High school graduate” are not 
independent. Knowing that the person is a high school graduate increases the probability that the person is 
a homeowner. 

5.73 (a) P(iPhone | 18-34) = 169/517 = 0.327. Given that the adult is aged 18—34, there is a 0.327 
probability that the person owns an iPhone. (b) First we determine P(iPhone) = 467/2024 = 0.231. 
Because these probabilities are not equal, the events “Own an iPhone” and “Aged 18—34” are not 
independent. Knowing that the person is aged 18—34 increases the probability that the person owns an 
iPhone. 

5.75 There are 36 different possible outcomes of the two dice: (1, 1), (1, 2), ... , (6, 6). Let’s assume that 
the second die is the green die. There are then 6 ways for the green die to show a 4: (1, 4), (2, 4), (3, 4), 
(4, 4), (5, 4), (6, 4). Of those, there is only one way to get a sum of 7, so P(sum of 7 | green is 4) = 1/6 = 
0.1667 Overall, there are 6 ways to get a 7: (1, 6), (2, 5), (3, 4), (4, 3), (5, 2), (6, 1). So P(sum of 7) = 6/36 
= 0.1667 Because these two probabilities are the same, the events “Sum of 7” and “Green die shows a 4” 
are independent. Knowing that the green die shows a 4 does not change the probability that the sum is 7. 
5.77 P(download music) = 0.29, P(don’t care | download music) = 0.67, P(download music n don’t care) 
= P(download music) - P(don’t care | download music) = (0.29)(0.67) = 0.1943 = 19.43%. About 19.43% 
of Internet users download music and don’t care if it is copyrighted. 


5.79 P(all three candies have soft centers) = 
P(1st soft) - P(2nd soft | 1st soft) - P(3rd soft | Ist and 2nd soft) = 1420-1319-1218=21846840=0.319. 


M13, 2 _ 218% _ 9 919 
20 19 18 = 6840 ~~" There is a 0.319 probability that 3 randomly selected candies from a Gump 


box will all have soft centers. 


5.81 (a) A tree diagram is shown here. 


02% Credit card 
Regular <_ 0.75 
gasoline “= No credit card 
0% Credit card 
Mid-grade —_0.66 . 
gasoline — No credit card 


Customer 


4) 
gat Credit card 
Premium <—_0.59 
gasoline ~~ No credit card 


(b) P(credit card) = (0.88)(0.28) + (0.02)(0.34) + (0.10)(0.42) = 0.295. There is a 0.295 probability that 
the customer paid with a credit card. (c) P(premium gasoline | credit card) = 
P(premium gasolinencredit card)P(credit card)=(0.10)(0.42)0.295=0.0420.295=0.142. 


P(premium gasoline( credit card) (0.10)(0,42) 0.042 
— Pleredit card) ~~ 0.205 0295 — O-142. eee | 
Kexediit card) oe _ Given that the customer paid with a credit 


card, there is a 0.142 probability that the customer bought premium gas. 
5.83 We need to determine P(missed his first serve | won the point). Here is a tree diagram. 


g.16_ Wins point 
rn " . 
yp — Makes = 24 Doesn't 
Andy Murray we first-serve win point 
acta Dey e 
— ee 954_ Wins point 
Misses <p 
~246 Doesn't 


_ <.. 
first-serve ~ 
win point 


P(missed his first serve | won the point) = P(missed his first servenwon the point)P(won the point)=(0.40) 


(0.54)(0.60)(0.76)+(0.40)(0.54)=0.2160.672=0.32. 


P(missed his first servewon the point) —_ (0.40)(0.54) _— 0.216 _ 0.32 
P(won the point) ~ (0,60)(0.76)+(0.40)(0.54) 0.672 


" Given that he won the point, 
there is a 0.32 probability that he missed his first serve. 


5.85 We need to determine P(antibody | positive). Here is a tree diagram. 


oe * 
Antibodies <0 
‘ 90 
ow present Sis 
Person 2.99 me + 
~ Antibodies <~ g 
present =e. 


P(antibody|positive)=P(antibody n positive)P(positive)=(0.01)(0.9985)(0.01)(0.9985) 1 (0.99) 
. . er _ P(antibodyM positive) _ (0.01)(0.9985) = 
(0.006)=0.6270.7 ees ipostttve) _ P(positive) ~ (0.01)(0.9985) 1 (0.99)(0.006) 0.6270. 


Given that the EIA test is positive, there is a 0.627 probability that the person has the antibody. 


5.87 (a) P(contributed) = (0.5)(0.4)(0.8) + (0.3)(0.3)(0.6) + (0.2)(0.1)(0.5) = 0.16 + 0.054 + 0.01 = 0.224. 
There is a 0.224 probability that a potential donor contributed to the charity. (b) 
P(recent donor|contribute)=P(recent donorn contribute)P(contribute)=0.160.224=0.7143. 


, P{recent donor contribute) ; 
P(recent donor|contribute) = ————————— = 2% = 0.7143. 


P(contribute) 0.224 Given that the person 


contributes to the charity, there is a 0.7143 probability that the person was a recent donor. 


5.89 The probability that the string of lights will remain bright for 3 years is (0.98)7° = 0.6676. 


5.91 P(none are late) 
= P(Ast is not late and 2nd is not late and ... and 20th is not late) 


= P(Ast is not late) - P(2nd is not late) - ... - P(20th is not late) 
= (0.90) (0.90) ... (0.90) 

= (0.90)? 

= 0.1216 


P(at least 1 late) = 1 — P(mone are late) = 1 — 0.1216 = 0.8784 

5.93 We cannot simply multiply the probabilities together. If a woman is over 55 years old, it is unlikely 
that she is pregnant. These events are not independent. 

5.95 (a) P(all 4 calls are for medical help) 

= P(1st is medical and 2nd is medical and 3rd is medical and 4th is medical) 

= P(1st is medical) - P(2nd is medical) - P(3rd is medical) - P(4th is medical) 

= (0.81)(0.81)(0.81)(0.81) 

= (0.81)* 

= 0.430 

There is a 0.430 probability that all 4 calls are for medical help. 

(b) P(at least 1 not for medical help) 

= 1 — P(all 4 calls are for medical help) 

= 1 - 0.430 

= 0.570 

There is a 0.570 probability that at least one of the 4 calls is not for medical help. (c) The calculation in 
part (a) might not be valid because the 4 consecutive calls being medical are not independent events. 
Knowing that the first call is medical might make it more likely that the next call is medical (e.g., several 
people might call for the same medical emergency). 


5.97 (a) 
Gender 
Male Female Total 
Blue eyes 10 
Eye color Brown eyes 40 
Total 50 


The events “Student is male” and “Student has blue eyes” are mutually exclusive because they don’t 


occur at the same time. 


(b) 


Blue eyes 


Eye color Brown eyes 16 24 40 
Total 20 30 50 


If the event “Student is male” and the event “Student has blue eyes” are independent, then: 


P(male)=P(male|blue) + 2050=x10 + x=4/ (male) = P(male|blue) ” 50 - 0 tte 
(c) Answers may vary. 
Gender 
Male Female Total 
Eye color Blue eyes 10 
Brown eyes 40 
Total 50 


The events “Student is male” and “Student has blue eyes” are not mutually exclusive because they can 
occur at the same time (for 5 students). The events “Student is male” and “Student has blue eyes” are not 
independent because P(male)4#* P(male | blue). P(male)=20/50=0.4" (male) = 20/50 = 0.4, 
P(male|blue)=5/10=0.5" (male|blue) = 5/10 = 0.5, 


5.99 Two events are independent if P(ANB)=P(A):P(B). (AMB) = P(A)- P(B). It is given that 
P(A) = 0.3, P(B) = 0.4, and P(AnB)=0.12.P (AB) = 0-12. Because 0.12 = (0.3)(0.4), events A and B 
are independent. 

5.101 (a) There are 6 ways to get doubles out of 36 possibilities so P(doubles)=6/36=0.167. 

P (doubles) = 6/36 = 0.167. There is a 0.167 probability of getting doubles on a single toss of the dice. 
(b) Because the rolls are independent, we can use the multiplication rule for independent events. 

P(no doubles firstn doubles second)=/ (n° doubles first doubles second) = 


P(no doubles first)-P(doubles second)=(30/36)(6/36)=0.139. 

P (no doubles first) - P (doubles second) = (30/36) (6/36) = 0.139. 

There is a 0.139 probability of not getting doubles on the first toss and getting doubles on the second toss. 
(c) P(first doubles on third roll)=" (first doubles on third roll) = 


P(no doubles)-P(no doubles)-P(doubles)=56(56)(16)=25216 
P (no doubles) - P (no doubles) - P (doubles) = 2 (2) (4) es 


6\6) \6 216 
= 0.116. There is a 0.116 probability that the first doubles occurs on the third roll. (d) For the first doubles 
on the fourth roll, the probability is 


3 -\4 

5 l 5 l 
nae a ( : . For the first doubles on the fifth roll, the probability is uaek ( e . The 
probability that the first doubles are rolled on the 


k-1 
5 ) 
kth roll is eaidee ¢ ( os 
5.103 c 
5.105 e 


18,5—26.8 


-y ieee Ot 


5.107 (a) 2=18.5-26.87.4=-1.12" — 
(i) The proportion of z-scores below —1.12 is 0.1314. 
(ii) normalcdf(lower: —1000, upper: 18.5, mean: 26.8, SD: 7.4) = 0.1314. About 13.14% of young women 


are underweight by this cri- terion. (b) Note that P(not underweight)=1—P(underweight)= 
P (not underweight) = 1 — P (underweight) = ;_¢ 131=0.g691 — 0.131 = 0.869. P(at least one is 


underweight) = 1 — P(none are underweight) =1-0.8692=0.24481 0.869° = 0.2448 There is a 0.2448 
probability that at least one of the two women will be classified as underweight. 


Answers to Chapter 5 Review Exercises 

R5.1 (a) If you take a very large random sample of pieces of buttered toast and dropped them from a 2.5- 
foot-high table, about 81% of them will land butter side down. (b) No; if four dropped pieces of toast all 
landed butter side down, it does not make it more likely that the next piece of toast will land with the 
butter side up. Chance behavior is unpredictable in the short run, but it has a regular and predictable 
pattern in the long run. The value 0.81 describes the proportion of times that toast will land butter side 
down in a very long series of trials. 

R5.2 (a) To use a table of random digits to perform the simulation, begin by letting 00-80 = butter side 
down and 81—99 = butter side up. Moving left to right across a row, look at pairs of digits to simulate 
dropping one piece of toast. Read 10 such pairs of two-digit numbers to simulate dropping 10 pieces of 
toast. Record the number of pieces of toast out of 10 that land butter side down. Determine if 4 or fewer 
pieces of toast out of 10 landed butter side down. Repeat this process many times. (b) Bold numbers 
indicate that the toast landed butter side down. 


Trial 1: 29 07 71 48 63 61 68 34 70 52. Did 4 or fewer pieces of toast land butter side down? No. 
Trial 2: 62 22 45 10 25 95 05 29 09 08. Did 4 or fewer pieces of toast land butter side down? No. 
Trial 3: 73 59 27 51 86 87 13 69 57 61. Did 4 or fewer pieces of toast land butter side down? No. 


(c) Assuming the probability that each piece of toast lands butter side down is 0.81, the estimate of the 
probability that dropping 10 pieces of toast yields 4 or less butter side down is 1/50 = 0.02. There is 
convincing evidence that the 0.81 claim is false. Maria does have reason to be surprised, as these results 
are unlikely to have occurred purely by chance. 

R5.3 (a) The sample space is: rock/rock, rock/paper, rock/scissors, paper/rock, paper/paper, 
paper/scissors, scissors/rock, scissors/paper, and scissors/scissors. Because each player is equally likely to 
choose any of the three, each of these 9 outcomes will be equally likely and have probability 1/9. (b) 
There are three outcomes—rock/scissors, paper/rock, and scissors/paper—where player 1 wins on the first 


play. Define event A as player 1 wins on the first play. P(A) =3/9=0.333/9 = 0.33, 


R5.4 (a) The probability that it is a crossover is 1 — 0.46 — 0.15 — 0.10 — 0.05 = 0.24. The sum of the 
probabilities must add to 1. (b) P(vehicle is not an SUV or a minivan) = 0.46 + 0.15 + 0.24 = 0.85. The 
probability that the vehicle is not an SUV or a minivan is 0.85 


(c) P(pickup truck | not a passenger car) = 0.150.15+0.10+0.24+0.05=0.150.54=0.278 


0.15 — O15 _ 
o1sr010+0247008 ~ 054 ~ 2-278 


Given that the vehicle is not a passenger car, there is a 0.278 probability that the vehicle is a pickup truck. 


R5.5 (a) P(drives an SUV) = 39/120 = 0.325. The probability that the person drives an SUV is 0.325. (b) 
25+20+154+12 __ 
P(drives a sedan or exercises) = 25+20+15+12120=0.60 120 7 al The probability that the 


person drives a sedan or exercises for at least 30 minutes four or more times per week is 0.60. 


25+15  __ 40 __ 9 7¢g 
(c) P(does not drive a truck | exercises) = 25+1525+15+12=4052=0.769 25+ 15+12 52 “"“ Given 


that the person exercises for at least 30 minutes four or more times per week, there is a 0.769 probability 


that the person does not drive a truck. 

R5.6 (a) The events “Thick-crust pizza” and “Pizza with mushrooms” are not mutually exclusive. It is 
possible for a pizza to be thick-crust and have mushrooms. (b) Let’s organize the given information in a 
table. 


Thick crust Thin crust Total 
Mushrooms 2 4 6 
No mushrooms 1 2 3 
Total 3 6 9 


P(mushrooms) = 6/9 = 0.667 and P(mushrooms | thick crust) = 2/3 = 0.667. Because P(mushrooms) = 
P(mushrooms | thick crust), the events “Mushrooms” and “Thick crust” are independent. (c) Use the 
general multiplication rule. P(both randomly selected pizzas have mushrooms) = P(first has mushrooms): 
P(second has mushrooms | first has mushrooms) =(6/9)(5/8)=0.417(8/9) (5/8) = 0.417 J¢ you randomly 
select 2 of the pizzas in the oven, the probability that both have mushrooms is 0.417. 


R5.7 (a) A tree diagram is shown here. 


Test result 
oe 4 
ow Drugs <0, 19 _ 
a) oO 4 
; ae —<09s = 


(b) P(drug test result is positive) = (0.04)(0.90) + (0.96)(0.05) = 0.084. The probability that the drug test 
result is positive is 0.084. 
(c) P(uses illegal drugs | positive result) = P(uses illegal drugs positive result)P(positive result)=(0.04) 
P(uses illegal drugsMpositive result) (0.04)(0.90) 
~ P(positive result) | 0.084 0.429 , 
(0.90)0.084=0.429 omeiye ae) Given that the drug test result is 
positive, there is a 0.429 probability that person uses illegal drugs. 
R5.8 P(at least 1 believes that finding and picking up a penny is good luck) = 1 — P(none of the 10 people 
believe this) 
= 1 - (0.67)!° = 0.98177 
The probability of selecting 10 U.S. adults and finding at least 1 who believes that finding and picking up 
a penny is good luck is 0.98177. 


Answers to Chapter 5 AP® Practice Test 


13.1 
T5.2 b 
Th.3¢ 
T5.4b 
T5.4°¢ 
io.6a 
T5.7b 
15.6.4 
V5.9 e 
T5.10e 


T5.11 (a) P(student eats regularly in the cafeteria and is not a 10th grader) 
130+122+68 _ 320 _ gq a9 
=130+122+68805=320805=0.398 805 805 "~~. There is a probability of 0.398 that a 


randomly selected student eats regularly in the cafeteria and is not a 10th grader. (b) P(10th grader | eats 
regularly in the cafeteria) = 175/495 = 0.354. Given that the student eats regularly in the cafeteria, there is 
a 0.354 probability that he or she is a 10th grader. (c) P(10th grader) = 209/805 = 0.260. The events “10th 
grader” and “Eats regularly in the cafeteria” are not independent because P(10th grader | eats regularly in 
the cafeteria)#7 P(10th grader). 

T5.12 Let’s organize this information using a tree diagram. 


01° Defective 


A <O% 
ef ox 


i‘) 
. J 30 eu Defective 


Part < B <_ 0% 
. —~ OK 
4 
oA 
< go Defective 
C <_0¢6 
~ 0 ok 


(a) To get the probability that a part randomly chosen from all parts produced in this factory is defective, 
add the probabilities from all branches in the tree that end in a defective part. P(defective) = (0.60)(0.10) 


+ (0.30)(0.30) + (0.10)(0.40) = 0.06 + 0.09 + 0.04 = 0.19. (b) P(Machine B | defective) =(0.30) 
(0.30)(0.30) _ 0.09 _ 0.474 
(0.30)0.19=0.090.19=0.474 919 = 0.19 “*"" “, Given that a part is inspected and found to be 


defective, there is a 0.474 probability that it was produced by Machine B. 

T5.13 (a) P(customer will pay less than the usual cost of the buffet) = 23/36 = 0.639. There is a 0.639 
probability that the customer will pay less than the usual cost of the buffet. (b) P(all 4 friends end up 
paying less than the usual cost of the buffet) = (23/36)* = 0.167. There is a 0.167 probability that all 4 of 
these friends end up paying less than the usual cost of the buffet. (c) P(at least 1 of the 4 friends ends up 
paying more than the usual cost of the buffet) = 1 — 0.167 = 0.833. There is a 0.833 probability that at 
least 1 of the 4 friends ends up paying more than the usual cost of the buffet. 

T5.14 (a) To carry out the simulation using a table of random digits, let 00-16 = cars with out-of-state 
plates and let 17—99 = other cars. Moving left to right across a row, look at pairs of digits from a random 
number table until you have found two numbers between 00 and 16 (repeats are allowed). Record how 


many two-digit numbers you had to read in order to get 2 numbers between 00 and 16. Repeat this process 
many times. (b) The first sample is 41 05 09. The bold numbers represent cars with out-of-state plates. In 
this sample, it took three cars to find two with out-of-state plates. The second sample is 20 31 06 44 90 50 
59 59 88 43 18 80 53 11. In this sample, it took 14 cars to find two with out-of-state plates. The third 
sample is 58 44 69 94 86 85 79 67 05 81 18 45 14. In this sample, it took 13 cars to find two with out-of- 
state plates. 


Chapter 6 

Section 6.1 

Answers to Check Your Understanding 

page 365: 1. Using probability notation, the event “Student got a C” is written as P(x=2)P (X= 2). 


P(X=2)=1-0.011-0.032-0.362—P (X = 2) = 1 — 0.011 — 0.032 — 0.362—_ 9 457=0.138 
0.457 = 0.138 


2. P(x>3)P (X > 3) is the probability that the student got either an A or a B. This probability is 
P(X23)=0.362+0.457=0.819? (X 2 3) = 0.362 + 0.457 = 0.819 


3. The histogram is left skewed. This means that higher grades are more likely, and there are a few lower 
grades. 


0. 
0.05 = 
0.00 


Probability 

oop i) 
eeNNW > 
Cucuoucwn 
4 4 


X = Student’s grade 
page 371: 1. uX=(0)(0.011)+(1)(0.032)+(2)(0.138)+#2X = (0) (0.011) + (1) (0.032) + (2) (0.138) + (3) 
(0.362)+(4)(0.457)=3.222(3) (0.362) + (4) (0.457) = 3.222 IF many, many students are selected, the 
average grade of the students selected will be about 3.222. 
2. GX2=(0-3.222)2(0.011)+(1-3.222)2(0.032)+7x = (0 — 3.222)" (0.011) + (1 — 3.222)° (0.032) + 
(2-3.222)2(0.138)+(2 — 3-222)” (0.138) + (3-3.999)9(0.362)+(4-3.222)2(0.457) 
(3 — 3.222)” (0.362) + (4 — 3.222)” (0.457). gx 9=9,77277% = 9-7727 So 6x=0.7727=0.879 


ax = V0.7727 = 0.879 The grade received by a randomly selected student will typically vary from the 
mean (3.222) by about 0.879 points. 


Answers to Odd-Numbered Section 6.1 Exercises 

6.1 (a) X=5; P(X=5)=0.11% = 55 P(X = 8) = 0.11 gy pex3)-0,.72P (XS 8) = 0.72 

6.3 (a) P(Y<20)=0.31" (Y < 20) = 0.31. there is a 0.31 probability that the amount of money collected 
on arandomly selected ferry trip is less than $20. (b) y>20¥ = 20. P(Y>20)=0.69" (Y > 20) = 0.69 
6.5 


Probability 


Y = Money collected ($) 


Skewed to the left with a single peak at $25 collected. 


6.7 WY=19.35F¥ = 19.35. if many, many ferry trips are randomly selected, the average amount of 
money collected will be about $19.35. 

6.9 (a) Right-skewed distribution; the most likely first digit is 1, and each subsequent digit is less likely 
than the previous digit. (b) ux=3.441H2 = 3.441. in many, many randomly chosen legitimate records, the 
average of the first digit would be about 3.441. 


6.11 (a) 

Money collected 0 5 10 15 20 25 
Probability 0.02 0.05 0.008 016 0.27 #0.42 
Cumulative Probability 0.02 0.07 0.15 0.31 0.58 1 


The median of Y is $20. (b) The mean of Y is less than the median of Y because the probability 
distribution is skewed left. 

6.13 oY=6.4297Y = 6.429; the money collected on a randomly selected ferry trip will typically vary 
from the mean ($19.35) by about $6.43. 


6.15 ox=2.462 ; the first digit of a randomly chosen record will typically vary 
from the mean (3.441) by about 2.462. 
6.17 (a) The company has collected $1250 and must pay out $100,000. The company earns 
$1250-$100,000=—-$98, 750 
. (b) pY=$303.35 
Hy = $303.35 ; if many, many males are insured by this company, the average amount the company 
would make, per person, will be about $303.35. (c) oY=$9,707.57 
; the amount that the company earns on a randomly 

selected policy will typically vary from the mean ($303.35) by about $9,707.57. 
6.19 (a) Both distributions are skewed to the right. The center for the “household” distribution is less than 
the center for the “family” distribution, but the variability of the household distribution is greater than the 
variability of the family distribution. Also, the event X=1 has a much higher probability 
in the household distribution. (b) pH=2.6 and pF=3.14 ; 
the household distribution has a slightly smaller mean than the family distribution. (c) oF=1.249 

; the standard deviation for the household distribution is slightly larger than 
for the family distribution. 
6.21 (a) P(Y>6)=0.333; P(X>6)=0.155 


; if an 


expense report contains more than a proportion of 0.155 of first digits that are greater than 6, it may bea 
fake expense report. (b) The mean is 5 because this distribution is symmetric. (c) To detect a fake expense 
report, compute the sample mean of the first digits and see if it is closer to 5 (suggesting a fake report) or 
near 3.441 (consistent with a truthful report). 


6.23 Length=3600; height=1/3600; probability=300(1/3600)=0.083; 


there is about an 8.3% chance that the request is received within the first 300 seconds (5 minutes) after the 
hour. 


6.25 (a) P(-1<Y<1)=0.4 


; the probability that Mr. Shrager dismisses class within a minute of the end of class is 0.4. (b) pY=1.5; 
the mean is 1.5 because this distribution is symmetric. (c) k=2.75; 


to find this value, we solve the equation (4—k)(1/5)=0.25 


for k. 
6.27 (i)z=-1.50 ; P(Z<-1.50)=0.0668 
(ii) PCY <6)= normalcdf(lower: —1000 , upper: 6, mean: 


7.11, SD: 0.74). There is about a 6.68% chance that this student will run the mile in under 6 minutes. 
6.29 (a) (i) z=-1.83 and z=1.5 ; P(-1.83<Z<1.5)= 
0.9332—-0.0336=0.8996 


(ii) P(325<X<345)= normalcdf(lower: 325, upper: 345, 
mean: 336, SD: 6) = 0.8998. There is about an 89.98% chance that a randomly selected horse pregnancy 
will last between 325 and 345 days. 


(b) (i) 0.20 area to the right of c > z=0.84 . 0.84=c-3366 
= 5.04=c-336 7 
c=341.04 
(ii) invNorm(area: 0.80, mean: 336, SD: 6)=341.05 . About 20% of horses will have 
pregnancies that last more than 341.04 days. 
6.31 b 
6.33.¢ 


6.35 (a) Yes; if we look at the differences (Post — Pre) in the scores, the mean difference was 5.38 and the 
median difference was 3. This means that at least half of the students (though less than three-quarters 


because Q1 was negative) improved their reading scores. (b) No, we do not have a control 
group that did not participate in the chess program for comparison. It may be that children of this age 
improve their reading scores for other reasons (e.g., regular school) and that the chess program had 
nothing to do with their improvement. 


Section 6.2 
Answers to Check Your Understanding 
page 386: 1. The probability distribution of X and the probability distribution of Y are shown here. 


Both distributions are slightly skewed to the right with a single peak. Their shapes are identical. 
2. HY =500(uX)=500(1.1)=$550 


3. oY=500(0.943)=$471.50 


. The bonus earned in the first hour of business on a randomly selected Friday will typically vary from the 
mean ($550) by about $471.50. 


4. Note that T=Y-75 . Shape: The shape of the probability distribution of T 
will be the same as the shape of the probability distribution of Y: skewed right with a single peak. 


Center: uT=p Y—-75=550-75=$475 


Variability: oT=0Y =$471.50 


page 394: 1. pT=pX+pY=1.1+0.7=1.8 


this dealership expects to sell or lease about 1.8 cars in the first hour of business, on average, over Many 
randomly selected Fridays. 


2. Because X and Y are independent, oT2=0X2+oY2=(0.943)2+ 


(0.64)2=1.2988 , 80 0T=1.2988=1.14 


. The total number of cars 
sold or leased in the first hour of a randomly selected Friday will typically vary from the mean (1.8) by 
about 1.14. 


3. The total bonus is B=500X+300Y . This means 
that pB= 500pX+300p Y=500(1.1)+300(0.7)=$760 


. Because X and Y are independent, oB2=(5000X)2+(3000Y)2= 


(500)2(0.943)2+ 


(300)2(0.64)2=259,176.25 


. Therefore, oB=259,176.25= 
$509.09 


Answers to Odd-Numbered Section 6.2 Exercises 


6.37 Approximately Normal; pT=20 minutes; oT=6.5 
minutes 


6.39 (a) 


The distribution of total number of people on the flight is roughly symmetric with a single peak at 42 


people. (b) pX=41.4 ; if many, many flights are randomly selected, the average 
of the total number of people on the flight will be about 41.4 people. (c) oX=1.24 


; the total number of people on a flight will typically vary from the mean 
(41.4) by about 1.24 people. 


6.41 (a) The distribution of M has the same shape as the distribution of X: skewed to the left. (b) 
uM=$19.35 (c) oM=$6.45 
6.43 (a) W represents the amount of time (in minutes) after the start of an hour at which a randomly 


selected request is received by a particular web server. (b) W has a uniform distribution on the interval 
from 0 to 60 minutes. 


6.45 (a) G=5X+50 ; medianG=5(8.5)+50=92.5 


(b) IQRG=5(9-8)=5 


6.47 (a) pY=95(8.5)+32=47.3 


(b) oY=95(2.25)=4.05 


; the 


temperature in the cabin at midnight on a randomly selected night will typically vary from the mean (47.3 


°F) by about 4.05 °F 

(c) (i)z=-1.80 ; P(Z<-1.80)=0.0359 

(ii) PCY <40)= normalcdf(lower: —1000 , upper: 40, mean: 
47.3, SD: 4.05)=0.0357 . There is a 0.0357 probability that the midnight temperature in the 


cabin is below 40°F 


6.49 wS=pX+pY=732.50+825=$1557.50 


; the average of the sum of the tuitions would be about $1557.50 for many, many randomly selected pairs 
of students from each of the campuses. 


6.51 pF-M=120-105=15 ; 
the average of the difference would be about 15 points, if you were to repeat the process of selecting a 
single male student, selecting a single female student, and finding the difference (Female — Male) in their 


scores Many times. 


6.53 [X/50-Y/55=pX/50-pY/55=14.65-15=-0.35 


6.55 (a) Yes; the mean of a sum is always equal to the sum of the means. (b) No; the variance of the sum 
is not equal to the sum of the variances because it is not reasonable to assume that X and Y are 
independent. 


6.57 Because X and Y are independent, oS=1032+126.502=95 = V 103° + 126.50° = $163.139163.13, 
For a student randomly selected from each of the campuses, the sum of their tuitions typically varies from 
the mean ($1557.50) by about $163.13. 

6.59 (a) Independence of F and M means that knowing the value of one student’s score does not help us 
predict the value of the other 


student’s score. (b) OF-M=282+352=44.8229F-M = v 28? + 35° = 44.822. for a randomly selected 
male and female college student, the difference of SSHA scores typically varies from the mean (15) by 
about 44.822 points. (c) No, we do not know the shapes of the distributions. We cannot assume that the 
distributions are Normal without additional information. 


6.61 oD=0X/50-Y/55=(10350)2+(126.5055)2=3.088 
LPN? 2 
op = ox/s0-v/s5 = 4/ (3) + (==) — 3.088 
J a av : the 
standard deviation of the difference D (Main — Downtown) in the number of units that two randomly 
selected students take is 3.088. 


Hux, +x, = $606.70 


6.63 pX1+X2=$606.70 , and because the variables are independent, 


oe 9 Punk ; 
oX1+X2=$13,728.58, W=12(X1+X2)7X1+X2 = $13,728.58. W = 9 (Xi + Xo) 


pw =$303.35 and oW=$6864.29 


, 80 LW=$303.35 


6.65 (a) Normal distribution with mean = 11 + 20 = 31 seconds, and because the variables are 


me 7 a ee 

independent, standard deviation=22+42=4,4721Standard deviation = V2" + 4° = 4.4721 (p) 
(i)2=-0.22% = —0.22; p(z.<-0.22)=0.4129P (4 < —0.22) = 0.4129 

(ii) P(X+Y<30)=normalcdf(lower: ~1000,P(¥ + ¥ < 30) = normaledf(lower: — 1000, upper: 30, 
mean: 31, SD: 4.4721) = 0.4115. There is a 0.4115 probability of completing the process in less than 30 
seconds for a randomly selected part. 


6.67 Let D = L — H; pD=105-98=7HD = 105 — 98 = 7 Because the random variables are independent, 


= ,/(10)” + (15) = 18.03 
oD=(10)2+(15)2=18,0372 ~ V 20)" + (5) 

ae F rey _ P(-5<L-—H<5) = 
7 and a standard deviation of 18.03. We want to find P(—5<L—H<5) or P(-5<D<5) 
P(-5 < D<5) 


. D follows a Normal distribution with a mean of 


(i)z=-0.11 and z=-0.672 = —0.11 and z = —0.67. p(—0.67<Z<-0.11)=0.4562- 
0.2514=0.2048 
(ii) P(-5<D<5) = 
normalcdf(lower: —5, upper: 5, mean: 7, SD: 18.03)=0.2030 . There is a 0.203 probability 
that Lamar and Hareesh finish their jobs within 5 minutes of each other on a randomly selected day. 
6.69 Let T=X1+X2+X3+X4 


; HT=55.2+58.0+56.3+ 
54,7=224,2 . Because the random variables are independent, 
OT=(2.8)2+(3.0)2+(2.6)2+(2.7)2=5.56 


. Thas the Normal distribution with a mean of 224.2 and a standard deviation of 5.56. We want to find 


P(T<220) (i) z=-0.76 ; P(Z<-0.76)=0.2236 


(ii) P(T<220)=normalcdf(lower: —1000, upper: 220, mean: 224.2, SD: 5.56)=0.2250. 


There is a 0.2250 probability that the total team time is less than 220 seconds in a randomly selected race. 
6.71 Let D=X1-X2; pD=1.4-1.4=0 


. Because the random variables are independent, oX1—X2=0.32+0.32=0.4243 


. Dhas the Normal distribution with a mean of 0 and a standard deviation of 0.4243. We want to find 
P(D<-—0.8 or D>0.8) 
(i) z=1.89 and z=—-1.89 ; P(Z<-1.89 or Z>1.89) 
=0.0588 
(ii) P(D<-—0.8 or D>0.8)=1-normalcdf(lower: —0.8, 


upper: 0.8, mean: 0, SD: 0.4243) = 1 -0.9406 = 0.0594. There is a 0.0594 probability that difference is at 
least as large as the attendant observed. 

6.73 Cc 

6.75 (a) Yes; the girls who were assigned to receive the fluoride varnish may expect to get fewer cavities 
and, as a result, might take better care of their teeth. (b) This experiment could be double-blind if the girls 
who are not selected to receive the fluoride varnish received a placebo varnish that would neither help nor 
hurt their teeth. Also, the dental hygienist who checks the girls’ teeth for cavities at the end of the 4 years 
should not know which girls received which treatment. (c) The random assignment should help to make 
the two groups as similar as possible before treatments are administered. 


Section 6.3 
Answers to Check Your Understanding 


2 = 


= get an ace; “Failure 


JF 


page 406: 1. Binary? “Success don’t get an ace. Independent? Because you are 


replacing the card in the deck and shuffling each time, the result of one trial does not tell you anything 
about the outcome of any other trial. Number? n=10 . Same probability? p=4/52 
. This is a binomial setting, and X has a binomial distribution with n=10 
and p=4/52P = 4/52. 
2. Binary? “Success” = over 6 feet; “Failure” = not over 6 feet. Independent? Because we are selecting 
without replacement from a small number of students, the observations are not independent. Number? 


n=5n = 5.Same probability? The (unconditional) probability of success will not change from trial to trial. 
Because the trials are not independent, this is not a binomial setting. 


3. Binary? “Success” = roll a 5; “Failure” = don’t roll a 5. Independent? Because you are rolling a die, the 
outcome of any one trial does not tell you anything about the outcome of any other trial. Number? n=100 
n = 100. Same probability? No; the probability of success changes when the corner of the die is chipped 
off. Because the (unconditional) probability of success changes from trial to trial, this is not a binomial 
setting. 


page 412: 1. Binary? “Success” = question answered correctly; “Failure” = question not answered 
correctly. Independent? Mr. Miller randomly determined correct answers to the questions, so knowing the 
result of one trial (question) should not tell you anything about the result on any other trial. Number? 


n=10 . Same probability? p=0.20 . This is a binomial setting and X has a 
binomial distribution with n=10 and p=0.20 
2. P(X=3)=(103)(0.2)3(0.8)7=0.2013. 


There is about a 20% 
chance that Hannah will answer exactly 3 questions correctly. 
3. P(X>6)=1-P(X<6)=1-P(X<5)=1-binomcdf 


(trials:10, p: 0.2, x value:5)=1—0.9936=0.0064. 


There is only a 0.0064 probability that a student would get 6 or more correct, so we would be quite 


surprised if Hannah was able to pass. 
page 418: 1. 


The probability distribution is skewed to the left with a single peak at Y = 8. 
2. LY=np=10(0.80)=8 


. If many students took the quiz, the average number of questions students would answer incorrectly is 
about 8 questions. 


3. oY=np(1—p)=10(0.80)(0.20)=1.265. 


If many students took the quiz, we would expect individual students’ scores to typically vary from the 
mean of 8 incorrect answers by about 1.265 incorrect answers. 

4. The shape of X is skewed right and the shape of Y is skewed left, both with a single peak. The center 
(mean) of the distribution of X is 2, which is less than the center (mean) of the distribution of Y, which is 
8. (Note that the sum of the means is 10 because X + Y= 10.) The variability of both probability 
distributions is the same (standard deviation = 1.265). 


page 426: 1. Die rolls are independent, the probability of getting doubles is the same on each roll (1/6), 


and we are repeating the chance process until we get a success (doubles). This is a geometric setting and T 
pee 5 
is a geometric random variable with p=16” 6. 


P(T =3) = (3) (+) = 0.1157 — 
2. P(T=3)=(56)2(16)=0.1157 , , . The probability is 11.57% that 
you will get the first set of doubles on the third roll of the dice. 


P(T<3)=1+(%)(4)+(% "G = 0.4213 

3. P(T<3)=16+(56)(16)+(56)2(16)=0.4213 (Ps3) ° (; (; ) ;) . The 
proba- 

bility is 42.13% of getting doubles in 3 or fewer rolls. 


Answers to Odd-Numbered Section 6.3 Exercises 


, — 


6.77 Binary? “Success” = survive; “Failure” = does not survive. Independent? Yes; knowing the outcome 
of one elk shouldn’t tell us anything about the outcomes of other elk. Number? n=7n = 7, Same 
probability? p=0.44P = 9.44, x has a binomial distribution with n=72 = 7 and p=0.44P = 9.44, 


Do 


6.79 Binary? “Success” = hits; “Failure” = does not hit. Independent? Yes, the outcome of one shot does 


not tell us anything about the outcome of other shots. Number? No, there is not a fixed number of trials. Y 
does not have a binomial distribution. 

6.81 X has a binomial distribution with n=7 and p=0.44 
P(X=4)=(74)(0.44)4(0.56)3=0.2304. 


Tech: P(Y=4)= 
binompdf(trials: 7, p: 0.44, x value: 4)=0.2304. 


There is a 23.04% probability that exactly 4 of the 7 elk survive to adulthood. 
6.83 (a) Binary? “Success” = spinner lands in the blue region; “Failure” = spinner does not land in the 
blue region. Independent? Knowing whether or not one spin lands in the blue region tells you nothing 


about whether or not another spin lands in the blue region. Number? n=12 . Same probability? 


p=0.80 . X has a binomial distribution with n=12 and p=0.80P ~ 0.80. (b) 


( 12 
(128)(0.80)8(0.20)4=0.1329 \ 8 
binompdf(trials: 

12, p: 0.80, x value: 8) =0.1329= 0.1329. There is about a 13.29% probability that the spinner will land 
in the blue region exactly 8 of the 12 times. 


6.85 X has a binomial distribution with n=72 = 7 and p=0.44P = 0.44 P(X>4)=0.1402 

P(X >4)= 0.1402 Tech: P(X>4)= 1 — binomcdf(trials: 7, p: 
0.44, x value: 4) =1—-0.8598=0.1402 . This 
probability isn’t very small, so it is not surprising for more than 4 elk to survive to adulthood. 


(0.80)* (0.20) = 0.1329 
; feck: p(y=8)=P (Y= 8§) = 


6.87 X has a binomial distribution with n=12 and p=0.80 . Tech: P(X<7)= 
binomcdf(trials: 12, p: 0.80, x value: 7)=0.0726 


. Assuming the website’s claim is true, there is a probability of about 7.26% that there 
would be 7 or fewer spins landing in the blue region. 


6.89 (a) X has a binomial distribution with n=22 and p=0.20 . Tech: P(X214)= 


1 — binomcdf(trials: 22, p: 0.20, x value: 13) = 0.00001. 
Assuming participants don’t have a special preference for the last thing they taste, there is an almost 0% 
probability that there would be at least 14 people who choose the last kiss. (b) Because this outcome is 
very unlikely, we have convincing evidence that participants have a preference for the last thing they 
taste. 
6.91 (a) Binary? “Success” = red; “Failure” = not red. Independent? Knowing whether or not the light is 
red for one randomly selected passenger tells you nothing about whether or not the light is red on another 


randomly selected passenger. Number? n=20 . Same probability? p=0.30 . R has 
a binomial distribution with n=20 and p=0.30 . (b) The shape is fairly 
symmetric with a single peak at R=6 passengers. 

6.93 


The shape is left-skewed with a single peak at X=10 
6.95 (a) PR=20(0.3)=6 


; if many 


groups of 20 passengers were selected, the average number of passengers who would get a red light would 
be about 6 passengers. (b) oOR=20(0.3)(0.7)=2.049 


; if many groups of 20 passengers were selected, the number of passengers who would get a red light 
would typically vary from the mean (6) by about 2.049. 
6.97 (a) Let Y=number of calls not completed. Y has a binomial distribution with 
n=15 and p=0.91 . Tech: P(Y>12)=1- 

binomcdf(15,0.91,12)=0.8531 


. There is an 85.31% 
probability that more than 12 calls are not completed. (b) pX=15(0.09)=1.35 
; if we 
watched the machine make many sets of 15 calls, the average number of calls that would reach a live 
person would be about 1.35 calls. (c) oX=15(0.09)(0.91)=1.11 


; if we watched the machine make many sets of 15 calls, we would expect the number of calls that reach a 
live person to typically vary from the mean (1.35) by about 1.11 calls. 


6.99 (a) As long as n=100 is less than 10% of the size of the population (the entire high 
school), L can be modeled by a binomial distribution even though the sample was selected without 
replacement. (b) Tech: P(L>15)*1—binomcdf(100,0.11,14)=0.1330 


. There is a 13.3% probability that 15 or more students in the sample are left-handed. 
6.101 No; we are sampling without replacement and the sample size (10) is more than 10% of the 
population size (76). We should not treat the observations as independent. 
6.103 (a) n=100 and p=0.11, so np=11>10 
and n(1—p)=89=10 


Since the expected number of successes and the expected number of failures are both 10 or more, L can be 
approximated by a Normal distribution. (b) We want to find P(L>15) 


_ pL=(100)(0.11)=11 


and oL=(100)(0.11)(0.89)=3.129 


(i) 2=1.28 ; P(Z>1.28)=1-0.8997=0.1003" (4 2 1.28) = 1 — 0.8997 = 0.1003 

(ii) P(L=15)=normalcdf(lower: 15, upper: 1000, mean: 11, SD:3.129)=0.1006. 

P(L > 15) = normalcdf(lower:15,upper:1000,mean:11, SD : 3.129) = 0.1006. there is a 0.1006 
probability that 15 or more students in the sample are left-handed. 


6.105 When sampling without replacement, the trials are not independent because knowing the outcomes 
of previous trials makes it easier to predict what will happen in future trials. However, if the sample is a 


small fraction of the population (less than 10%), the makeup of the population doesn’t change enough to 
make the lack of independence an issue. 

6.107 (a) X is a geometric random variable with p=0.20P ~ 0.20, P(X=3)=(0.8)2(0.2)=0.128 

P(X = 3) = (0.8)° (0.2) = 0.128 Tech: geometpdf(p:0.20, x value:3)=0.128 

geometpdf(p : 0.20, x value : 3) = 0.128 There is a 0.128 probability that it takes Rita exactly 3 pulls 
to start the mower. (b) p(x>6) (X > 6) 

=1-[0.20+(0.8)(0.2)+(0.8)2(0.2)+- - -+(0.8)5(0.2)]=0.2621 

= 1— {0.20 + (0.8) (0.2) + (0.8)° (0.2) +--+ + (0.8)° (0.2)| 

= 0.2621 

Tech: 1 — geometcdf(p: 0.20, x value: 6)=0.2621= 0.2621, There is a 0.2621 probability that it takes Rita 
more than 6 pulls to start the mower. 


6.109 (a) This is not a geometric setting because we can’t classify the possible outcomes on each trial 


(card) as “success” or “failure” and we are not selecting cards until we get 1 success. (b) Games of 4-Spot 


= 0.259) 


Keno are independent, the probability of winning is the same in each game (p=0.259)\P , and 


Lola is repeating a chance process until she gets a success. X is a geometric random variable with 


p=0.259 


6.111 (a) X is a geometric random variable with p=0.097 ; UX=1p=10.097=10.31 


. We 
would expect to examine about 10.31 


invoices to find the first 8 or 9. (b) Tech: P(X>40)=1-geometcdf(p: 0.097, x value: 39)=1—0.9813=0.0187 


. Because the probability of not getting an 8 or 9 before the 40th invoice is small, we may begin to worry 
that the invoice amounts are fraudulent. 


6.113 b 
6.115d 
eli ¢ 
6.119 The standard deviations from smallest to largest are: B, C, A. 


Answers to Chapter 6 Review Exercises 


R6.1 (a) P(Y=5)=0.1 ; there is a 0.1 
probability that a randomly selected patient would rate his or her pain as a5 on a scale of 1 to 5. (b) 
P(Y<2)=0.3 (c) pX=3.1; 
oX=1.136 


R6.2 (a) Temperature is a continuous random variable because it takes all values in an interval of 
numbers. (b) pD=550-550=0°C, 
and the 


standard deviation stays the same, oD=5.7°C , because 
subtracting a constant does not change the variability. 


r= 9 Qe os 99° 
(c) '¥=95(550)+32=1022°R 2 = a 82 = Mee 


_— (2) (5.7) = 10.26°F. 


and oY=(95)(5.7)=10.26°F. 


R6.3 (a) 


0.4 


Probability 


0.0 ————— 
0 1 3 120 


X = Payout for a single $1 bet 
The distribution is skewed right with a single peak at the $0 payout. The mean of the distribution is $0.70 
and the standard deviation is $6.58. (b) Payout averages $0.70 per game. If you were to play many games 
of 4-Spot Keno, the payout amounts would typically vary by about $6.58 from the mean ($0.70). (c) 
Y=5X; pY=5(0.70)=$3.50; oY=5(6.58)=$32.90 
Y = 5Xjpy = 5 (0.70) = $3.50;0y = 5 (6.58) = $32.90 (d) Let W be the amount of Marla’s payout, 
W=X1+X2+X3+X4+X5W = Xi + Xo + X3 + Xa + Xs. ww=5(0.70)=$3.50Hw = 5 (0.70) = $3.50 
and oW=$14.719w = $14.71, 
R6.4 (a) (i)z=0.83;2 = 9-83; p(z>0.83)=1-P(Z<0.83)=0.2033 
P(Z > 0.83) = 1— P(Z < 0.83) = 0.2033 


(ii) P(C>11)=normalcdf(lower: 11, upper: 1000, mean: 10, SD:1.2)=0.2023. 


There is a 0.2023 probability that a randomly selected cap has strength greater than 11 inch-pounds. (b) It 
is reasonable to assume the cap strength and torque are independent because the machine that makes the 


caps and the machine that applies the torque are not the same. (c) C-T is Normal with 
mean 10-7=3 inch-pounds and standard deviation 

0.92+1.22=1.5 inch-pounds. 

(d) (i)z=-2 ; P(Z<-2)=0.0228 


(ii) P(C-T<0)=normalcdf(lower: —1000, upper: 0, mean: 3, SD: 1.5)=0.0228 


. There is a 0.0228 probability that a randomly selected cap will break when being fastened by the 
machine. 


R6.5 (a) Binary? “Success” = orange; “Failure” = not orange. Independent? The sample of size n=8 


is less than 10% of the large bag, so we can assume the outcomes of trials are independent. 


Number? We are choosing a fixed sample of n=8 candies. Success? The probability of 
success remains constant at p=0.205 . This is a binomial setting, so X has a binomial 
distribution with n=8 and p=0.205 . (b) P(X=3)=0.1532 


. Tech: P(X=3)= binompdf 


(trials: 8, p: 0.205, x value: 3)=0.1532 . There is a 15.32% probability that exactly 3 of the 8 
M&M’S® will be orange. (c) Tech: P(X>4)=1-binomcdf(trials: 8, p: 0.205, x value: 3)=0.0610 


. There is a 6.1% probability that at least 4 of the 8 M&M’S will be orange. (d) No; it is not very unusual 
to receive 4 or more orange M&M’S in a sample of 8 M&M’S when 20.5% of M&M’S produced are 
orange. This happens 6.1% of the time due to chance alone, so this result does not provide convincing 
evidence that Mars, Inc.’s claim is false. 


R6.6 (a) pX=8(0.205)=1.64 

; if 
we were to select many random samples of size 8, we would expect to get about 1.64 orange M&M’S, on 
average. (b) oX=8(0.205)(0.795)=1.14 


; if we were to select many random samples of size 8, the number of orange M&M’S would typically vary 
from the mean (1.64) by about 1.14. 
R6.7 Y is a geometric random variable with p=312=0.25 


P(Y<3)=0.5781 
Tech: geometcdf(p: 0.25, x value: 3)=0.5781 


There is a 0.5781 probability that it takes 3 or fewer spins to get a wasabi bomb. 


0 


R6.8 (a) Although the trials are binary (“success” = use and “failure” = does not use) and there are a fixed 


number of trials (n=200) , the residents were selected 
without replacement, violating the condition of independence. However, since the sample size (n=200) 


is less than 10% of the population size (all residents in a 
large city), T has approximately a binomial distribution with n=200 and p=0.34 
. (b) n=200 and p=0.34 , SO np=68>10 
and n(1—p)=132>10 


Since the expected number of successes and the expected number of failures are both 10 or more, T can be 
approximated by a Normal distribution. (c) We want to find P(T<60) 
uT=(200)(0.34)=68 


and oT=np(1-p)= 


(200)(0.34)(0.66)=6.699 


(i)z=-1.19 ; P(Z<-1.19)=0.1170 


(ii) P(T<60)=normalcdf(lower: —1000, upper: 60, mean: 68, SD:6.699)=0.1162. 


There is a 0.1162 probability that at most 60 residents in the sample use public transportation at least once 
per week. 


Answers to Chapter 6 AP® Practice Test 
T6.1b 
T6.2d 
T6.3 d 
T6.4e 
T6.5d 
T6.6 b 
T6.7 Cc 
T6.8 b 
T6.9 b 
T6.10¢c 


T6.11 (a) P(Y<2)=0.96 ; there is a 96% chance that at 
least 10 eggs are unbroken in a randomly selected carton of “store brand” eggs. (b) pY=0.38HY — 0.38. 
if we were to randomly select many cartons of eggs, we would expect an average of about 0.38 eggs to be 
broken. 

(c)o Y=0.82199Y = 0.8219 ; if we were to randomly select many cartons of eggs, the number of broken 
eggs would typically vary from the mean (0.38) by about 0.8219. (d) X is a geometric random variable 
with p=0.11? ~ 0.11 We are looking for P(x<3)P (X< 3) Tech: geometcdf1p: 


0.11, x value: pbc=0.2950 


2 = 


T6.12 (a) Binary? “Success” = greets dog first; “Failure” = does not greet dog first. Independent? We are 


sampling without replacement, but 12 is less than 10% of all dog owners. Number? n=12 
Same probability? The probability of success is constant for all trials (p=0.66) 

. X is a binomial random variable with n=12 
and p=0.66 . (b) P(X=6)=0.1180 

(c) P(X<4)=0.0213 

P(X<4)= 0.0213 Tech: binomcdf(trials: 12, p: 0.66, x value: 4)=0.0213 
binomedf (trials:12,p:0.66,xvalue:4) = 0.0213 Because this probability is very small, it is unlikely 
that only 4 or fewer owners will greet their dogs first by chance alone. This gives convincing evidence 
that the claim by Ladies’ Home Journal is incorrect. 
T6.13 (a) Letting D=A-E, D=50-25=25P = A—E,up = 50 — 25 = 25. Because the amount of time 
they spend on homework is independent of each other, oD=11.189D ~ 11.18 (b) (i)z=-2.244 = — 2.24. 
P(Z<-2.24)=0.0125P (4 < —2.24) = 0.0125 


(ii) P(D<0)=normalcdf1lower: -1000,P (D < 0) = normalcdfllower: — 1000, upper: 0, mean: 25, SD: 
11.182 = 0.0127. There is a 0.0127 probability that Ed spent longer on his assignment than Adelaide did 
on hers. 


T6.14 (a) X has a binomial distribution with n=1200n = 1200 and p=0.13 
wX=1200(0.13)=156 

and 
oX=1200(0.13)(0.87)=11.6499 


. (b) If the sample contains 10% Hispanics, there were 1200(0.10)=120 
Hispanics in the sample. We want to find 
P(X<120)=0.00083 . Tech: binomcdf trials: 1200, 
p=0.13, x value: 1202=0.00083 
. If we use the Normal approximation to the binomial 
distribution, z=—-3.09 and P(Z<-3.09)=0.001 


. Because this probability is small, it is 
unlikely to select 120 or fewer Hispanics in the sample just by chance. This gives us reason to be 
suspicious of the sampling process. 


Chapter 7 

Section 7.1 

Answers to Check Your Understanding 

page 446: 1. The population is all M&M’S® Milk Chocolate Candies produced by the factory in 


Hackettstown, NJ. The parameter is p= the proportion of all M&M?’S Milk Chocolate Candies 
produced by the factory in Hackettstown, NJ, that are orange. The parameter is claimed to be p=0.25 


. The sample is the 50 M&M’S Milk Chocolate Candies selected. The statistic is the 
proportion of the sample of 50 M&M’S that are orange, p/ 
be 


3. The graph below shows a possible distribution of sample data. For this sample, there are 11 orange 


M&M’S, so pA=1150=0.22 


4. The middle graph is the approximate sampling distribution of p/ . The statistic measures the 
proportion of orange candies in samples of 50 M&M’S. Assuming that the company is correct, 25% of the 
M&M’S are orange, so the center of the distribution of p/ should be at approximately 0.25. The first 
graph shows the distribution of the colors for one sample, rather than the distribution of p/ from 
many samples, and the third graph is centered at 0.125, rather than 0.25. 


page 452: 1. The median does not appear to be an unbiased estimator of the population median. The mean 
of the simulated sampling distribution of the sample median (73.5) is not equal to the median of the 
population (75). 

2. Increasing the sample size from 10 to 20 will decrease the variability of the sampling distribution. 
Larger samples provide more precise estimates, because larger samples include more information about 
the population distribution. 


3. The sampling distribution of the sample median is skewed to the left and single peaked. 


Answers to Odd-Numbered Section 7.1 Exercises 


7.1 Population: All people who signed a card saying they intend to quit smoking. Parameter: p= 
the proportion of the population who had not smoked over the past 6 months. Sample: The 1000 people 


selected at random. Statistic: p= the proportion in the sample who had not smoked over the 


past 6 months=0.21 


7.3 Population: All dental practices in California. Parameter: The interquartile range of the price to fill a 
cavity for all dental practices in California. Sample: The 10 randomly selected dental practices that 


provided the price they charge to fill a cavity. Statistic: IQR= the 
interquartile range of the price to fill a cavity for the 10 selected dental practices =$74 


7.5 Population: All bottles of Arizona Iced Tea produced that day. Parameter: =" ~ the average 
number of ounces per bottle in all bottles of Arizona Iced Tea produced that day. Sample: The 50 bottles 
of tea selected at random from the day’s production. Statistic: x = = the average number of ounces of 
tea contained in the 50 bottles=19.650 bottles = 19.6 ounces. 


7.7 The 10 possible SRSs of size n=2 and their sample means are: 
1: Abigail(10), Bobby(5) x =7.5 

: Abigail(10), Carlos(10) x =10 

: Abigail(10), DeAnna(7) x =8.5 

: Abigail(10), Emily(9) x =9.5 

: Bobby(5), Carlos(10) x =7.5 


e. 

3 

4 

FS) 

6: Bobby(5), DeAnna(7) x =6 
7: Bobby(5), Emily(9) x =7 
8: Carlos(10), DeAnna(7) x =8.5 
9: Carlos(10), Emily(9) x =9.5 


10: DeAnna(7), Emily(9) x =8 


7.9 The 10 possible SRSs of size n=2 and the proportion of females in each sample are: 
1: Abigail(F), Bobby(M) p/=0.50 
2: Abigail(F), Carlos(M) p4=0.50 


3: Abigail(F), DeAnna(F) pA=1P = 

4: Abigail(F), Emily(F) p\=1P = 1 

5: Bobby(M), Carlos(M) p\=0P =0 

6: Bobby(M), DeAnna(F) p=0.50P = 0.50 
7: Bobby(M), Emily(F) p§=0.50P = 9.50 
8: Carlos(M), DeAnna(F) pA=0.50P = 0.50 
9: Carlos(M), Emily(F) p4=0.50 

10: DeAnna(F), Emily(F) pA=1 


7.11 (a) 


(b) An example bar graph is given. 


7.13 (a) pA=45100=0.45 , which is less than 
0.60. (b) It is possible 


that 60% of the students did their homework, and the students got a p/ less than 60% because of 
sampling variability. It is also possible that the sample proportion is less than 60% because less than 60% 
of all the students did their homework. (c) In one simulated SRS of 100 students, 73 students did their 


assigned homework last week. (d) Because there were no values of p/ less than or equal to 0.45 in 
the simulation, it would be very surprising to get a sample proportion of 0.45 or less in an SRS of size 100 


from a population in which p=0.60 . (e) Because it would be very surprising to get a sample 
proportion of pA=0.45 or less in an SRS of size 100 when p=0.60 , we 
should be skeptical of the newspaper’s claim. 

7.15 Because 78/250 or 31.2% of the values of p/ are less than or equal to 0.57 in the simulation, it 
would not be surprising to get a sample proportion of p\=0.57 or less in an SRS of size 


100 from a population in which p=0.60 . A sample proportion of p\=0.57P = 0.57 does not 
provide convincing evidence that the population proportion of students who did all their assigned 
homework is less than p\=60,P = 69, 


7.17 A sample standard deviation of 5°F is quite large compared with what we would expect by chance 
alone, because none of the 300 simulated SRSs had a standard deviation that large. A sample standard 
deviation of 5°F provides convincing evidence that the manufacturer’s claim is false and that the 
thermostat actually has more variability than claimed. 


7.19 The 6 possible SRSs of size n=2n = 2 and the sample proportion for each sample are: 
1: Red, White p4=0.5 

2: Red, Silver pA=0.5 

3: Red, Red pA=1P =1 

4: White, Silver p\=0P = 9 


5: White, Red p=0.5? = 0.5 
6: Silver, Red pA=0.5P = 0.5 


T T 
0 05 I 


p = Sample proportion of red cars 


The population proportion of red cars is p=2/4=0.5? = 2/4 = 0-5 Because the mean of the sampling 
distribution is also 0.5, the sample proportion is an unbiased estimator of the population proportion. 


7.21 The 4 possible SRSs of size n=3 and the sample proportion for each sample are: 
1: Red, White, Silver pA=1/3 

2: Red, White, Red pA=2/3 

3: Red, Silver, Red pA=2/3 

4: White, Silver, Red pA=1/3 


The variability in this sampling distribution is less than the variability in the sampling distribution from 
Exercise 19. Increasing the sample size decreases the variability in the sampling distribution. 

7.23 If we chose many SRSs and calculated the sample mean x — for each sample, the 
distribution of x — would be centered at the value of p. 


7.25 (a) Statistics ii and iii are unbiased estimators because the means of their sampling distributions 
appear to be equal to the corresponding population parameters. (b) Statistic ii does the best job at 
estimating the parameter. It is unbiased and has very little variability. 


tora 

7.29 b 

7.31 (a) We are looking for P(z < —2.5). (i) P(z < -2.5) = 0.0062 (ii) P(z < -2.5) = normalcdf(lower: 
—1000, upper: —2.50, mean: 0, SD: 1) = 0.0062. Less than 1% of healthy young adults have osteoporosis. 


(b) X follows a Normal distribution with a mean of —2 and a standard deviation of 1. We want to find P(X 
< -2.5). (i) z = -0.5; P(X < -2.5) = 0.3085 (ii) P(X < -2.5) = normalcdf(lower: —1000, upper: —2.5, mean: 
-2, SD: 1) = 0.3085. About 30.85% of women aged 70-79 have osteoporosis. 


Section 7.2 


Answers to Check Your Understanding 


page 463: 1. upA=p=0.75/' = P — 0.75 


2. Because 1000 < 10% of all young adult Internet users, the 10% condition has been met. The standard 
deviation of the sampling distribution of p/P is op=p(1—p)n=0.75(0.25)1000=0.0137 


[pl p) i / 0.75(0.25) 
in — VV ~ yooo 


o> = = 0.0137 | 


SRSs of size 1000, the sample proportion of young adult Internet users that watch online videos will 
typically vary by about 0.0137 from the true proportion of p=0.75? ~ 0.79, 


3, Because np=1000(0.75)=750>10"P = 1000 (0.75) = 750 > 10 ang n(1—p)=1000(0.25)=250>10 


n(1—p) = 1000 (0.25) = 250 > 10 ino sampling distribution of p\? is approximately Normal. 


4. If the sample size were 9000 instead of 1000, the sampling distribution would still be approximately 
Normal with mean 0.75. However, the standard deviation of the sampling distribution would be smaller 


/ 0.75(0.25) 


,» = 4/——— = 0.0046 
by a factor of 3. In this case, op=0.75(0.25)9000=0.0046 ! V “9000 ; 
Answers to Odd-Numbered Section 7.2 Exercises 
7.33 (a) pp=p=0.55 (b) n=500 is less than 


10% of the population of all registered voters, so op=0.55(1-0.55)500=0.022 


.In SRSs 
of size n=500 , the sample proportion of registered voters that are Democrats will typically 
vary by about 0.022 from the true proportion of p=0.55 . (c) Because np=500(0.55)=275>10 


and n(1—p)=500(1-0.55)=225210 


, the sampling distribution of p/ is approximately Normal. 


7.35 (a) pp4=p=0.20 (b) n=30 is less 
than 10% of the population of all Skittles®, so op=0.20(1-0.20)30=0.0730 


. In SRSs of 
size n=30 , the sample proportion of Skittles that are orange will typically vary by about 0.073 
from the true proportion of p=0.20 
(c) Because np=30(0.2)=6<10 
the 


sampling distribution of p/ is not approximately Normal. Because p=0.20 is closer to 
0 than to 1, the sampling distribution of p/P is skewed to the right. 


7.37 A sample size of 30(4)=12030 (4) = 120 \ould reduce the standard deviation of the sampling 
distribution to half the value found in Exercise 35(b): opA=0.20(1—-0.20)120=0.0365 


sfinaies / 0.20(1—0.20) _ > 
05 = Va — = 0.0365 


p 
7.39 (a) No; the 10% condition is not met here because more than 10% of the population (10/76=13%) 
(10/76 = 13%) was selected. (b) No; the Large Counts condition is also not met because the sample size 
was only n=10n = 10. Neither np’? nor n(1-p)” (1 — P) will be at least 10. 

7.41 (a) upA=p=" ‘p ~2= 0.70 (b) n=1012 is less than 10% of the population of all U.S. 
adults, so opA=0.7(1-0.7)1012=0.0144 


(c) np=1012(0.70)=708.4>10 
and n(1—p)=1012(0.30)= 


303.6210 , so the sampling distribution of p/ is approximately 
Normal. (d) We want to find P(pr<0.67) (p< 0.67) (i)z=—-2.087 = 2.08. P(z<-—2.08)=0.0188 
P(z < —2.08) = 0.0188 


(ii) P(p4<0.67)=normalcdf(lower: —1000, upper: 0.67, mean: 0.70, SD: 0.0144)=0.0186 
P(p < 0.67) = normalcdf(lower: — 1000,upper:0.67, mean: 0.70,SD:0.0144) = 0.0186 


. There is a 0.0186 probability of obtaining a sample of size 1012 in which 67% or fewer say they drink 
the milk. (e) Because 0.0186 is a small probability, there is convincing evidence against the claim that 
p=0.70P =~ 0.70_it isn’t plausible to get a sample proportion this small by chance alone. 

7.43 ppA=0.70 ; because 267 is less than 10% of the population of 

all college women, op/=0.7(0.3)267=0.0280 


. Because np= 
267(0.7)=186.9210 


and 
n(1—p)=267(0.3)=80.1>10 


, the sampling distribution of p/ can be approximated by a Normal distribution. We want to find 


P(pA20.75) . (i)z=1.79 ; P(z=1.79)=0.0367 


(ii) P(p4=0.75)=normalcdf(lower: 0.75, upper: 1000, mean: 0.7, SD: 0.02802=0.0371. 


There is a 0.0371 probability that 75% or more of the women in the sample have been on a diet within the 
last 12 months. 


7.45 (a) up=0.90 ; because 100 is less than 10% of the population 


of orders (100/5000=2%) F 
op/=0.90(0.10)100=0.03 


. Because 
np=100(0.90)=90=10 


and n(1—p)=100(0.10)=10>10” (1 — p) = 100 (0.10) = 10 > 10 the sampling distribution of p\? can 
be approximated by a Normal distribution. We want to find P(ps<0.86)" (p< 0.86) (i)z=-1.33 

2 = —1.33. pcz<-1,33)=0.0918" (2 < —1-33) = 0.0918 (3) p@r<o.86)=normalcdf(lower: -1000, 

P (p < 0.86) = normaledf(lower: — 1000, upper: 0.86, mean: 0.90, SD: 0.03)=0.0912. 
0.90,SD:0.03) = 0.0912. There is a 0.0912 probability that 86% or fewer of orders in an SRS of 100 
were shipped within 3 working days. (b) No; it isn’t unusual to get a sample proportion of 0.86 or smaller 
when selecting an SRS of 100 from a population in which p=0.90? ~ 0.90. Thus, it is plausible that the 
90% claim is correct and that the lower than expected percentage is due to chance alone. 

7.47 a 

7.49 b 

7.91 (a) 


Share music from computer? 


Download music files online? 


(b) 62/100=0.62=62% of Internet users 
neither download nor share music files. (c) 12/29=0.414=41.4% 


of Internet users who download music 
files online also share music files. 


Section 7.3 

Answers to Check Your Understanding 

page 474: 1. We want to find P(X>270) . (i) z=0.25 : 
P(z>0.25)= 1-0.5987=0.4013 (ii) 
P(X>270)=normalcdf(lower: 270 


’ 


upper: 1000, mean: 266 , SD: 16)=0.4013. 


There is a 0.4013 probability of selecting a woman 
whose pregnancy lasts for more than 270 days. 
2. days ux =y1=266 
3. The sample of size 6 is less than 10% of all pregnant women. Therefore, the standard deviation of the 
sampling distribution of x is 
ox =on=166=6.532 


days. In SRSs of size 6, the sample mean 


length of human pregnancies from conception to birth will typically vary by about 6.532 days from the 
true mean of 266 days. 


4. The mean length of pregnancy follows a Normal distribution with a mean of 266 and a standard 
deviation of 6.532 and we want to find P(x >270) . (i) z=0.61 
; P(z>0.61)= 1-0.7291=0.2709 
(ii) 
P(x >270)=normalcdf(lower: 270, upper: 1000, mean: 266, SD: 6.532)=0.2701 


. There is a 0.2701 probability of selecting a sample of 6 women whose mean pregnancy length exceeds 
270 days. 


Answers to Odd-Numbered Section 7.3 Exercises 


7.53 (a) px =p=225 seconds (b) Because the 
sample size (10) is less than 10% of the population of songs on David’s iPod, 


ox =on=6010=18.974 


seconds. In SRSs of size 10, the sample 
mean play time will typically vary by about 18.974 seconds from the true mean of 225 seconds. 
7.55 10=60n — 10n=60 — n=6 = n=36 


. A sample size 

of 36 songs will produce a sampling distribution with a standard deviation of 10 seconds. 
7.57 (a) We want to find P(X<295) . (i) z=-1 
P(Z<-—1)=0.1587 

(ii) P1X<295)=normalcdf(lower: —1000, upper: 295, mean: 


298, SD: 3)=0.1587. There is a 
0.1587 probability that a randomly selected bottle contains less than 295 ml. (b) Because X has a Normal 


distribution, the sampling distribution of x has a Normal distribution. px =y=298 ml 


; because 6 is less than 10% 


of all bottles produced, ox =on=36=1.2247 


ml. We want to 


find P(x <295) (i) z=-2.45 ; P(Z<-2.45)=0.0071 


(ii) P(x <295)=normalcdf(lower: —1000, upper: 295, mean: 298 
P(& < 295) = normalcdf(lower: — 1000,upper:295, mean:298 op. 1.2247)=0.0072. 


SD:1.2247) = 0.0072. There is a 0.0072 probability that the mean contents of 6 randomly selected 
bottles is less than 295 ml. 


7.59 (a) The sampling distribution of x @ is Normal with px =)=188 mg/dl = # = 188 mg/dl 


Because the sample size (100) is less than 10% of all men aged 20 to 34, ox =on=41100=4.1 mg/dl 


= — < — SE ome -() / 
34, oF v =i 4.1 mg, I 


(b) We want to find P(185<x-<191) (185 < Z < 191) (i) z=-0.732 = —9-73 and z=0.732 = 0.73; 
P(-0.73<z<0.73)=0.7673- 


0.2327=0.5346 
(ii) P(185<x < 191)=normalcdf(lower: 185, upper: 191, mean: 


188, SD: 4.1)=0.5357. There is a 

0.5357 probability that x | estimates p within +3 mg/dl=3 mg/dl (c) ox =on=411000= 
a 41 

? a — —_—_ = —— 


vn V1000 

We want to find P(185<x-<191)? (185 < @ < 191) (j) 7=-9 312 = —2.31 and 2=2.312 = 2.31; 
P(-2.31<z<2.31)=0.9896-P (—2-31 < 2 < 2.31) = 0.9896— 9 9194=0.97920.0104 = 0.9792 
(ii) P(185<x <191)=normalcdf(lower: 185, upper: 191, mean: 


188, SD: 1.30)=0.9790. There is a 
0.9790 probability that x — estimates p —_ within +3 mg/dl 


. The larger sample is better because it is more likely to produce a 
sample mean within 3 mg/dl of the population mean. 


7.61 (a) The sampling distribution of x is Normal with px =y=48 


months. Because the sample size (8) is less 
than 10% of all batteries, 


Ox =on=8.28=2.899 


months. We want to find P(x < 42.2) 


(i) z=-2.00 ; P(z<—2.00)=0.0228 


(ii) P(x <42.2)=normalcdf 


(lower: —1000, upper: 42.2, mean: 48, SD: 2.899)=0.0227. 


There is a 0.0227 probability that the sample mean lifetime is 42.2 months or less if the lifetime 
distribution is unchanged. (b) Because this probability is very small, there is convincing evidence that the 
company is overstating the average lifetime of its batteries. It is not plausible to get a sample mean this 
small by chance alone. 


7.63 (a) Because n=5<30 , the sampling distribution of x 
will also be skewed to the right, but not quite as strongly as the population. (b) Because 
n=100230 , the sampling distribution of x — 
is approximately Normal by the central limit theorem. 
7.65 (a) Because the sample size is large (n=1000$30) 
, the sampling distribution of x — is 
approximately Normal. px =y=$250 
Assuming 1000 is less than 10% of all homeowners with fire insurance, 
ox =on=50001000=158.114 


. We want to find P(x >300)P (x > 300) 


(i) 2=0.322 = 0.32; P(z>0.32)=1-0.6255=0.3745! (2 > 0.32) = 1 — 0.6255 = 0.3745 (i) pox->300)= 


P(x > 300) = normalcdf(lower: 300, upper: 1000, mean: 250 


oe 
158.114) = 0.3759. There is a 0.3759 probability that the mean annual loss from a sample of 1000 policies 


is greater than $300. (b) (i) Look in the body of Table A for the value closest to 0.90, which is z=1.28 
_ 2z—250 
z = 1.28. Solving 1.28=x-250158.114!'25 = is8.14 gives x=$452.39@ = $452.39, 


(ii) invNorm(area: 0.90, mean: 250, SD: 158.114)=$452.63. 
invNorm (area:0.90,mean:250,8D:158.114) = $452.63. if the 


company wants to be 90% certain that the mean loss from fire in an SRS of 1000 homeowners is less than 
the amount they charge for the policy, they should charge $452.63. 


7.67 (a) It is more likely to randomly select 1 square yard of material and find 2 or more flaws than to 
randomly select 50 square yards of material and find an average of 2 or more flaws. There is more 
variability from the mean of 1.6 in the number of flaws found in individual square yards of material than 
in the average number of flaws found in a sample of 50 square yards of material. (b) You cannot use a 
Normal distribution to calculate the probability of the first event in part (a) because the population 
distribution is not Normal and the sample size is not at least 30. (c) Because the sample size is large 


(n=50>30)\" = 50230) the sampling distribution of x — is approximately Normal with 


Ux =p=1.6 . Because 50 is less than 10% of all 
square yards of this carpet, ox =on=1.250=0.1697 


. We want to 


find P(x >2) . (i) 2=2.36 
P(z>2.36)=1-0.9909=0.0091 


(ii) P(x >2)=normalcdf(lower: 2, upper: 1000, mean: 1.6, SD: 


0.16972=0.0092. There is a 0.0092 probability of finding that 
the mean number of flaws is 2 or more in a random sample of 50 square yards of carpet. 

7.69 No; the histogram of the sample values will look like the population distribution, whatever it happens 
to be. The central limit theorem says that the histogram of the sampling distribution of the sample mean 


will look more and more Normal as the sample size increases. 
7.71 Because the sample size is large (n=30230) 
, the distribution of x — 
is approximately Normal with px =y=190 


pounds. Because n=30 is less than 
10% of all possible passengers, 


ox =on=3530=6.3901 


pounds. We want to find P(x >200) 

(i) z=1.56 ; P(z>1.56)=1-0.9406=0.0594 

(ii) P(x >200)=normalcdf(lower: 200, upper: 1000, mean: 190, SD: 6.3901)=0.0588. 

There is a 0.0588 probability that the mean weight exceeds 200 pounds (and the total weight exceeds 
6000 pounds). 

7.73 b 

Pac 


7.77 The unemployment rates for each level of education are: 
P(unemployed|didn’t finish HS)=106212,470=0.0852 


P(unemployed|HS but no college)=197737,834=0.0523 


P(unemployed|less than bachelor’s degree)=146234,439=0.0425 


P(unemployed|college graduate)=109740,390=0.0272 


There is an association between unemployment rate and education. The unemployment rate decreases 


with additional education. 


Answers to Chapter 7 Review Exercises 


R7.1 The population is the set of all eggs shipped in one day. The parameter is p= 
of eggs shipped that day that had salmonella. The sample consists of the 200 eggs examined. The statistic 


is pA= 


salmonella =9/200=0.045 


R7.2 (a) The 10 possible SRSs of size n=3 
ilk 


wa N Du KR WwW DHD 


64, 66, 71 median=66 


: 64, 66, 73 median=66 

: 64, 66, 76 median=66median = 66 
: 64, 71, 73 median=71median = 71 
: 64, 71, 76 median=71median = 71 


64, 73, 76 median=73median = 73 


: 66, 71, 73 median=71median = 71 
: 66, 71, 76 median=71median = 71 
: 66, 73, 76 median=73median = 73 


10: 71, 73, 76 median=73 


(b) If the sample size was increased to n=4n = 4, the variability of the sampling distribution of the 
sample median would decrease. (c) The 5 possible SRSs of size n=4m = 4 and the median number of 


pages of each sample are: 


1: 
2: 


64, 66, 71, 73 median=68.5median = 68.5 
64, 66, 71, 76 median=68.5median = 68.5 


the proportion of eggs in the sample that had 


and the median score of each sample are: 


3: 64, 66, 73, 76 median=69.5median = 69.5 
4: 64, 71, 73, 76 median=72 
5: 66, 71, 73, 76 median=72 


This sampling distribution does support the answer to part (b). The variability of the sampling distribution 
with n=4 is less than the variability of the sampling distribution with n=3 
R7.3 (a) 


(b) Answers will vary. An example dotplot is given. The range for this sample is 4375-2790=1585 


A dot plot marks the number of instances for birth weights in grams. 


(c) The dot at 2800 represents one SRS of size n=5 from this population where the sample 
range was 2800 grams. (d) The sample range is not an unbiased estimator of the population range. If it 
were unbiased, then the sampling distribution of the sample range would have 3417 (the population range) 


as its mean. 
R7.4 (a) pp4=p=0.15 (b) Because the sample size of 
n=1540 is less than 10% of the population of all adults, the standard deviation 


of the sampling distribution of p/ is opA=0.15(0.85)1540=0.0091 


(c) Because np=1540(0.15)=231>10 


and n(1 — p) = 1540(0.85) = 1309210 , the sampling distribution of p/ 
is approximately Normal. (d) We want to find P(0.13 <p<0.17) 
. (i) z=-2.20 and 
z=2.20 ; P(-2.20<z<2.20)=0.9861- 


0.0139=0.9722 (ii) P(0.13<p4<0.17)=normalcdf(lower: 0.13, upper: 0.17, 


mean: 0.15, SD: 0.0091) = 0.9720. There is a 0.9720 probability of obtaining a sample in which between 
13% and 17% are joggers. 


R7.5 (a) We have an SRS of size 100 drawn from a population in which the proportion who get a red light 
is p=0.30 , assuming the agents’ claim is true. Thus, pp\=p=0.30 

. Because 100 is less than 
10% of the population of travelers, op\=0.30(0.70)100=0.0458 


Because np=100(0.30)=30210 


and n(1-p)=100(0.70)=70=10 


, the sampling distribution of p/ can be approximated by a Normal distribution. We want to find 
P(p<0.20) . (i) z=-2.18 ; P(z<-2.18)=0.0146 
(ii) P(p4<0.20)=normalcdf (lower: 
—1000, upper: 
0.20, mean: 0.30, SD: 0.0458)=0.0145. 


There is a 0.0145 probability that 
20% or fewer of the travelers get a red light. 


(b) Because this is a small probability, there is convincing evidence against the agents’ claim—it isn’t 
plausible to get a sample proportion of travelers with a red light this small by chance alone. 


R7.6 (a) We want to find P(X=105) . (i) z=0.33 : 
P(z>0.33)=1-0.6293=0.3707 


(ii) P(X=105)=normalcdf(lower: 105, upper:1000, mean: 100, SD: 15)=0.3694. 


There is a 0.3694 probability of selecting an individual with a WAIS score of at least 105. 
(b) The mean of the sampling distribution of x — is px =p=100 
. Because the sample of size 60 is less than 


10% of all adults, the standard deviation of the sampling distribution of x — is ox = 


on=1560=1.9365 


. The mean WAIS score for a random sample 

of 60 adults will typically vary from the mean (100) by about 1.9365 points. (c) We want to find 
P(x >105) . (i) z=2.58 ; 
P(z>2.58)=1-0.9951=0.0049 (2 > 2.58) = 1 — 0.9951 = 0.0049 


(ii) P(x >105)=normalcdf(lower: 105, upper: 1000, mean: 100, SD: 1.9365)=0.0049. 
P(& > 105) = normalcdf(lower:105,upper:1000,mean: 100, SD:1.9365) = 0.0049. there is a 


0.0049 probability of selecting a sample of 60 adults whose mean WAIS score is at least 105. (d) The 
answer to part (a) could be quite different depending on the shape of the population distribution. The 
answer to part (b) would be the same because the mean and standard deviation do not depend on the shape 


(60 > 30) 


of the population distribution. Because of the large sample size (60230) , the answer we gave 


for part (c) would still be fairly reliable due to the central limit theorem. 


R7.7 (a) Because the sample size is large (n=50>30)\” = 50 > 30) 


, the central limit theorem says that 
the distribution of x will be approximately Normal. (b) px =p=0.5 
; because 50 is less than 10% of all 


traps, ox =on=0.750=0.0990 


. We want to find P(x >0.6) 
(i) z=1.01 ; P(z>1.01)=1-0.8438=0.1562 

(ii) 
P(x >0.6)=normalcdf(lower: 0.6, upper: 1000, mean: 0.5, SD: 0.0990)=0.1562. 


There is a 0.1562 probability that the mean number of moths is greater than or equal to 0.6. (c) No; 
because this probability is not small, it is plausible that the sample mean number of moths is this high by 
chance alone. There is not convincing evidence that the moth population is getting larger in their state. 


Answers to Chapter 7 AP® Practice Test 
T7Ae 
T7,2¢ 
3c 
T7.4a 
T7.5d 
T7.6d 
T7.7b 
T7.8e 
T7.9b 
T7.10e 


T7.11 Sample statistic A provides the best estimate of the parameter. Both statistics A and B appear to be 
unbiased, while statistic C appears to be biased because the center of its sampling distribution is smaller 
than the value of the parameter. In addition, statistic A has less variability than statistic B. In this 
situation, we want low bias and small variability, so statistic A is the best choice. 


T7.12 (a) The probability that a single household pays more than $55 cannot be calculated, because we do 
not know the shape of the population distribution of monthly fees. (b) ux =p=$50 


; because the 
sample of size 50 is less than 10% of all households with 


Internet access, ox =on=2050=$2.828 


. (c) Because the sample 
size is large (nN=50230) 
, the distribution of x — 


will be approximately Normal. (d) We want to find P(x >55) 


. (i) 2=1.77 ; P(z>1.77)=1-0.9616=0.0384 
(ii) 
P(x >55)=normalcdf (lower: 55, upper: 1000, 
mean: 50 » oD: 
2.828)=0.0385. There is a 0.0385 probability that the mean 


monthly fee paid by the sample of 50 households exceeds $55. 
T7.13 (a) pp4=p=0.22 ; because 300 is less than 10% 
of children 


under the age of 6, opA=0.22(0.78)300=0.0239 


. Because np = 
np=300(0.22)=66 


and 
n(1—p)=300(0.78)=234 
are both at least 10, the sampling distribution of p/ can be approximated by a Normal distribution. 
We want to find P(p4>0.29) . (i) 2=2.93 : 
P(z22.93)=1-0.9983=0.0017 
(ii) 


P(p4>0.29)=normalcdf(lower: 0.29, upper: 1000, mean: 0.22, SD: 0.0239)=0.0017. 


There is a 0.0017 probability that more than 29% of the sample are from poverty-level households. (b) 
Because it is unlikely to get a sample proportion of 29% or greater by chance alone, there is convincing 
evidence that the percentage of children under the age of 6 living in households with incomes less than the 
official poverty level in this state is greater than the national value of 22%. 


Answers to Cumulative AP® Practice Test 2 
AP2.1a 
AP2.2 d 
AP2.3e 


AP2.4b 
AP2.5c¢ 
AP2.6d 
AP2.7 ¢ 
AP2.8a 
AP2.9d 
AP2.10¢ 
AP2.11b 
AP2.12¢ 
AP2.13 a 
AP2.14c¢ 
AP2.15d 
AP2.16c 
AP2.17e 
AP2.18a 
AP2.19 cc 
AP2.20b 
AP2.21a 


AP2.22 (a) This is an observational study. Subjects were not assigned to take (or not take) fish oil. (b) 
Two variables are confounded when their effects on the cholesterol level cannot be distinguished from 
one another. For example, people who take omega-3 fish oil might also be more health conscious in 
general and do other things such as eat more healthfully or exercise more. If eating more healthfully or 
exercising more lowers cholesterol, researchers would not know whether it was the omega-3 fish oil or the 
more healthy food consumption or exercise that lowered cholesterol. (c) No; this wasn’t an experiment 
and taking fish oil is possibly confounded with other good habits, such as healthful eating and exercise. 


AP2.23 (a) y4=25.66-0.003131x , where y4 is the 


predicted number of wins and x is the number of yards allowed. (b) y/= 
25.66—-0.003131(4668)=11.04 
wins. The residual=10- 
11.04=-1.04 wins. The actual number of 
Seattle Seahawk wins was 1.04 less than the number of wins predicted by the regression line with x=4668 
yards allowed. (c) Because the Carolina Panthers allowed fewer yards than average and 
also had more wins than average, this point will increase the steepness of the negative slope of the least- 
squares regression line (make it more negative) and increase the y intercept of the least-squares regression 
line. 
AP2.24 (a) The distribution of seed mass for the cicada plants is roughly symmetric, whereas that for the 


control plants is skewed to the left. Neither group had any outliers. The median seed mass is the same for 


both groups (median=0.25) . The cicada plants had a 


larger range in seed mass, but the control plants had a larger IQR. (b) The distribution of seed mass for the 
cicada plants is roughly symmetric, which suggests that the mean should be about the same as the median. 
However, the distribution of seed mass for the control plants is skewed to the left, which will pull the 
mean of this distribution below its median toward the smaller values. Because the medians of both 
distributions are equal, the mean for the cicada plants is likely greater than the mean for the control plants. 
(c) The purpose of the random assignment is to create two groups of plants that are roughly equivalent at 
the beginning of the experiment. (d) A benefit of using only American bellflowers is that the researchers 
may then control a source of variability. Different types of flowers will have different seed masses, 
making the response more variable if other types of plants were used. A drawback to only using American 
bellflowers is that we can’t make inferences about the effect of cicadas on other types of plants, because 
other plants may respond differently to cicadas. 


AP2.25 (a) Because the sample size is large (n=50230) 
, the distribution of x — 
is approximately Normal with px =p=525 


pages. Because n=50 is less than 
10% of all novels in the library, 


ox =on=20050=28.28 


pages. We want to find P(x <500) 
(i) z=20.88 ; P(Z<-0.88)=0.1894 
(ii) P(x <500)= 


normalcdf(lower: —1000, upper: 500, mean: 


B20; 
SD: 28.28)= 0.1883 . There is a 0.1883 probability that the 
average number of pages in the sample is less than 500. (b) X is a binomial random variable with n=50 


and p=0.30 . We want to find P(X=20) 
P(X=20)=1-binomcdf(trials: 50, p : 0.30, x value: 19)=1—0.9152=0.0848 


. There is a 0.0848 probability of selecting at least 20 novels that have fewer than 400 pages. 


Chapter 8 
Section 8.1 
Answers to Check Your Understanding 


page 502: 1. We are 95% confident that the interval from 0.175 to 0.225 captures the true proportion of 
all U.S. adults who would answer the question correctly. 


2. If we were to select many random samples of U.S. adults and construct a 95% confidence interval using 
each sample, about 95% of the intervals would capture the true proportion of all U.S. adults who would 
answer the question correctly. 


3. The point estimate is 0.175+0.2252=0.20 . The margin of 


error 
is 0.225 — 0.20 = 0.025. 

4. All of the plausible values in the 95% confidence interval are less than the proportion expected if 
people were to simply guess from the four choices at random. Therefore, this interval does provide 
convincing evidence that less than 25% of all U.S. adults would answer this question correctly. 


Answers to Odd-Numbered Section 8.1 Exercises 
8.1 Sample mean, x =30.35 


8.3 Sample proportion, p\=3650=0.72 


8.5 (a) We are 95% confident that the interval from 0.63 to 0.69 captures the true proportion of all U.S. 
adults who favor an amendment to the Constitution that would permit organized prayer in public schools. 
(b) The point estimate is p\=0.63+0.692=0.66. 


The 


margin of error = 0.69 — 0.66 = 0.03. (c) Because the value 2/3 = 0.667 (and values less than 2/3) are in 
the interval, it is plausible that two-thirds or less of the population favor such an amendment. Thus, there 
is not convincing evidence that more than two-thirds of U.S. adults favor such an amendment. 

8.7 (a) Because 12 is one of the plausible values in the 95% confidence interval, there is not convincing 
evidence that the true mean volume is different from 12 ounces. (b) No; although 12 is a plausible value 
for the true mean volume of all cans of diet cola, there are many other plausible values in the confidence 
interval. Because any of these values could be the true mean, there is not convincing evidence that the true 


mean volume is 12 ounces. 


8.9 (a) We are 95% confident that the interval from 10.9 to 26.5 captures the true difference (Girls — 
Boys) in the mean number of pairs of shoes owned by all girls and boys at this school. (b) Yes; because 
the 95% confidence interval does not include 0 as a plausible value for the difference in means, there is 
convincing evidence of a difference in the mean number of shoes for boys and girls. 

8.11 If we were to select many random samples of U.S. adults and construct a 95% confidence interval 
using each sample, about 95% of the intervals would capture the true proportion of all U.S. adults who 
would favor an amendment to the Constitution that would permit organized prayer in public schools. 


8.13 (a) The confidence interval is $51,492 — $431 = $51,061 to $51,492 + $431 = $51,923. We are 90% 
confident that the interval from $51,061 to $51,923 captures the true median household income for all 
households in Arizona in 2015. (b) If we were to select many random samples of Arizona households and 
construct a 90% confidence interval using each sample, about 90% of the intervals would capture the true 
median household income for all households in Arizona in 2015. 


8.15 84% of the intervals did contain the true parameter, which suggests that these were 80% or 90% 
confidence intervals. 

8.17 (a) Incorrect; the interval provides plausible values for the mean BMI of all women, not plausible 
values for individual BMI measurements, which will be much more variable. (b) Incorrect; we shouldn’t 
use the results of one sample to predict the results for future samples. (c) Correct; a confidence interval 
provides an interval of plausible values for a parameter. (d) Incorrect; the population mean always stays 
the same, regardless of the number of samples taken. (e) Incorrect; we are 95% confident that the 


population mean is between 26.2 and 27.4, but that does not absolutely rule out any other possibility. 


8.19 (a) The length of the interval would increase. (b) The length of the interval would decrease. (c) One 
of the practical difficulties would include non-response (those who are not available to respond or those 
who refuse to answer). For example, if people who were selected but do not respond have different 
opinions than the people who did respond, the estimated proportion may be off by much more than 3 
percentage points. 

8.21 (a) If we were to select many random samples of California adults and construct a 90% confidence 
interval using each sample, about 90% of the intervals would capture the true average travel time to work 
for all employed California adults. (b) Decrease the confidence level; drawback: we can’t be as confident 
that our interval will capture the true proportion. Increase the sample size; drawback: larger samples cost 
more time and money to obtain. (c) People who have longer travel times to work might have less time to 
respond to a survey. This would cause our estimate from the sample to be less than the true mean travel 
time to work. The bias due to nonresponse is not accounted for by the margin of error, because the margin 
of error accounts for only variability we expect from random sampling. 


8.23 b 

O20 2 

8.27 (a) Observational study; pregnant women and children were not assigned to different amounts of 
exposure to magnetic fields. (b) No; we cannot make any conclusions about cause and effect because this 


was not an experiment. We can only conclude that there isn’t convincing evidence that living near power 
lines is related to whether children develop cancer. 


Section 8.2 
Answers to Check Your Understanding 


page 516: 1. p = true proportion of all U.S. adults who “often or always” got enough sleep during the last 
7 nights. 


2. The Random condition is met because the statement says that the adults were chosen randomly. The 
10% condition is met because the sample size n = 1029 is less than 10% of all U.S. adults. The Large 
Counts condition is met because np’ = (1029)(0.48) = 493.92 = 10 and n(1-p‘) 

= (1029)(1 - 0.48) = 535.08 = 10. 
3. 1-0.992=0.005 ; using Table A, the closest area is 0.0051 
(or 


0.0049), corresponding to a critical value of z* = 2.57 (or 2.58). Tech: invNorm(area: 0.005, mean: 0, SD: 
1) = -2.576, so z* = 2.576 and 0.48+2.5760.48(1—0.48)1029=0.48+0.04=(0.44,0.52) 


4. We are 99% confident that the interval from 0.44 to 0.52 captures p = the true proportion of all U.S. 
adults who would report that they “often or always” got enough sleep during the last 7 nights. 


page 520: 1. Solving 1.960.80(0.20)n<0.03 


for n gives 12(0.80)(0.20)(1.960.03)2 =682.95. 


This means that we should select a sample of at least 683 customers. This sample size is less than that 


determined in the example using P“°-°- 


2. If the company president demands 99% confidence instead, the required sample size will be larger 
because the critical value is larger for 99% confidence (2.576) versus 95% confidence (1.96). The 
company would need to select at least 1180 customers to have 99% confidence. 


Answers to Odd-Numbered Section 8.2 Exercises 
8.29 Random: Met because Latoya selected an SRS of students. 10%: Not met because the sample size 


(50) is more than 10% of the population of seniors in the dormitory (175). Large Counts: Met because 


np/=14>10 and n(1-p4) =36210 


8.31 Random: Met because the inspector chose an SRS of bags. 10%: Met because the sample of 25 is 
less than 10% of the thousands of bags filled in an hour. Large Counts: Not met because there were only 
3 successes (bags with too much salt), which is less than 10. 


8.33 (a) It is necessary to ensure that the observations are close to independent. If they aren’t, the formula 


for the standard error of p/ won’t accurately estimate the standard deviation of the sampling 
distribution of p/ . (b) The confidence interval will capture the population parameter more often than 
the specified confidence level. 

= = 0.01 
8.35 (a) 1-0.982=0.01 2 ~— ~*””; using Table A, the closest area is 0.0099, corresponding to a 


critical value of z*=2.332* = 2.33, Tech: invNorm(area: 0.01, mean: 0, SD: 1) = —2.326, so z* = 2.326. 


~ 914 _ 
\=9144579=0.1996" 4579 0.1906 0.1996+2.3260.1996(1-0.1996)4579=0.1996+0.0137=(0.1859, 0.2133 
(b) P 


(c) We are 95% confident that the interval from 0.1859 to 0.2133 captures p = the true proportion of 
American adults who have earned money by selling something online in the previous year. 


8.37 The standard error of pA=0.1996(1—-0.1996)4579=0.0059 


. In repeated SRSs of size 4579, the sample proportion of American adults who have earned money by 
selling something online in the previous year typically varies from the population proportion by about 
0.0059. 


8.39 (a) Population: All seniors at Tonya’s high school. Parameter: The true proportion of all seniors 
who plan to attend the prom. (b) Random: The sample is a simple random sample. 10%: n = 50 is less 
than 10% of the population size. Large Counts: np\=36>10 


and n(1-p) 14>10 
. (c) z*=1.645 
; pA=3650=0.72 


0.72 + 1.6450.72(0.28)50=0.72+0.104=(0.616, 0.824) 


(d) We 

are 90% confident that the interval from 0.616 to 0.824 captures p = the true proportion of all seniors at 
Tonya’s high school who plan to attend the prom. 

8.41 STATE: p = the true proportion of all U.S. adults who play video games. PLAN: One-sample z 
interval for p. Random: The adults were selected randomly. 10%: n=2001 is less than 10% 
of the population of all U.S. adults. Large Counts: np\=980.49>10 


and n(1—p4)=1020.51>10 


. DO: (0.468, 0.512). CONCLUDE: We are 95% confident that the interval from 0.468 to 0.512 captures 
p = the true proportion of U.S. adults who play video games. 


8.43 (a) We do not have enough information because we do not know the size of the sample that was 
taken from each age group. (b) Larger margin of error because this is a smaller sample size than the 
original group. 

8.45 (a) STATE: p = the true proportion of all U.S. adults who think that organic produce is better for 
health than conventionally grown produce. PLAN: One-sample z interval for p. Random: The adults were 


selected randomly. 10%: n=1480 is less than 10% of the population of all U.S. adults. 


Large Counts: np\=814>10 and 
n(1—p4)=666210 


. DO: (0.517, 0.583). CONCLUDE: We are 99% confident that the interval from 0.517 to 0.583 captures 
p = the true proportion of U.S. adults who would agree with the statement “Organic produce is better for 
health than conventionally grown produce.” (b) Yes; all of the plausible values in the interval are greater 
than 0.5, which provides convincing evidence that a majority of all U.S. adults think that organic produce 
is better for health than conventionally grown produce. 

8.47 The confidence interval is (0.616, 0.824). We expect between 61.6% and 82.4% of the 750 seniors to 
attend the prom. We can be 90% confident that the interval from 462 to 618 captures the total number of 
seniors planning to go to the prom. 


8.49 (a) Solving 2.5760.44(0.56)n<0.03 


gives n=1817 . (b) Solving 2.5760.5(0.5)n<0.03 


gives n=1844n = 1844. The conservative approach requires 27 more adults. 


/0.5(0.5) 


1.645) 


—— < 0.04 


8.51 Solving 1.6450.5(0.5)n<0.04 gives n=423n = 423, 


oo / 0.64(0.36) 
8.53 (a) Solving 0.03=z*0.64(0.36)1028  ~ V 1028 gives z*=2.002* = 2.00. The 


confidence level is likely 95%. (b) Teens are hard to reach and often unwilling to participate in surveys, 


so nonresponse bias is a major “practical difficulty” for this type of poll. If teens with TVs in their rooms 
are less likely to answer the poll (because they are watching TV in their rooms!), the estimate from the 


poll would likely be too small. 
8.50 a 
8.57 d 


8.59 (a) X follows the Normal distribution with a mean of 21.1 and a standard deviation of 1.8. We want 
to find P(X > 22). (i) z= 0.5; P(z = 0.5) = 0.3085 (ii) P(X > 22) = normalcdf(lower: 22, upper: 1000, 
mean: 21.1, SD: 1.8) = 0.3085. There is a 0.3085 probability that a randomly selected orange from this 
tree has a circumference greater than 22 cm. (b) Normal distribution; px =m=21.1 cm 


Because 20 is less than 10% of all oranges on the tree, ox =on= 


1.820=0.4025 cm. We want to find P(x | > 22), 
(i) z=2.24 : 
P(z > 2.24) = 0.0125 (ii) P(x >.22)=normalcdf(lower: 22, upper: 1000, mean: 21.1, SD: 0.4025)=0.0127 


. There is a 0.0127 probability that the mean circumference of 20 randomly selected oranges from this tree 
is greater than 22 cm. 


Section 8.3 

Answers to Check Your Understanding 

page 531: 1. df = 22 — 1 = 21, t* = 2.189. Tech: invT(area: 0.02, df: 21) = —2.189, so t* = 2.189. 

2. df = 71 — 1 = 70, t* = 2.660 (using df = 60). Tech: invT(area: 0.005, df: 70) = —2.648, so t* = 2.648. 
page 538: STATE: p = the true mean healing rate. PLAN: One-sample t interval for p. Random: The 


newts were randomly chosen. 10%: n = 18 is less than 10% of the population of all newts. Normal/Large 
Sample: The histogram below does not show strong skewness or outliers, so this condition is met. 


DO: x =25.67 and s, = 8.32; df = 17 and t* = 2.110; 25.67 + 


2.110(8.3218)=25.67+4.14=(21.53,29.81). 


CONCLUDE: We are 95% confident that the interval from 21.53 to 29.81 micrometers per hour captures 
=! = the true mean healing rate for all newts. 


zZ*—-. 
page 540: The margin of error is defined to be z*on. Vv" Using of = 154 and z* = 1.645 for 90% 


sa 5, 1.645(154) \ 2 
30 > 1.645 154, a> () = 71.3 
‘n = 
confidence, 30>1.645 154n. V Thus, n>(1.645(154)30)2=71.3 , SO take 


a sample of 72 students. 


Answers to Odd-Numbered Section 8.3 Exercises 

8.61 (a) df = 9, t* = 2.262. Tech: invT(area: 0.025, df: 9) = —-2.262, so t* = 2.262. (b) df = 19, t* = 2.861. 
Tech: invT(area: 0.005, df: 19) = —2.861, so t* = 2.861. (c) Use df = 60, t* = 1.671. Tech: invT(area: 0.05, 
df: 76) = —1.665, so t* = 1.665. 

8.63 Not met; the sample size is small (n = 28 < 30) and there are outliers in the data. We cannot assume 
that the population is approximately Normal. 

8.65 (a) Random: No; members of the AP® Statistics class are not a random sample of all students at the 
school. 10%: Yes, assuming 32 is less than 10% of all students at the school. Normal/Large Sample: Yes; 


Bee 0 . (b) Random: Yes; random sample of 100 home sales 


from the previous 6 months in her city. 10%: Yes; assume that 100 is less than 10% of all homes that 


were sold during the previous 6 months in her city. Normal/Large Sample: Yes; ae 


. Despite the fact that the boxplot showing the distribution of 
home sales is strongly right skewed with outliers, this condition is still met because the sample size is 
large. 

8.67 SEx =sxn=9.327=1.7898. 


If we take many samples of size 27, the sample mean blood pressure will typically vary by about 1.7898 
from the population mean blood pressure. 

8.69 (a) STATE: ps = the true mean percent change in BMC for breast-feeding mothers. PLAN: One- 
sample t interval. Random: The mothers were randomly selected. 10%: 47 is less than 10% of all breast- 


feeding mothers. Normal/Large Sample: n=47230 . DO: 


df = 40 (-4.575,-2.599). Tech: 
(-4.569,—-2.605) with df = 46. CONCLUDE: We are 99% confident that the interval from —4.569 to 
—2.605 captures p = the true mean percent change in BMC for breast-feeding mothers. (b) Because all of 
the plausible values in the interval are negative (indicating bone loss), the data give convincing evidence 
that breast-feeding mothers lose bone mineral, on average. 

8.71 STATE: p = the true mean weight of an Oreo cookie. PLAN: One-sample t interval. Random: The 
cookies were randomly selected. 10%: 36 is less than 10% of all Oreo cookies. Normal/Large 


Sample: n=36>30 _DO: df = 30 (11-369, 11.4152). 
Tech: (11.369, 11.415) with df = 35. 


CONCLUDE: We are 90% confident that the interval from 11.369 g to 11.4152 g captures p = the true 
mean weight for all Oreo cookies. 


8.73 STATE: p = the true mean number of pepperonis on a large pizza at this restaurant. PLAN: One- 
sample t interval. Random: Pizzas were randomly selected. 10%: 10 is less than 10% of all pepperoni 
pizzas made at this restaurant. Normal/Large Sample: The dotplot doesn’t show any outliers or strong 
skewness. 


DO: x =37.4 i sx=7.662 , and n= 10; df =9; (31.919, 42.881). 


(31.919, 42.881). 
Tech: (31.919, 42.881) with df = 9. CONCLUDE: We are 95% confident that the 


interval from 31.919 to 42.881 captures p = the true mean number of pepperonis on a large pizza at this 
restaurant. 
8.75 (a) It was necessary because the sample size was not large (less than 30). When the sample size is 
less than 30, we must assume the population is Normally distributed. (b) The value 40 is a plausible value 
found within the confidence interval, so we do not have convincing evidence that the average number of 
pepperonis is less than 40. 

7.5 


8.77 Solving 2.576 ee vn ‘ gives n > 373.26, take an SRS of 374 women. 


7 SE; = 19.03 = — = — a/R 
8.79 (a) SEx =19.03=sxn=sx23 vi V3 so sx=19.0323=52 = 19.08V 23 =91 26 


cm. 
(b) The researchers are using a critical value of t* = 1; with df = 23 —- 1 = 22, the area between t = —-1 and t 
= 1 is approximately tcdf (lower: —1, upper: 1, df: 22) = 0.67. So the confidence level is 67%. 

8.81 b 

8.83 b 


8.85 (a) P(X=7)=0.57 (b) Mean: If we were to randomly select 
many young people, the average number of days they watched television in the past 7 days would be 


about 5.44. Standard deviation: The number of days of television watched typically varies from the mean 
(5.44 days) by about 2.14 days. (c) Because n=100>30 


, we expect the mean number of days x | for 
100 randomly selected young people to be approximately Normally distributed with mean px =p=5.44 


. Because the sample size (100) is less than 
10% of all young people, ox =on=2.14100=0.214 


. We want to find P(x #@< 4.96). (i) z = —2.24; P(z < —2.24) = 0.0125 (ii) P(x %< 4.96) = 
normalcdf(lower: —1000, upper: 4.96, mean: 5.44, SD: 0.214) = 0.0124. There is a 0.0124 probability of 
getting a sample mean of 4.96 or smaller. Because this probability is small, a sample mean of 4.96 or 
smaller would be surprising. 


Answers to Chapter 8 Review Exercises 


10.94 _ 
R8.1 (a) 1-0.942=0.03 2 — 0.03. the closest area is 0.0301, corresponding to a critical value of z* = 


1.88. Tech: invNorm(area: 0.03, mean: 0, SD: 1) = -1.881. So z* = 1.881. (b) df = 57; using Table B and 
50 degrees of freedom, t* = 2.678. Tech: invT(area: 0.005, df: 57) = -2.665, so t* = 2.665. 
arene ~ 450 


R8.2 (a) The point estimate x =430+4702=450" i minutes. The margin of error = 470 — 


2.045 —— 
450 = 20; df = 29 and t* = 2.045. Thus, 20=20 =2.045 sx30 v30 and the standard error = 
Sr _ 20 


5x30=202.045= v30 2-045 9). 780 . Finally, because sx30=9.780 
, SX=9.78030= 


oie Me . (b) Incorrect; the 

confidence interval provided gives an interval estimate for the mean lifetime of batteries produced by this 
company, not individual lifetimes. (c) No; any given confidence interval either captures the population 
mean (probability = 1) or it doesn’t (probability = 0). (d) If we were to take many samples of 30 batteries 
and compute 95% confidence intervals for the mean lifetime, about 95% of these intervals will capture the 
true mean lifetime of the batteries. 
R8.3 (a) p = the proportion of all adults aged 18 and older who would say that football is their favorite 
sport to watch on television. (b) The point estimate p\=3701000=0.37 

. (c) Because of sampling variability, I 
doubt that the value of the point estimate is exactly equal to the value of p. However, because the data 
were collected from a large random sample from the population of interest, we have reason to believe that 
the value of the point estimate will be relatively close to the value of p. 
R8.4 (a) STATE: p = the true proportion of all drivers who have run at least one red light in the last 10 
intersections they have entered. PLAN: One-sample z interval. Random: Drivers were selected at random. 


10%: n = 880 is less than 10% of all drivers. Large Counts: np/ = 171 > 10 and n(1=p‘) 
= 709 > 10. DO: pA=171880=0.194 
; (0.168,0.220). 
CONCLUDE: We are 95% confident that 


the interval from 0.168 to 0.220 captures the true proportion of all drivers who have run at least one red 
light in the last 10 intersections they have entered. (b) It is likely that more than 171 respondents have run 
red lights. We would not expect very many people to claim they have run red lights when they have not, 
but some people will deny running red lights when they have. The margin of error does not account for 
these sources of bias, only sampling variability. 


R8.5 (a) STATE: p = the true mean measurement of the critical component for the engine crankshafts 
produced in one day. PLAN: One-sample ¢ interval. Random: The crankshafts are randomly selected. 
10%: n= 16 is less than 10% of all crankshafts produced in one day. Normal/Large Sample: The 
histogram shows no strong skewness or outliers. 


DO: x =224.002 , sx=0.0618 , and n=16 

Thus, df = 15 and t*=2.131. (223.969,224.035). 

(223.969, 224.035) . CONCLUDE: We are 95% confident that the interval from 223.969 to 224.035 mm 
captures p! = the true mean measurement of the critical component for engine crankshafts produced on this 
day. (b) Because 224 is in this interval, it is a plausible value for the true mean. We don’t have convincing 
evidence that the process mean has drifted. 


2576 \/ 0.5(0.5) 


—— < 0.01 


R8.6 Solving 2.5760.5(0.5)n<0.01 ,n = 16,590 adults. 


r (22) < 1000 
R8.7 Solving 1.96(3000n)<1000 vi 


R8.8 (a) If we increase the confidence level, then the margin of error must get larger to increase the 


, nN = 35 pieces of Douglas fir. 


capture rate of the intervals. (b) If we quadruple the sample size, the margin of error will decrease by a 
factor of 2. 

R8.9 (a) We use a t critical value, rather than a z critical value, when constructing a confidence interval 
for a population mean when using the sample standard deviation s, to estimate the population standard 
deviation o@. (b) As the degrees of freedom increase, the t critical values decrease and become closer and 
closer to the z critical value for a particular level of confidence. 


Answers to Chapter 8 AP® Practice Test 
T8.1a 

T8.2d 

T8.3¢ 

T8.4d 

T8.5b 

T8.6a 

T8.7 ¢ 

T8.8d 

Te.9¢ 

T8.10d 

T8.11 (a) STATE: p = the true proportion of all visitors to Yellowstone who would say they favor the 


restrictions. PLAN: One-sample z interval. Random: The visitors were selected randomly. 10%: n = 150 is 
less than 10% of all visitors to Yellowstone National Park. Large Counts: =89>10 


and n(1-p‘)=61>10 


. DO: pA=89150= 0.593 ; (0.490,0.696). 


CONCLUDE: We are 99% confident that 
the interval from 0.490 to 0.696 captures p = the true proportion of all visitors who would say that they 
favor the restrictions. (b) Because there are values less than 0.50 in the confidence interval, the U.S. 
Forest Service cannot conclude that more than half of visitors to Yellowstone National Park favor the 
proposal. It is plausible that only 49% favor the proposal. 

T8.12 (a) We are 95% confident that the interval from 317.64 to 394.56 captures p = the true mean 
number of licks it takes to get to the center of a Tootsie Pop. (b) The point estimate is 


317.64+394.562=356.1 . The margin of error is 394.56 — 
356.1 = 38.46. (c) The researcher could reduce the margin of error by decreasing the confidence level. 
The drawback is that we can’t be as confident that our interval will capture the true mean. The researcher 
could also reduce the margin of error by increasing the sample size. The drawback is that larger samples 


cost more time and money to obtain. 
T8.13 STATE: p = the true mean number of bacteria per milliliter in raw milk received at the factory. 


PLAN: One-sample t interval. Random: The data come from a random sample. 10%: n=10 is 
less than 10% of all one-milliliter specimens that arrive at the factory. Normal/Large Sample: The dotplot 
shows that there is no strong skewness or outliers. 


DO: x =4950.0@ = 4950.0, sx=268.58r = 268.5 and n=10n = 10: df =9 and tx=t” =1.833.1.833. 
(4794.37,5105.63). CONCLUDE: We 


are 90% confident that the interval from 4794.37 to 5105.63 bacteria per milliliter captures p= 
the true mean number of bacteria in the milk received at this factory. 


Chapter 9 
Section 9.1 


Answers to Check Your Understanding 


page 556: 1. HO: p=0.85 and Ha: p#0.85 

, where p= proportion of all students at Jannie’s high school 
who get fewer than 8 hours of sleep at night. 2. HO: p=10 and Ha: 
u>10 , where pi=true mean amount of time that it 


takes to complete the census form. 

page 562: 1. Type I error: The manager finds convincing evidence that less than 63% of the drive-thru 
wait times are longer than 2 minutes, when the true proportion really is 0.63. Type II error: The manager 
does not find convincing evidence that less than 63% of the drive-thru wait times are longer than 2 
minutes, when the true proportion really is less than 0.63. 


2. In this case, a Type I error is more serious because the manager will believe that the additional 


employee reduces the proportion of drive-thru customers who have to wait longer than 2 minutes to 
receive their food, when that is not the case. 


3. No; if the null hypothesis is true, a significance level of a=0.10 will result in a Type I 
error 10% of the time just by chance. Since a Type I error is more serious in this case, it would be better to 
pick a smaller value of a , such as a=0.010 = 0.01, 


4. Assuming that the true proportion of all drive-thru customers who have to wait longer than 2 minutes to 
receive their food after placing an order is 0.63, there is a 0.0385 probability of getting a sample 
proportion of 0.576 or less who have to wait longer than 2 minutes just by chance in a random sample of 
250 drive-thru customers. 


Answers to Odd-Numbered Section 9.1 Exercises 
9.1 HO: p=0.75440 > p = 0.795; Ha: p<0.75 4a :p < 0.75, where p = the true proportion of the students at 
Mr. Tabor’s school who completed their math homework last night. 
9.3 HO: u=180/40 > = 180. Ha: 14180Le ‘#180 where = = the true mean volume of liquid 
dispensed by the machine. 
9.5 HO: o=3440 : o = 3. Ha: o>3a : o > 3 where o= the true standard deviation of the 
temperature in the cabin. 
9.7 (a) The null hypothesis is always that there is “no difference” or “no change” and the alternative 
hypothesis is what we suspect is true. These ideas are reversed in the stated hypotheses. Correct: HO: 
p=0.37 ; Ha: p>0.37 (b) 
Hypotheses are always about population parameters. However, the stated hypotheses are in terms of the 
sample statistic. Correct: HO: p=3000 grams; Ha: p<3000 

grams. 
9.9 (a) If HO: p=0.75 is true, then the proportion of all students at Mr. 
Tabor’s school who completed their homework last night is 0.75. (b) Assuming that the proportion of all 
students at Mr. Tabor’s school who completed their homework last night is 0.75, there is a 0.1265 
probability of getting a sample proportion of 0.68 or less just by chance in a random sample of 50 students 
at the school. 


9.11 Assuming the true mean volume of liquid dispensed by the machine is 180 ml, there is a 0.0589 
probability of getting a sample mean at least as far from 180 as 179.6 (in either direction) just by chance 
in arandom sample of 40 bottles filled by the machine. 

9.13 The student forgot to include the conditions and the direction in the interpretation. Assuming the null 
hypothesis is true, there is a 0.044 probability of getting the sample result I did or one even larger by 
chance alone. 


9.15 Because the P -value of 0.1265 is greater than a=0.05 , we fail to reject HO 


. We do not have convincing evidence that the true proportion of students at Mr. Tabor’s 
school that completed their math homework last night is less than 0.75. 


9.17 (a) Because the P -value of 0.0589 is less than a=0.10 , we reject HO 


. We have convincing evidence that the true mean volume of liquid dispensed by the 
machine is different from 180 ml. (b) Yes; because the P -value of 0.0589>a=0.05 


, we would fail to reject HO . We do not 
have convincing evidence that the true mean volume of liquid dispensed by the machine is different from 
180 ml. 


9.19 It is never correct to “accept the null hypothesis.” If the P-value is large, the data do not provide 
convincing evidence that the alternative hypothesis is true. However, lacking evidence for the alternative 
hypothesis does not provide convincing evidence that the null hypothesis is true. 


9.21 (a) HO: p=1 ; Ha: p<1 , where 
L= the true mean weight (in pounds) of bread loaves produced at the bakery. (b) There is some 
evidence for the alternative hypothesis because the mean weight of an SRS of bread loaves is only 0.975 
pound, which is less than what the mean is supposed to be (1 pound). (c) Assuming that the true mean 


weight of bread loaves produced at the bakery is 1 pound, there is a 0.0806 probability of getting a sample 
mean of 0.975 pound or less just by chance in a random sample of 50 bread loaves. (d) Because the P- 


value of 0.0806>a0=0.01 , we fail to reject HO 


. We do not have convincing evidence that the true mean weight for all loaves of bread 
produced at the bakery is less than one pound. 


9.23 Type I: You find convincing evidence that the mean income of all residents near the restaurant 
exceeds $85,000 when in reality it does not. Consequence: You will open your restaurant in a location 
where the residents will not be able to support it, so your restaurant may go out of business. Type IT: You 
do not find convincing evidence that the mean income of all residents near the restaurant exceeds $85,000 
when in reality it does. Consequence: You will not open your restaurant in a location where the residents 
would have been able to support it and you lose potential income. 

9.25 (a) A Type I error would be finding convincing evidence that the proportion of all calls in which first 
responders took more than 8 minutes to arrive had decreased when it really hadn’t. A Type IJ error would 
be not finding convincing evidence that the proportion of all calls in which first responders took more 
than 8 minutes to arrive decreased when it really had. (b) A Type I error would be worse because the city 
would overestimate the ability of the emergency personnel to get to the scene quickly and people may end 


up dying. (c) The probability of a Type I error is a @ 0.05 ; because the consequence may be 
the difference between life and death, the manager should use a value for a that is lower, such as 
a=0.01 

9.27 (a) HO: p=0.10 ; Ha: p>0.10 ' 


where p= the true proportion of all students at Simon’s school that are left-handed. (b) Based on 
the simulation results, 24 of the 100 simulated trials yielded a sample proportion of 0.16 or greater, so the 


P -value is approximately 24/100=0.24 . Assuming that the true 
proportion of all students at Simon’s school that are left-handed is 0.10, there is a 0.24 probability of 
getting a sample proportion of 0.16 or greater just by chance in a random sample of 50 students. (c) Use 
a=0.05a = 0.05. Because the PP-value of 0.24>a=0.050.24 > a = 0.05, we fail to reject Holo. We 
do not have convincing evidence that the true proportion of all students at Simon’s school that are left- 


handed is greater than 0.10. 
9.29 d 
9.31 ¢ 


9.33 (a) P(degree earned by a woman)=0.4168" (degreeearnedbyawoman) = 0.4168. approximately 
(24,611)\24; 611) (0.4168)=10,258(9-4168) = 10, 258 mathematics degrees were awarded to women. 
(b) Not independent; P(woman) = 0.4168, which is not equal to P(woman | bachelors) = 0.43. (c) P(at 


least 1 of the 2 degrees earned by a woman) = 1 —P(neither degree is earned by a woman) = 
1-(14,35324,611)(14,35224,610) 


=0.6599 


Section 9.2 
Answers to Check Your Understanding 


page 576: 1. STATE: HO: p=0.20 ; Ha: p>0.20 


, where p__ is the true proportion of all teens at the school who 
would say they have electronically sent or posted sexually suggestive images of themselves, using a=0.05 


. PLAN: One-sample z__ test for p. Random: We have a random sample of 250 
students from the school. 10%: 250 is less than 10% of the 2800 students at the school. Large Counts: 
np0=250(0.2)=50>10 


and n(1—p0)=250(0.8) @ 


200$10 . DO: pA=63250=0.252 
; Z=0.252-0.200.20( 0.80)250=2.06 


; P-value = 0.0197. CONCLUDE: Because the P -value of 0.0197< a=0.05 


, we reject HO . We have convincing evidence that more than 20% of the 
teens in her school would say they have electronically sent or posted sexually suggestive images of 
themselves. 


page 580: 1. STATE: HO: p=0.75 ; Ha: p40.75 


, where p__ is the true proportion of all restaurant employees at this 
chain who would say that work stress has a negative impact on their personal lives using a=0.05 


. PLAN: One-sample z__ test for p. Random: We have a random sample of 100 
employees from the large restaurant chain. 10%: Assume the sample size (100) is less than 10% of all 
employees at this large restaurant chain. Large Counts: np0=""P0 ~ 100(0.75)=75$10 
100 (0.75) = 75$10 and n(1-p0)"* (1 — po) =100(0.25)=— 100 (0.25) =95,.1925 > 10 po: 

0.68—0.75 


/0.75(0.25) 


= 0.68-0.750.75( 0.25)100 Ys =-1,62= ~1.62: p-value 


- _ 68 
p\=68100=0.68" ~ 100 
= 0.1052. 


= 0.68 
5Z 


CONCLUDE: Because the PP-value of 0.1052>a=0.100.1052 > a = 0.10, we fail to reject HO/40. We 
do not have convincing evidence that the true proportion of all restaurant employees at this large 
restaurant chain who would say that work stress has a negative impact on their personal lives is different 
from 0.75. 


2. The confidence interval gives the values of p? that are plausible based on the sample data; 0.75 is a 
plausible value because it is within the 90% confidence interval. So based on the confidence interval, we 


would fail to reject p=0.75P? = 0.79 at the a=0.10 significance level. The 90% 
confidence interval provided gives an approximate set of pO ’s that would not be rejected by a 
two-sided test at the a=0.10 significance level. A two-sided test only allows us to reject 


(or fail to reject) a hypothesized value for a particular population parameter, where a confidence interval 
provides a set of hypothesized values that would not be rejected based on a two-sided test. 


Answers to Odd-Numbered Section 9.2 Exercises 
9.35 Random: SRS of 60 students from a large rural high school. 10%: 60 is less than 10% of all students 
at this large high school. Large Counts: np0=60(0.80)=48>10 


and n(1-p0)=60(0.20)=12>10" (1 — Po) = 60 (0.20) = 12 > 10. 


snpo = 10 (0.5) = 5 and failures 


9.37 The expected number of successes np0=10(0.5)= 
n(1—p0)=10(0.5)=5" (1 — po) = 10 (0.5) = 5 are both less than 10, so the Large Counts condition is not 
met. 


9.39 (a) The sample result gives some evidence for Ha:a : p<0.80P = 0.80 because p/=4160=0.683 


Be 7 = 0.683 which is less than 0.80. 

z= Se = -2.27 
(b) z=0.683-0.800.80( 0.20)60=—2.27 V a ; P-value=0.0116P-value = 0.0116. (c) 
Because the P-value of 0.0116<a=0.05 , we reject HO 


. We have convincing evidence that the true proportion of all students at this large rural 
high school who have a computer at home is less than 0.80. 


9.41 (a) The P-value is P(z>2.19)=0.0143 . Assuming 
that the true population proportion is 0.5, there is a 0.0143 probability of getting a sample proportion as 


large as or larger than the one observed just by chance in a random sample of size n=200 


(b) a=0.01 ; because the P-value of 0.0143>a=0.01 
, we fail to reject HO . There is not 
convincing evidence that p>0.5 . For a=0.05 , yes! The P-value of 
0.0143<a=0.05 , SO this time we will reject HO 
. There is convincing evidence that p>0.5 . (c) Solve for p/ ; 


2.19=p\—0.50.5( 0.5)200 = pA=0.5774 


9.43 STATE: HO: p=0.75 ; Ha: p>0.75 
, where p= the true proportion of all middle school students 


who engage in bullying behavior, using a=0.05 . PLAN: One-sample z test for p. 
Random: We have a random sample of 558 middle school students. 10%: 558 is less than 10% of all 
middle school students. Large Counts: npO=558(0.75)=418.5>10 


and n(1—p0)=558(0.25)=139.5>10 


. DO: z=0.797-0.750.75( 0.25)558=2.56 


; P-value=0.0052 . CONCLUDE: Because the P-value of 0.0052<a=0.05 


, we reject HO . We have convincing 
evidence that the true proportion of all middle school students who engage in bullying behavior is greater 
than 0.75. 

9.45 (a) Type I: Finding convincing evidence that more than 37% of students were satisfied with the new 
parking arrangement, when in reality only 37% were satisfied. Consequence: The principal believes that 
students are satisfied and takes no further action. Type IJ: Failing to find convincing evidence that more 
than 37% are satisfied with the new parking arrangement, when in reality more than 37% are satisfied. 
Consequence: The principal takes further action on parking when none is needed. (b) STATE: HO: p=0.37 
; Ha: p>0.37 , where p=the true 
proportion of all students who are satisfied with the parking situation after the change 
using a=0.05 . PLAN: One-sample z2 test for p. Random: We have an SRS of 200 
students from the school. 10%: 200 is less than 10% of the population of size 2500. Large Counts: 
np0=200(0.37)=74>10"Po = 200 (0.37) = 74 > 10 and n(1-p0)=200(0.63)=126>10 


i —— = 1.32 
/ 0.37(0.63) 
n(1— po) = 200 (0.63) = 126 > 10 Ho; 7=0.415-0.370.37( 0.63)200=1.32  V ; P- 


value=0.0934P-value = 0.0934, 


CONCLUDE: Because the PP-value of 0.0934>0=0.050.0934 > a = 0.05, we fail to reject HO40. We 
do not have convincing evidence that the true proportion of all students who are satisfied with the parking 
situation after the change is greater than 0.37. 


9.47 A lower-tailed test (HO: p=0.50; Ha: p<0.50) 


will never reject the null hypothesis when the sample proportion is greater than the hypothesized value. 
9.49 (a) HO: p=0.75; Ha: p#0.75 
, where p=the true 
proportion of peas that will be smooth using a=0.05 
(b) p4=423423+133=423556=0.761 


2=0.761-0.750.75( 0.25)556=0.60 


P-value=0.5568 . (c) Assuming that the true proportion of smooth peas is 


0.75, there is a 0.5568 probability of getting a sample proportion as different from 0.75 as 0.761 (in either 
direction) just by chance in a random sample of 556 peas. CONCLUDE: Because the P-value of 
0.5568>a=0.05 , we fail to reject HO 
We do not have convincing evidence that the true proportion of peas that are smooth is different from 
0.75. 
9.51 STATE: HO: p=0.60; Ha:p#0.60 
, where p=the true 

the true proportion of teens who pass their driving test on the first attempt using o=0.05 

. PLAN: One-sample z__ test for p. Random: We have an SRS of 125 teens. 10%: 125 
is less than 10% of all teens who take the driving test. Large Counts: np0=125(0.60)=75=>10 
npy = 125 (0.60) = 75 > 10 og n(1—p0)=125(0.40)=50>10" (1 — po) = 125 (0.40) = 50 >= 10 po. 

0,688—0.60 


z= = 2.01 


! 0.60(0.40) 


7=0.688-0.600.60(0.40)125=2.01 \ = 
P-value=0.0444P-value = 0.0444, CONCLUDE: Because the PP-value of 0.0444<a=0.05 


0.0444 < a = 0.05, we reject HO . There is convincing evidence that the true proportion 
of teens who pass the driving test on their first attempt is different from 0.60. 


9.53 (a) STATE: p=the true proportion of teens who pass their driving test on the first 
attempt. PLAN: One-sample z interval for p. The conditions have been met. DO: (0.607,0.769) 

. CONCLUDE: We are 95% confident that the 
interval from 0.607 to 0.769 captures the true proportion of teens who pass the driving test on the first 
attempt. (b) The confidence interval gives the values of p that are plausible based on the sample data; 0.60 
is not a plausible value because it falls outside the 95% confidence interval. So based on the confidence 


interval, we would reject p=0.60 at the a=0.05 significance level. The 
95% confidence interval provided gives an approximate set of p0 *s that would not be rejected 
by a two-sided test at the a=0.05 significance level. A two-sided test only allows us to 


reject (or fail to reject) a hypothesized value for a particular population parameter. 


9.55 No; because the value 0.17 is included in the confidence interval, it is a plausible value for the true 
proportion of U.S. adults who would say they use Twitter. In other words, we do not have convincing 
evidence that the true proportion of U.S. adults who would say they use Twitter differs from 0.17. 


9.57 (a) p=the true proportion of undergraduates at this large university who would be 
willing to report cheating by other students. (b) Random: We have an SRS of 172 undergraduate students. 
10%: 172 is less than 10% of all undergraduates at this large university. Large Counts: 
np0=172(0.15)=25.8>10 


and n(1—p0)=172(0.85)=146.2>10 


. (c) The P-value is 0.146. Assuming that the true proportion of undergraduates at this large university 
who would report cheating is 0.15, there is a 0.146 probability of getting a sample proportion that is at 


least as different from 0.15 (in either direction) as p\=0.11 . (d) No; because the P-value 


of 0.146>a=0.05 , we fail to reject HO 
We do not have convincing evidence that the true proportion of undergraduates at this large university 
who would report cheating differs from 0.15. 


9.59 c¢ 
9.61 b 


9.63 (a) X-Y has a Normal distribution with mean pX-Y=pX-pY=5.3-5.26=0.04 


and standard deviation oX-Y=0 X2+o0 Y2=(0.01)2+(0.02)2=0.0224 


ox_y =,/02. +02 = 4/(0.01)? + (0.02)? = 0.0224 5 
sale yx : v ( ) .X-YX — Y must take on a positive number. 


(b) We want to find P(x-Y>0)" (X—Y>0). 
(i) 2=-1.792 = —1.79. p(z>-1.79)=0.9633P (2 > —1.79) = 0.9633 (i) 


normalcdf (lower: 0, upper: 1000, mean: 0.04, SD: 0.0224)=0.9629 
normalcdf (lower: 0, upper: 1000, mean:0.04,SD:0.0224) = 0.9629 hore is a 0.9629 probability 


that a randomly selected CD will fit in a randomly selected case. (c) P(all fit)\=(0.9629)100=0.0228 


; there is a 0.0228 probability that all 100 CDs will fit in their cases. 


Section 9.3 
Answers to Check Your Understanding 
page 591: 1. HO: p=320; Ha: 4320 
, where p!=the true 

mean amount of active ingredient (mg) in Aspro tablets from this batch of production 
using a=0.05 
2. Random: We have a random sample of 30 tablets. 10%: Assume the sample of size 30 is less than 10% 
of the population of all tablets in this batch. Normal/Large Sample:n=302>30 

. All conditions are met. 

3. t=319-320330=-1.83 


; P-value=0.0775 


4. Because the P-value of 0.0775>a=0.05 , we fail to 


reject HO . There is not convincing evidence that the true mean amount of the active 
ingredient in Aspro tablets from this batch of production differs from 320 mg. 
page 595: STATE: HO: p=8; Ha: p<8 


, where p!=the true 


mean amount of sleep that students at the teacher’s school get each night using a=0.05 


. PLAN: One-sample t test for 1. Random: The teacher selected a random sample of 
28 students. 10%: Assume the sample size (28) is less than 10% of the population of students at this 
school. Normal/Large Sample: There were only 28 students, so we need to examine the sample data. The 
histogram indicates that there is not much skewness and no outliers, so it is reasonable to use a t 
procedure. 


DO: t=6.643-81.98128=—3.625 


; P-value=0.0006P-value = 0.0006, 

CONCLUDE: Because the P-value of 0.0006<a=0.050.0006 < a@ = 0.05, we reject Ho/Z0. There is 
convincing evidence that students at this university get less than 8 hours of sleep, on average. 

page 597: 1. HO: p=128/0 : # = 128. Ha: y2128Me * HF 128 where p=the true = the true mean 
systolic blood pressure for the company’s middle-aged male employees. 


2. Random: The director examines the medical records of a random sample of 72 male employees in this 
age group. 10%: The sample size (72) is less than 10% of the population of middle-aged male employees 


at this large company. Normal/Large Sample: n=72>30 


3. The 95% confidence interval does include 128 as a plausible value, so we fail to reject HO 


at the a=0.05 significance level. We do not have convincing evidence 
that the true mean systolic blood pressure for the company’s middle-aged male employees is different 
from 128. 
page 603: 1. If the true mean percent change in TBBMC during the exercise program is p=1 


, there is a 0.80 probability that the researchers will find convincing evidence for Ha: p>0 


2. P(Type I error)=a=0.05 ; P(Type 
II error)=1—Power=1-0.80=0.20 


3. The researchers could increase the power of the test in Question 1 by increasing the significance level 


(a) or increasing the sample size (n) 


Answers to Odd-Numbered Section 9.3 Exercises 
9.65 Random: The teacher selected an SRS of 45 students. 10%: 45 is less than 10% of the population 
size of 1000. Normal/Large Sample: n=45>30 
9.67 (a) HO: p=10.5; Ha: p<10.5 

, where p!=the true 

mean battery life when playing videos for all tablets. (b) Random: A random sample 

of 20 tablets was selected. 10%: 20 is less than 10% of the population of all tablets. Normal/Large 
Sample: The sample size is less than 30 and the dotplot of the distribution of battery life is strongly 
skewed to the right. This condition is not met. 
9.69 (a) The sample result gives some evidence for the alternative hypothesis because x _=62.8464 


. (b) t=62.8-645.3625=-1.12 


P-value: ; Table B: df=25-1=24 P-value is between 


0.20 and 0.30. Tech: 2-tcdf(lower: —1000, upper: —1.12, df: 24)=0.2738 
2 - tedf (lower: — 1000, upper : — 1.12, df : 24) = 0.2738 


secs Barus (94 
9.71 (a) t=125.7-11529.845=2.41 v5 (b) P-value: df=45-1=44df = 45 — 1 = 44. 
Table B: Using df=40df = 40, the P-value is between 0.01 and 0.02. Tech: (c) tcdf(lower: 2.41, 
upper: 1000, df: 44)=0.0101t¢dE{lower:2.41 upper :1000,df : 44) = 0.0101 Because the P-value of 
0.0101<a=0.0101 < a =0,050.05, we reject HO/0. We have convincing evidence that the true mean 


SSHA score in the population of students at her college who are at least 30 years old is greater than 115. 


9.73 (a) STATE: HO: p=25 ; Ha: p>25 


, where pi=the true the true mean speed of all 


drivers in a construction zone using a=0.01 . PLAN: One-sample t__ test for p 
Random: The 10 drivers were randomly selected. 10%: 10 is less than 10% of all drivers in a construction 
zone. Normal/Large Sample: Because the sample is small, we need to graph the sample data. There is no 


strong skewness or outliers in the sample, so it is reasonable to use a t__ procedure. 


DO: x =28.8 and 
; df=9 and P-value=0.0069 
CONCLUDE: Because the P-value of 0.0069<a=0.01 


, we reject HO . We 
have convincing evidence that the true mean speed of all drivers in the construction zone is greater than 


25 mph. (b) Because we rejected HO , it is possible we made a Type I error—finding 
convincing evidence that the true mean speed is greater than 25 mph when it really isn’t. 

9.75 STATE: HO: p=5 ; Ha: <5Ha ‘<9 where p=! = the true 
mean reading level of all pages in this novel using a=0.05a = 0.05, PLAN: One-sample ¢ test for p. 


Random: Random sample of 40 pages from this novel. 10%: Assume the sample size (40) is less than 
10% of all pages in this novel. Normal/Large Sample:n=40230" = 40 > 30. Do: x =4.8% = 4.8, 


sx=0.852 = 0.8. t=-1.58 ; df=39 and P-value =0.0610 
CONCLUDE: Because the P-value of 0.0610>a=0.05 ; 


we fail to reject HO . There is not convincing evidence that the true mean reading level for 
this novel is less than 5. 


9.77 STATE: HO: p=11.5 ; Ha: p#11.5 
, where p=the true mean hardness of the tablets 


using a=0.05 . PLAN: One-sample t _ test for 1 _. Random: Random sample of 20 
tablets from one large batch. 10%: 20 is less than 10% of all tablets in the batch. Normal/Large Sample: 
Because the sample is small, we need to graph the sample data. There is no strong skewness or outliers in 
the sample, so it is reasonable to use a t procedure. 


DO: x =11.5164 and sx=0.095; t=0.77 

; df=19 and P-value=0.4494 
. CONCLUDE: 0.4494>0=0.05 Because the P-value 
of HO , we fail to reject p= . We do not have convincing evidence that the true 


mean hardness of these tablets is different from 11.5. 

9.79 (a) STATE: p= the true mean hardness for this type of pill. PLAN: One-sample t 
interval for #4. The conditions are met. DO: x =11.516@ = 11.516, sx=0,09582 = 9-095, and n=20 
n = 20; df=19df = 19; tx=2,093¢" = 2.093, (11.472,11.561)(41-472, 11.561) CONCLUDE: We are 


95% confident that the interval from 11.472 to 11.561 captures the true mean hardness measurement for 
this type of pill. (b) The confidence interval agrees with the test done in Exercise 77. Both give 11.5 asa 
plausible value for the true mean hardness i“. The confidence interval, however, gives other plausible 
values as well. 
9.81 (a) HO: p=200 milliseconds/o : # = 200 milliseconds. 444-1,4200 milliseconds 

, where p!=the true mean response 
time of European servers. (b) Random: Random sample of 14 servers in Europe. 10%: 14 is less than 10% 
of all servers in Europe. Normal/Large Sample: The sample size is small, but the problem states that a 
graph of the data reveals no strong skewness or outliers, so it is reasonable to use a t__ procedure. (c) 
Because the 95% confidence interval does not contain 200, we reject HO at the a=0.05 

significance level. We have convincing evidence that the mean response time of 

European servers is different from 200 milliseconds. (d) No! We cannot draw any conclusions about the 


mean response time of servers in the United States because we only collected information from a random 
sample of servers in Europe. 


9.83 (a) Yes; because the P -value of 0.06>a=0.05 , 


we fail to reject HO: p=10 at the 5% level of significance. The 95% 
confidence interval will include 10. (b) No; because the P-value of 0.06<a=0.10 


, we reject HO: p=10 at 
the 10% level of significance. The 90% confidence interval would not include 10 as a plausible value. 


9.85 If the true proportion of potatoes with blemishes in a given truckload is p=0.11 , there 
is a 0.764 probability that the company will find convincing evidence for Ha: p>0.08 


9.87 (a) Power will increase. Using a larger significance level makes it easier to reject HO 


when Ha is true. (b) Power will decrease. A smaller sample size gives less information about 
the true proportion p. (c) Power will decrease. It is harder to detect a smaller difference between the null 
and alternative parameter value. 

9.89 (a) A disadvantage is that the larger significance level will increase the probability of a Type I error. 
(b) A disadvantage is that the larger sample size would require more time and money. 

9.91 (a) If the true mean breaking strength of this company’s classroom chairs is p=294 — 294 pounds, 
there is a 0.71 probability that I will find convincing evidence for Ha:p<30044 ai p< 300. (b) 

P(Type I error)=a=0.05" (Type L error) = a = 0.05 ang P(Type II error)=1—0.71=0.29 

P (Type I error) = 1 — 0.71 = 0.29 (c) The power would increase by increasing the sample size or 
using a larger significance level. 


9.93 (a) Power=1-P(Type II error)=1-0.14=0.86P ower = 1— P(Type I error) = 1 — 0.14 = 0.86 
(b) P(Type I error)=a=0.01? (Typelerror) = a = 0.01 
9.95 (a) No; in a sample of size n=500” = 500, we expect to see about (500)(0.01)=5 


people who 
do better than random guessing, with a significance level of 0.01. These four might have ESP, or they 


may simply be among the “lucky” ones we expect to see just by chance. (b) The researcher should repeat 
the procedure on these four to see if they again perform well. 

9.97 Although the hypothesis test shows that the results are statistically significant (we have convincing 
evidence that p>128 ), this is not practically significant. A test with such a large 
sample size will often produce a significant result for a very small departure from the null value. There is 
little practical significance to an increase in average SAT score of only 2 points. 

9.99 It would not be wise to use these data to carry out a significance test about the mean amount spent by 
all shoppers at the supermarket because this sample wasn’t randomly selected—it was a convenience 
sample. Depending on the time of day or the day of the week, certain types of shoppers may be 
underrepresented. 


9.101 (a) Shape: Because np=500(0.08)=40>10 
and n(1—p)=500(0.92)=460=10 


, the sampling distribution of p/ is approximately Normal. Center: The mean of the sampling 
distribution of p/ is equal to the population proportion; :p/=p=0.08 


. Variability: The sample size (500) is less than 10% 
of the population of all potatoes, so the 10% condition has been met. opA=0.08(0.92)500=0.0121 


. (b) (i) 0.05 area to the right of zs z=1.645 . Solving 
1.645=p/—0.080.0121 gives p\=0.0999 


(ii) invNorm(area: 0.95, mean: 0.08, SD: 0.0121)=0.0999 
(c) Shape: Because np=500(0.11)=55$10 


and n(1—p)=500(0.89)=445>10 


, the sampling distribution of p\P is approximately Normal. Center: The mean of the sampling 
distribution of p\P is equal to the population proportion; up\=p=0.11' = an Variability: The 
sample size (500) is less than 10% of the population of all potatoes, so the 10% condition has been met. 

/0.11(0.89) 

~=4/ = 0.0140 

opA=0.11(0.89)500=0.0140°” V 500 , 

» — 9.0999-0.11 _ _ 7 
(d) (i) z=0.0999-0.110.0140=-0.72" 0.0140," s P(z>—-0.72)=0.7642 
P(z > —0.72) = 0.7642 


(ii) P(p4=0.0999)=normalcdf(lower: 0.0999, upper: 1000, pmean: 0.11, SD: 0.014)=0.7647. 
P (p > 0.0999) = normalcdf(lower:0.0999 upper:1000, pmean:0.11,SD:0.014) = 0.7647. 


The power of the test to detect p=0.11 is 0.7647. 


9.103 a 
9.103 c 
9.107 a 


9.109 (a) Not included; the margin of error does not account for undercoverage. (b) Not included; the 
margin of error does not account for nonresponse. (c) Included; the margin of error does account for 
sampling variability. 


Answers to Chapter 9 Review Exercises 
R9.1 (a) HO: p=64.2 ; Ha: p#64.2 
, where p!=the true mean height of this year’s 


female graduates from the local high school. The sample data give some evidence for Ha 


because x =63.57 64.2 . (b) HO: p=0.25 
; Ha: p>0.25 , where p=the true 
proportion of all students at this school who have played/danced in the rain. The sample 
data give some evidence for Ha because pA=2880= 
0.35>0.25 


R9.2 (a) Appropriate test: One-sample t test for p —_. Random: We have an SRS of 48 female graduates 
from this high school. 10%: Assume the sample size (48) is less than 10% of all female graduates from 


this large high school. Normal/Large Sample: n=48>30 


. (b) Appropriate test: One-sample z__ test for p. Random: We have a random sample of 80 students from 
this school. 10%: Assume the sample size (80) is less than 10% of all students at this school. 
Normal/Large Sample: np0=80(0.25)=20>10 


and n(1-p0)=80(0.75)=60=10 


R9.3 (a) t=-1.31 ; P-value=0.1963 

CONCLUDE: We will use a=0.05 . Because the P-value of 0.1963>a=0.05 

0.1963 > a = 0.05, we fail to reject HO/0. There is not convincing evidence that the true mean height 
for all female graduates from this high school differs from the national average of 64.2 inches. 

(b) z=2.072 = 2.07; P-value = 0.0194P-value = 0.0194 CONCLUDE: We will use a=0.050 = 0.05, 
Because the P-value of 0.0194<a=0.050.0194 < a = 0.05, we reject HOM0. We have convincing 
evidence that the true proportion of all students at this school who have played/danced in the rain at some 
point in their lives is greater than 0.25. 


R9.4 STATE: HO: p=22; Ha: p>22 
, where p=the mean 
amount of time (in seconds) it takes for adults to read four paragraphs of text in the 


ornate font Gigi. We will use a=0.05 . PLAN: One-sample t test for 1) _. Random: We 


have a random sample of 24 adults. 10%: 24 is less than 10% of the population of adults. Normal/Large 
Sample: Because the sample size is small, we need to graph the sample data. The histogram below shows 
that the distribution is roughly symmetric with no outliers, so using a t procedure is appropriate. 


DO: x =26.496 , SX=4.728 ; t=4.66 > df=23 
and P-value=0.000054 . CONCLUDE: Because the P- 
value of 0.000054<a=0.05 , we reject HO 


. There is convincing evidence that the mean amount time it takes to read four paragraphs 
of text in the ornate font Gigi is greater than 22 seconds. 
R9.5 (a) HO: p=0.05 ; Ha: p<0.05 F 
where p___ is the true proportion of adults who will get the flu after using the vaccine. We will use a=0.05 


. (b) Type I: Finding convincing evidence that less than 5% of patients would get the 
flu, when in reality at least 5% would. Consequence: The company might get sued for false 
advertisement. 


Type II: Failing to find convincing evidence that less than 5% would get the flu, when in reality less than 
5% would. Consequence: Loss of potential income. (c) Because a Type I error is more serious in this case, 


and P(Type I error)=a , | would recommend a significance level of 


a=0.01 to minimize the possibility of making this type of error. (d) If the true 


proportion of adults who will get the flu after using the vaccine is p=0.03P? ~ - 0.03 | there is a 0.9437 
probability that the researchers will find convincing evidence for Ha: p<0.05/4a: a p < 0.05, (e) The power 
could be increased by increasing the sample size or increasing the significance level o@. 

R9.6 STATE: HO: p=0.05/40 >p = 0.05. Ha: p<0.054a :p < 0.05 | where p=the true? = the true 
proportion of adults who will get the flu after using the vaccine. We will use a=0.05 

PLAN: One-sample z test for p. Random: We have a random sample of 1000 adults. 10%: 1000 is i 
than 10% of the population of adults. Large Counts: np0=1000(0.05)=50210 


and n(1—p0)=1000(0.95)=950$10 


. DO: z=-1.02 ; P-value=0.1549 . CONCLUDE: 
Because the P-value of 0.1549>a=0.05 , we fail to 
reject HO . We do not have convincing evidence that the true proportion of adults who will 


get the flu after using the vaccine is less than 0.05. 

R9.7 (a) Assuming that the roulette wheel is fair, there is a 0.0384 probability that we would get a sample 
proportion of reds (p4=31/50) at least this 
different from the expected proportion of reds (18/38) by chance alone. (b) Because the P-value of 
0.0384<a=0.05 , we reject HO . We have 
convincing evidence that the true proportion of reds is not equal to 18/38=0.474 


. (c) Because 18/38=0.474 is one of the plausible 
values in the interval, this interval does not provide convincing evidence that the wheel is unfair. It does 
not, however, prove that the wheel is fair as there are many other plausible values in the interval that are 
not equal to 18/38. Also, the conclusion here is inconsistent with the conclusion in part (b) because the 


manager used a 99% confidence interval, which is equivalent to a test using a=0.01 If 
the manager had used a 95% confidence interval, 18/38 would not be considered a plausible value. 


Answers to Chapter 9 AP® Practice Test 
T9.1b 
T9.2e 
19.3¢ 
T9.4e 
T9.5b 
T9.6¢ 
T9.7e 
T9.8d 
T9.9a 
T9.10 e 


T9.11 (a) STATE: HO: p=0.20 ; Ha: p>0.20 


, where p=the true proportion of customers who 


would pay $100 for the upgrade using a=0.05 . PLAN: One-sample z test for p. 
Random: We have a random sample of 60 customers. 10%: 60 is less than 10% of this company’s 
customers. Large Counts: np0=60(0.20)=12>10 


and n(1—p0)=60(0.8)=48>10 


. DO: z=1.29 : P-value=0.0985P-value = 0.0985 


CONCLUDE: Because the P-value of 0.0985>a=0.050.0985 > a = 0.05, we fail to reject HO/40. We do 
not have convincing evidence that the true proportion of customers who would pay $100 for the upgrade 
is greater than 0.20. (b) A Type I error would be finding convincing evidence that more than 20% of 
customers would pay for the upgrade, when in reality they would not. A Type II error would be not 
finding convincing evidence that more than 20% of customers would pay for the upgrade, when in reality 
more than 20% would. For the company, a Type I error is worse because they would go ahead with the 
upgrade and lose money. (c) If the true proportion of customers who would pay $100 for the upgrade is 
p=0.30P — 0.30 there is a 0.60 probability that you will find convincing evidence for Ha: p>0.20 

Hy: p > 0.20. 

T9.12 (a) HO: p=41.940°# = 41.9. Ha: <41.9He * # < 41-9 where p=the trueM = the true mean age 
of all employees of a large technology company. (b) Random: We have a random sample of 12 employees 
from this company. 10%: 12 is less than 10% of all employees of this large technology company. 
Normal/Large Sample: Because the sample is small, we need to graph the sample data. There is no strong 
skewness or outliers in the sample, so it is reasonable to use a t procedure. 


(c) Assuming that the true mean age of all employees at this large technology company is 41.9 years, 
there is a 0.003 probability of getting a sample mean of x =34.833 or less just by 
chance in a random sample of 12 employees. CONCLUDE: We will use a=0.05 : 
because the P-value of 0.003<a=0.05 , we reject HO 

. We have convincing evidence that the true mean age of all employees of this large 
technology company is less than 41.9 years. 
T9.13 STATE: HO: p=$158 ; Ha: p#$158 

, where pt =the true mean amount 

spent on food by households in this city. We will perform the test at the a=0.05 


significance level. PLAN: One-sample t test for ) _. Random: We have a random sample of 50 
households. 10%: 50 is less than 10% of households in this small city. Normal/Large Sample: n=50230 


. DO: t=2.47; df=49 
and P-value=0.0170 
CONCLUDE: Because the P-value of 0.0170 is less than a=0.05 , we reject HO 
. We have convincing evidence that the true mean amount spent on food per household in 


this city is different from the national average of $158. 


pla p” ! 


op/=0.08(0.92)500=0.0121 


- Hp’ op’ 
Chapter 10 
Section 10.1 
Answers to Check Your Understanding 
page 630: STATE: 95% CI for p1—p2 , where p1=true 
proportion of working women who would say that job security was very or extremely important and 
p2=true proportion of working men who would say that job security was very or 
extremely important. PLAN: Two-sample z interval for p1—p2 . Random: The 
data come from independent random samples of 806 working women and 944 working men. 10%: 
nl = 806<10% of the population of all working women and 
n2=944<10% of the population of all working 
men. Large Counts: 709, 806-709=97 , 802, and 944-802=142 
are all $10. 
i = 0.880 50 = 0.850 


DO: p\1=709806=0.880"! — %06 , pA2=802944=0.850?2 ~ 944 
(0.88-0.85)+1.960.88(0.12)806+0.85(0.15)944=0.03+0.032 


/ 0.88(0.12) 0.85(0.15) 
/ = 0.03 + 0.032 = 
(0.88 0.85) mF 1.96 \ 806 t O44 0 03 0 03 =(-0.002, 0.062)= (—0.002, 0.062) 


Tech: 2-PropZInt gives (—0.002, 0.062)(—9-002, 0.062) 


CONCLUDE: We are 95% confident that the interval from —0.002~—9-002 to 0.062 captures 
p1-p2=the?1 ~ P2 = the difference in the true proportions of all working women and all 
working men who would say that job security is very or extremely important. 


page 637: 1. STATE: HO: p1-p2=0/40 : Pi — P2 = 9 Ha: p1-p2<oHa : Pi — P2 < 9, where p1=the 


true proportion of children like the ones in the study who attend preschool that use 


social services later and p2=the true proportion of children like the ones in the study 
who do not attend preschool that use social services later using a=0.05 . PLAN: Two- 
sample z test for pl—-p2 . Random: The children were randomly assigned to 
attend or not attend preschool. Large Counts: 38, 62—38=24 , 49, and 61-49=12 

61 — 49 = 12 are $10. 

DO: p\1=3862=0.6129"1 = = oer? p’2=4961=0.8033"2 ~ ar a aise and pA=38+4962+61= 


~ 38449 
~ 62461 


87 
87123=0.7073123 ~ 9°"873 5 (9 6129-0.8033)-00.7073(0.2927)62+0.7073(0.2927)61=-2.32 


P-value=0.0102 . CONCLUDE: Because the P-value of 0.0102<a=0.05 


, we reject HO . There is convincing 
evidence that the true proportion of children like the ones in the study who do attend preschool who use 
social services later is less than the true proportion of children like the ones in the study who do not attend 
preschool who use social services later. In other words, children like those in this study who participate in 
preschool are less likely to use social services later in life. 


2. Because we rejected Hoo , we may have made a Type I error: finding convincing evidence that the 
true proportion of children like the ones in the study who attend preschool who use social services later is 
less than the true proportion of children like the ones in the study who do not attend preschool who use 
social services later, when in reality the true proportions are equal. 


3. No; the children who participated in the study were recruited from low-income families in Michigan, 
not randomly selected, so the results cannot be generalized to all children from low-income families. 


Answers to Odd-Numbered Section 10.1 Exercises 

10.1 (a) Approximately Normal because nCpC=15, nC(1-pC)=35, nApA=15 

nope = 15,ne (1 — pe) = 35, napa = 15 and nA(1-pA)=85""4 (1—pa) = 85 .6 >10> 10. (b) 
-=0.30-0.15=0.15#— = 0.30 — 0.15 = 0.15) Because 50<1050 < 10, of the jelly beans in the 
Child mix and 100<10%100 < 10% of the jelly beans in the Adult mix, 


™ _ / 0.3(0.7) 4 0.15(0.85) = 0.0740 
opC-pA=0.3(0.7)50+0.15(0.85)100=0.0740 ?°-?4 — V 50 100000 . The difference 


(Child mix— Adult mix) in the sample 
proportions of red jelly beans typically varies by about 0.0740 from the true difference in proportions of 
0.15. 


10.3 (a) P(p1-p’2<0)=P(z<0-0.150.0740)=0.0212 


(b) Yes, we might doubt the company’s claim. There is only a 2% chance of getting a proportion of red 
jelly beans in the Child sample less than or equal to the proportion of red jelly beans in the Adult sample 
if the company’s claim is true. 

10.5 Random: Not met because these data do not come from independent random samples or two groups 
in a randomized experiment. 10%: Since no sampling took place, the 10% condition does not apply. 
Large Counts: Not met because there were fewer than 10 successes (3) in the group from the west side of 
Woburn. 

10.7 Random: Cockroaches were randomly assigned to one of the two treatment groups. 10%: Since no 
sampling took place, the 10% condition does not apply. Large Counts: 25, 11, 18, and 18 are $10. The 
conditions are met. 


10.9 (a) STATE: p1=the true proportion of young men who live in their parents’ 
home and p2=the true proportion of young women who live in their parents’ home. 


PLAN: Two-sample z interval for p1—p2 . Random: Independent random 


samples. 10%: n1=2253<10% of all young men 
and n2=2629<10% of all young women. Large 
Counts: 986, 1267, 923, and 1706 are =10 . DO: (0.051,0.123) 

. CONCLUDE: We are 99% confident that the 


interval from 0.051 to 0.123 captures p1—p2=the true difference in 
the proportions of young men and young women who live in their parents’ home. (b) Because the interval 
does not contain 0, there is convincing evidence that the true proportion of young men who live at their 
parents’ home is different from the true proportion of young women who live in their parents’ home. 


10.11 STATE: p1=the true proportion of people like the ones in this study who would 
say they support President Obama’s decision when asked by Hassan and p2=the true 
proportion of people like the ones in this study who would say they support President Obama’s decision 
when asked by Ken. PLAN: Two-sample z-interval for p1—p2 . Random: Two 
groups in a randomized experiment. 10%: Does not apply. Large Counts: 11, 39, 21, and 23 are >10 
. DO: (-0.414, -0.100) 

. CONCLUDE: We are 90% 

confident that the interval from —0.414 to —0.100 captures pl—p2=the 
true difference in the proportions of people like the ones in this 

study who would say they support President Obama’s decision when asked by Hassan or by Ken. 


10.13 No; if the two population proportions were equal, the true difference in proportions would be 0. 
Because 0 is captured in the interval, we think it is plausible that the population proportions are equal, but 
this does not provide convincing evidence that the two proportions are equal. 


10.15 (a) HO : p1—p2=0 , Ha: p1-p2<0 
, where p1=the true 
proportion of 4- to 5-year-olds who would sort correctly and p2=the true proportion 


of 6- to 7-year-olds who would sort correctly. (b) Random: Independent random samples. 10%: 
n1=50<10, of all 4- to 5-year-olds and n2=53<10 
, of all 6- to 7-year olds. Large Counts: 10, 40, 28, and 25 


are all >10 
10.17 (a) HO : p1—p2=0 , Ha: p1-p2>0 
Hy : pi — po > 0, where pl=theP?! = the true proportion of shrubs that would resprout after being 


clipped and burned and p2=theP2 ~ the true proportion that would resprout after being clipped. (b) 
Random: Two groups in a randomized experiment. Large Counts: Not met because there were fewer than 
10 failures (0) in the treatment group, fewer than 10 successes (8) in the control group, and fewer than 10 
failures in the control group (4). 


10.19 (a) Yes; pA1—pA2=-0.328P 1-22 = 0.328 which gives some evidence in favor of 
Ha : p1-p2<0Ma : Pi ~ P2 < 9, (b) z=-3.452 = —3.45; P-value=0.0003 
(c) Because the P-value of 0.0003<a=0.05 


, we reject HO . We have convincing 
evidence that the true proportion of 4- to 5-year-olds who would sort correctly is less than the true 
proportion of 6- to 7-year-olds who would sort correctly. 

10.21 (a) STATE: HO: p1—p2=0 , 
Ha : pl—p2<0 , where p1=the 

true proportion of sophomores who bring a bag lunch and p2=the 

true proportion of seniors who bring a bag lunch. PLAN: Two-sample z test for 
pl-p2 . Random: Independent random samples. 10%: n1=80<10, 

of all sophomores and n2=104<10 
of all seniors. Large Counts: 52, 28, 78, and 26 are >10 
. DO: z=—1.48; P-value=0.0699 
. CONCLUDE: Because the P-value of 

0.0699>0=0.05 , we fail to reject HO 
There is not convincing evidence that the true proportion of sophomores who bring a bag lunch is less 
than the true proportion of seniors who bring a bag lunch. (b) If there is no difference in the true 


proportion of sophomores and seniors who bring a bag lunch, there is a 0.0699 probability of getting a 
difference in the proportions as large as or larger than the one observed (0.65-0.75=—0.10) 


by chance alone. 

10.23 STATE: HO : p1—p2=0 , Ha: p1-p240 

, where p1=the true 
proportion of children like the ones in this study who are exposed to peanut butter as infants who are 
allergic to peanuts at age 5 and p2=the true proportion of children not exposed to 
peanut butter who are allergic at age 5, using a=0.05 . PLAN: Two-sample z test for 
p1-p2 . Random: Two groups in a randomized experiment. Large Counts: 10, 
297, 55, and 266 are $10 . DO: z=—-5.71; P{value~0 

. CONCLUDE: 

Because the P-value of approximately 0<a=0.050 < a = 0.05, we reject HoZ0. There is convincing 
evidence that the true proportion of children like the ones in this study who are exposed to peanut butter 
as infants who are allergic to peanuts at age 5 is different from the true proportion of children like the 
ones in this study who are not exposed to peanut butter as infants who are allergic to peanuts at age 5. (b) 
Because we rejected Hoo , we may have made a Type I error. (c) No; this experiment likely had parents 
who volunteered their infants as subjects. When subjects are not randomly selected, we should not 
generalize the results of an experiment to some larger population of interest. 
10.25 (a) STATE: p1=theP?! ~ the true proportion of children like the ones in this study who are 
exposed to peanut butter as infants who are allergic to peanuts at age 5 and p2=theP2 ~ the tre 
proportion of children not exposed to peanut butter who are allergic at age 5. PLAN: Two-sample z 
interval for pl1—p2P1 ~ P2. The conditions are met. DO: (—0.185, ~0.093)\—9-185, ~ 0.093) 
CONCLUDE: We are 95% confident that the interval from —0.185~9-185 to -0.093 captures p1—p2=the 


Pi ~ p2 = the true difference between the proportions of infants like these who become allergic to 
peanut butter if they consume a baby-food form of peanut butter versus if they avoid peanut butter. (b) 
The confidence interval gives the values of p1—p2 that are plausible based on 


the sample data. The two-sided test only allows us to reject (or fail to reject) a difference of 0, where a 
confidence interval provides a set of plausible values for the true difference. 


10.27 (a) Two-sample z test for p1—p2 . Random: Two groups in a randomized 
experiment. Large Counts: 44, 44, 21, and 60 are >10 . (b) If there is no 
difference in the true pregnancy rates of women who are being prayed for and those who are not, there is a 
0.0007 probability of getting a difference in pregnancy rates as large or larger than the one observed in the 
experiment (0.500-0.259=0.241) by 
chance alone. (c) Because the P-value of 0.0007 is less than a=0.05 , we reject HO 

. There is convincing evidence that the pregnancy rates among women like these who are 
prayed for is higher than the pregnancy rates for those who are not prayed for. (d) It is important that the 
women who were trying to get pregnant didn’t know whether they were being prayed for or not. This 
knowledge might have affected their behavior in some way (even unconsciously) that would have affected 
whether they became pregnant or not. Then we wouldn’t know if it was the prayer or the other behaviors 
that caused the higher pregnancy rate. 
10.29 (a) HO: p1—p2=0, Ha: p1—p240 


, where p1=the true proportion of subjects like these who would admit to texting and 
driving when asked Version A of the question and p2=the true proportion of subjects 
like these who would admit to texting and driving when asked Version B. (b) Conditions not satisfied; the 
number of subjects who did not admit to texting and driving when asked Version A (25-16=9) 
is less than 10. (c) p\1—p42=0.64-0.48=0.16 
; there are 21 


trials of the simulation with a difference of proportions less than or equal to —0.16 and 14 
trials of the simulation with a difference of proportions greater than or equal to 0.16. Estimated P- 


value=35/100=0.35 . (d) Because 
the estimated P-value of 0.35>a=0.05 , we fail to reject 
HO . We do not have convincing evidence that there is a difference between the proportions 


of subjects like these who would admit to texting and driving when asked Version A versus when asked 
Version B of the question. 


1031 a 

10.33 ¢ 

10.35 (a) yA=—-13,832+14,954x , where 
y/=the predicted mileage and x=the age in years of the cars. (b) The 


predicted number of miles goes up by 14,954 for each increase of 1 year in age. (c) 
y\=-13,832+14,954(10)=135, 708 


. Residual=110,000-135,708=—25,708 


miles. The 
student’s car had 25,708 fewer miles than predicted, based on its age. 


Section 10.2 
Answers to Check Your Understanding 
page 655: 1. STATE: 95% CI for p1-p2 , where p11=the 
true mean change in pulse rate for students like these after drinking 12 ounces of 


cola with caffeine and 2=the true mean change in pulse rate for students like 
these after drinking 12 ounces of caffeine-free cola. PLAN: Two-sample t interval for p1-p2 


. Random: The volunteers were randomly assigned to drink either cola 
with caffeine or caffeine-free cola. Normal/Large Sample: The sample sizes are small, but the dotplots do 
not show any outliers or strong skewness. 


DO: x 1= 3.2, 815 2.70, n1l= 10, 
x 2 2, S2= 2.62, n2= 10 


Option 1: 2-Samp TInt gives (—1.302, 3.702) 


using df=17.986 
Option 2: df=9, t*=2.262 
(3.2-2)+2.2622.70210+2.62210=1.2+2.691=(—1.491, 3.891) 


CONCLUDE: We are 99% confident that the interval from —1.491 to 3.891 
beats per minute captures p1-p2=the#! ~ #2 = the difference in the true mean change in pulse 
rate for students like these after drinking 12 ounces of cola with versus without caffeine. 


2. The interval in Question 1 does not suggest that caffeine increases the average pulse rate of subjects 
like these. Because the interval contains 0, it is plausible that the true mean change in pulse rate for all 
students like these after drinking 12 ounces of cola with versus without caffeine may equal 0, indicating 
no difference. 

page 663: STATE: HO : p1-p2=0/0 : Ha — #2 = 9 Ha: y1—po>oHa ‘Hi — He > O where y1=the 
f41 = the trye mean breaking strength of polyester fabric buried for 2 weeks and ,12=the/2 ~ the tre 
mean breaking strength of polyester fabric buried for 16 weeks. PLAN: Two-sample ¢ test for p1—-p2 

#1 ~ #2, Random: The strips were randomly assigned to the treatments. Normal/Large Sample: The 
dotplots show no strong skewness and no outliers. 


2 weeks 


16 weeks “—, a a 7 CT CT a 
102 108 114 120 126 132 138 
Breaking strength (Ib) 


DO: x 1=123.8, s1=4.60, n1=5, x 2=116.4, s2=16.09 


n2=5 ; t= 0.99. Option 1: P-value=0.1857 

using df=4.65df = 4.65, Option 2: P-value between 0.15 and 
0.20 using df = 4. CONCLUDE: Because the P-value of 0.1857>a=0.050.1857 > a = 0.05, we 
fail to reject HOM. We do not have convincing evidence that the true mean breaking strength 
of polyester fabric that is buried for 2 weeks is greater than the true mean breaking strength for 
polyester fabric that is buried for 16 weeks. 


Answers to Odd-Numbered Section 10.2 Exercises 

10.37 (a) Normal because both population distributions are Normal. (b) px M-x B=188-170=18 
Hsy-z,g = 188-170 = 18 mg/dl (c) Because 25<10%29 < 10% of all 20- to 34-year-old males and 
36<10%,36 < 10%, of all 14-year-old boys, 
ox M-x B=41225+30236=9.60 mg/dl = 78 — V 5 + ag = 9-60mg/dl 
34-year- 


. The difference (20- to 


old males — 14-year-old boys) in the sample mean blood cholesterol levels typically varies by about 9.60 
mg/dl from the true difference in means of 18 mg/dl. 


P(Zy —_p<0)=P (z < —) — 0.0304 Pa 


10.39 (a) P(x M-x B<0)=P(z<0-189.60)=0.0304 9.60 


Yes; the likelihood that the sample mean cholesterol level of the boys is greater than the sample mean 
cholesterol level of the men is only 3%. 

10.41 Random: Met because these are two independent random samples. 10%: Met because 20<10% 

20 < 10%, of all males at the school and 20<10%20 < 10%, of all females at the school. Normal/Large 
Sample: Not met; there are fewer than 30 observations in each group and a dotplot for males shows 
several outliers. 
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10.43 Random: Met because these are two independent random samples. 10%: Met because 50<10% 


, of students in the United Kingdom and 50<10% , of 
students in South Africa. Normal/Large Sample: Met because 50230 and 
50230 , even though there is an outlier in the South African distribution. 


10.45 (a) The distributions of percent change are both slightly skewed to the left with no apparent 
outliers. The centers of the two distributions seem to be quite different, with people drinking red wine 
generally having more polyphenols in their blood. The distribution of percent change for the white wine 


drinkers is a bit more variable. (b) STATE: p.1=the true mean percent change in 
polyphenol level in the blood of people like those in the study who drink red wine and p.2=the 
true mean percent change in polyphenol level in the blood of people like those in 
the study who drink white wine. PLAN: Two-sample ¢ interval for 11-2 
Random: Two groups in a randomized experiment. Normal/Large Sample: The dotplots show no strong 
skewness and no outliers. DO: x 1=5.5,s1=2.517 
,n1=9, x 2=0.23, s2=3.292 
, and n2=9 
. Using df=14.97 , (2.845, 7.689); using df=8,(2.701, 7.839) 
. CONCLUDE: We are 90% 
confident that the interval from 2.845 to 7.689 captures .1-y12=the 
true difference in mean percent change in polyphenol level for 
men like these who drink red wine and men like these who drink white wine. 


10.47 (a) Skewed to the right because the earnings cannot be negative, yet the standard deviation is 
almost as large as the distance between the mean and 0. The use of the two-sample t procedures is 
justified because the sample sizes are both very large (675230 and 621230) 


_(b) 


STATE: p1=the true mean summer earnings of male students and p.2=the 
true mean summer earnings of female students. PLAN: Two-sample t interval for 
p1-p2 . Random: It is reasonable to consider the two samples independent 
because knowing the response of a male shouldn’t help us predict the response of a female. 10%: 
n1=675<10% of male students and n2=621<10% 
of female students. Normal/Large Sample: 
n1=675>30 and n2=621>30 
. Using df=1249.21 P 
(413.62, 634.64); using df=620, (413.54, 634.72) 
. CONCLUDE: We are 
90% confident that the interval from $413.62 to $634.64 captures 1—-p2=the 
true difference in mean summer earnings of male students and 
female students at this large university. (c) If we took many random samples of 675 males and 621 


females from this university and each time constructed a 90% confidence interval in this same way, about 
90% of the resulting intervals would capture the true difference in mean earnings for males and females. 


10.49 (a) No, because our interval includes a difference of 0 (no difference) as a plausible value. (b) No; 
instead, we don’t have convincing evidence that the mean reaction times are different. Zero is a plausible 
value for the difference in means, but there are many other plausible values besides 0 in the confidence 


interval. 

10.51 (a) HO: p1-p2=0 , Ha: p1-p240 
, where 11=the 

true mean time it would take to sort the cards for students like these while listening to music and 12=the 


true mean time it would take to sort the cards for students like these while not 
listening to music. (b) Random: Two groups in a randomized experiment. Normal/Large Sample: Not 
met; the sample sizes are small. The distribution of time to sort for those who listen to music shows no 
strong skewness and no outliers, but the distribution of time to sort for those who listen to no music is 
skewed right with one upper outlier. 


10.53 (a) HO: p1-p2=0 , Ha: p1-p240 
, where p1=the 


true mean reliability rating of Anglo customers and p12=the true mean reliability 
rating of Hispanic customers. (b) Random: Independent random samples. 10%: n1=92<10% 


of Anglo customers and n2=86<10% 
of Hispanic customers. Normal/Large Sample: 


n1=92>30 and n2=86>30 


10.55 (a) x 1-x 2=6.37-5.91=0.4621 — 2 = 6.37 — 9.91 = 0.46 points, which gives some evidence 


in favor of Ha: 1-p2#0Ha > fy — fe FO. (b) t=3.89# = 3.89; using df=143.69, P-value=0.0002 
df = 143.69, P-value = 0.0002 Using df=85, P-value=0.000248£ = 85, P-value = 0.0002. (c) Because 


the P-value of 0.0002<o=0.050.0002 < a = 0.05, we reject HO/40. We have convincing evidence that 

the true mean reliability rating for all Hispanic customers is different from the true mean reliability rating 

for all Anglo customers. 

10.57 (a) STATE: HO: pi-p2=0/f0 * #1 ~ #2 =9 Ha: pi-p2>oMe * Hi — M2 > 9 where p1=true 
mean reduction in blood pressure for subjects like these who take fish oil and 


2 = true mean reduction in blood pressure for subjects like these who take regular oil. PLAN: 


12=true 
Two-sample t test for 1-2/1 ~ #2, Random: Two groups in a randomized experiment. Normal/Large 
Sample: The dotplots show no strong skewness and no outliers. 
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DO: x 1=6.571, s1=5.855, nl=7, x 2=—-1.143, s2=3.185 
= 6.571, 8; = 5.855, Ny = T, Be = 1.143, s2 = 3.185 , nJ=72 = 7. t=3.06¢ = 3.06; using 


df=9.264 , P-value=0.0065 . Using df=6 
, P-value=0.0111 . CONCLUDE: Because the P- 
value of 0.0065<a=0.05 , we reject Hoo. We 


have convincing evidence that fish oil helps reduce blood pressure more, on average, than 
regular oil for subjects like these. (b) Assuming that the true mean reduction in blood pressure 
is the same regardless of whether the individual takes fish oil or regular oil, there is a 0.0065 
probability that we would observe a difference in sample means of 7.714 or greater by chance 
alone. 


10.59 STATE: HO: p1-p2=0/0 : Hi ~ #2 = 9 Ya: pi—poeoHa : Mi — He #0 where y1=the 
/41 = the trye mean number of words spoken per day by female students and .2=the#2 — the true mean 
number of words spoken per day by male students, using a=0.05a = 0.05, PLAN: Two-sample t test for 
y1-p2!41 — #2. Random: Independent random samples. 10%: n1=56<10%1 = 56 < 10% of females at 
a large university and n2=56<10% of males at a 
large university. Normal/Large Sample: n1=56>30 

and n2=56>30 


DO: t=-0.25 ; using df=106.195, P-value=0.8043 

. Using df=55 , P- 
value=0.8035 . CONCLUDE: Because the P-value of 
0.8043>a=0.05 , we fail to reject HO 


. We do not have convincing evidence that the true mean number of words 
spoken per day by female students is different from the true mean number of words spoken per 
day by male students at this university. 


10.61 (a) DO: x 1=16,177 , $1=7520 , n1=56 
, X 2=15,569 , S2=9108 : 
n2=56 . Using df=106.2, (—3521, 2737) 
; using df=55, 
(-3555, 2771) 
CONCLUDE: We are 95% confident that the interval from —3521 to 2737 words captures p:1—p12=the 


true difference in mean number of words spoken per day by 
female students and the mean number of words spoken per day by male students at this university. (b) 


The confidence interval gives the values of p1-p2 that are plausible based 
on the sample data. The two-sided test only allows us to reject (or fail to reject) a difference of 0, where a 
confidence interval provides a set of plausible values for the true difference in means. 


10.63 (a) The score distribution for the activities group is slightly skewed to the left while the score 
distribution for the control group is slightly skewed to the right. Neither distribution has any clear outliers. 
The center of the activities group is higher than the center of the control group. The scores in the activities 
group are less variable than the scores in the control group. Overall, it appears that scores are typically 
higher for students in the activities group. (b) Because the P-value of 0.013<a=0.05 


, we reject HO . We have convincing 
evidence that the true mean DRP score for third-grade students like the ones in the experiment who do the 
activities is greater than the true mean DRP score for third-grade students like the ones in the experiment 
who don’t do the activities. (c) Yes, because this was a randomized controlled experiment. (d) Because 
we rejected the null hypothesis, we could have committed a Type I error. 


10.65 (a) STATE: HO: p1-p2=10 , 
Ha: p1-p2>10 , where p.1=the 

true mean cholesterol reduction for people like the ones in the study when using 
the new drug and p12=the true mean cholesterol reduction for people like the ones 
in the study when using the current drug, using a=0.05 . PLAN: Two-sample ¢ test for 


p1-p2 . Random: The subjects were assigned at random to the new or 
current drug. Normal/Large Sample: The graphs of the data show no strong skewness or outliers. DO: 
t=0.98 - using df=26.96, P-value=0.16754f = 26.96, P-value = 0.1675 Using df=13, P- 
value=0.17254f = 13, P-value = 0.1725. CONCLUDE: Because the P-value of 0.1675>0=0.05 
0.1675 > a = 0.05, we fail to reject HO/0. We do not have convincing evidence that the true mean 
cholesterol reduction is more than 10 mg/dl greater for the new drug than for the current drug. (b) 
Because we failed to reject the null hypothesis, we could have committed a Type II error. 


10.67 (a) The researchers randomly assigned the subjects to create two groups that were roughly 
equivalent at the beginning of the experiment. (b) The difference in the sample means is 4.14. Based on 
the dotplot, only 1 out of the 100 differences were that great, meaning that the P-value is approximately 


0.01. Assuming that the true difference (Intrinsic-Extrinsic)(Imtrinsic ~ Extrinsic) in mean creativity 
rating is 0, there is a 0.01 probability that we would observe a difference in sample means of 4.14 or 


greater by chance alone. (c) Because the P-value of 0.01<a=0.050.01 < @ = 0.05, we have convincing 


evidence that the true mean rating for students like these who are provided with internal reasons is higher 
than the true mean rating for students like these who are provided with external reasons. (d) Because we 
found convincing evidence that the mean is higher for students with internal reasons when it is possible 
that there is no difference in the means, we could have made a Type I error. 


10.69 d 
10.71 b 


10.73 (a) By the 68—95—99.7 rule, about 5% of all observations will fall outside of this interval. 
P(at least one mean outside interval)=1-—(0.95)2= 0.0975 
P(at least one mean outside interval) = 1 — (0.95)? = 0.0975, (b) By the 68-95-99.7 rule, about 


2.5% of all observations will be greater than px +20x Mz + 205 Let X=theX = the number of samples 
that must be taken to observe one greater than ux +20x — 


. X is a geometric random variable with p=0.025 


P(X=4)=(1-0.025)3(0.025)=0.0232 


. (c) By the 68-95-99.7 rule, the probability of any one observation falling within the interval px —-ox — 


to ux +ox is 
about 0.68. Let X=the number of sample means out of 5 that fall outside this interval. 
Assuming the samples are independent, X is a binomial random variable with n=5 and 
p=0.32 . P(X24)=1-binomcdf(trials: 5, p: 0.32, x value: 3)=0.039 


. There is a 0.039 probability that at least 4 of the 5 sample means fall outside of this interval. This is a 
reasonable criterion, because when the process is under control, we would get a “false alarm” only about 
4% of the time. 


Section 10.3 
Answers to Check Your Understanding 
page 678: 1. STATE: 90% CI for ydiff=the true mean difference 


(Music—Without music) in time for students 
like the ones in this study to complete the arithmetic test. PLAN: One-sample t interval for pdiffMdiff. 
Random: The volunteers were randomly assigned the order of the music and no-music treatments. 
Normal/Large Sample: ndiff=30>30"aitt = 80 = 30. po; x diff="aitt = 2.8, sdiff=7.490%aitt = 7-490 
ndiff=30 . With 90% confidence and df=30-1=29, t*=1.699 


2.8+1.6997.49030=2.8+2.323=(0.477, 5.123) 


Tech: The TInterval function gives (0.477, 5.123) with df=29 
CONCLUDE: We are 90% confident that the interval from 0.477 to 5.123 captures the true mean 


difference (Music— Without music) in time 


for students like these to complete the arithmetic test. 

2. The 90% confidence interval in Question 1 suggests that students are between 0.477 and 5.123 seconds 
slower, on average, when completing the arithmetic test while music is playing. 

page 682: STATE: HO : pdiff=0 , Ha : pdiff>0 

Hq : bait > 9 where pdiff=Mdilf = the true mean difference (Air—Nitrogen) (Air — Nitrogen) j, 
pressure lost using a=0.05@ = 0.05, PLAN: Paired t test for pdiffMdiff . Random: Tires in each pair are 
randomly assigned to be filled with air or nitrogen. Normal/Large Sample: ndiff=31>30"diff — 31 > 30. 
DO: x diff=1.252%aitt = 1-252 caiff=1.202 , and ndiff=31 
t=1.252-01.20231=5.80 


df=30 and P-value~0 . CONCLUDE: 
Because the P-value of approximately 0<a=0.05 , we reject 
HO . We have convincing evidence that the true mean difference (Air—Nitrogen) 


in pressure lost is greater than 0. In other words, we 
have convincing evidence that tires lose less pressure when filled with nitrogen than when filled with air, 


on average. 


Answers to Odd-Numbered Section 10.3 Exercises 
10.75 (a) 


(b) Most of the differences are positive, meaning that the estimates of distance traveled as gauged by the 
weight tend to be greater than the estimates of distance traveled as gauged by the grooves of the tire. (c) 

x diff=4.556 , sdiff=3.226 . The estimate of the 
number of miles driven using the “weight” method is 4.556 thousand miles greater, on average, than the 
estimate of the number of miles driven using the “groove” method. 

10.77 (a) The pilot recorded two values for each individual (each of 12 randomly selected days). (b) Most 
of the differences are positive, meaning that the outbound flights (Dubai to Doha) tend to take longer. (c) 


x diff=10.083 , sdiff=10.766 ; the difference 
(Outbound-Return) in flight times typically 
varies by about 10.766 minutes from the mean of 10.083 minutes. 

10.79 STATE: jdiff=the true mean difference in the estimates from these two 
methods in the population of tires. PLAN: One-sample t interval for diff . Random: Random 
sample of 16 tires. 10%: 16<10% of all tires. Normal/Large Sample: The dotplot 
in Exercise 75 does not show any strong skewness or outliers. 

DO: x diff=4.556 , sdiff=3.226 , and 

ndiff=16; df=15 ; (2.837, 


6.275)0:275) . CONCLUDE: We are 95% confident that the interval from 2.837 to 6.275 thousand miles 
captures the true mean difference in the estimates from these two methods in the population of tires. 


10.81 (a) STATE: pdiff=the/diff = the true mean difference (After—Before)( After ~ Before) in 
reasoning scores for all preschool students who take six months of piano lessons. PLAN: One-sample t 
interval for pdiffMditf, Random: Random sample of 34 preschool children. 10%: 34<10%,34 < 10%, of 
all preschool children. Normal/Large Sample: ndiff=34>30"«ait = 34 = 30. Do: x diff=3.618 

Tait = 3.618  qitf=3.055 , ndiff=34 : df= 33, (2.731, 4.505) 


CONCLUDE: We are 90% confident that the interval from 2.731 to 4.505 captures pdiff= 
the true mean difference (After—Before) 

in reasoning scores for all preschool students who 
take six months of piano lessons. (b) No; a randomized experiment is needed to show causation. 
10.83 The dotplot of the distribution of difference (Outbound—Return) 

in flight times is skewed right with a possible 

high outlier, violating the Normal/Large Sample condition. 
10.85 STATE: HO : pdiff=0, Ha : pdiff#0 
where ,diff=the true mean difference (Inside — Drive-thru) in time it would take to 


receive an iced coffee at this Dunkin’ Donuts restaurant after placing the order, using a=0.05 


. PLAN: Paired t test for pdiff 


Random: The times were randomly selected. Normal/Large Sample: The dotplot does not show any 
strong skewness or outliers. 
DO: x diff=3.9 seconds, sdiff=5.646 , and ndiff=10 
> t=2.18 ; df=9 ; P{ value=0.0568 
. CONCLUDE: Because the P-value of 0.0568>a=0.05 
0.0568 > a = 0.05, we fail to reject HO/0. We do not have convincing evidence that the true mean 


Inside — Drive-thru) 


difference (Inside-Drive-thru)| is not 0. 


10.87 (a) Yes! The data arose from a randomized comparative experiment, so if the result of this study is 
Statistically significant, we can conclude that the difference in the ability to memorize words was caused 
by whether students were performing the task in silence or with music playing. (b) STATE: 

HO: y:diff=0, Ha: udittzoZo > Maite = 9, Ha : Maite 9 where udiff=Mdiff — the true mean 


difference (Music-Silence)\Music — Silence) 


in the average number of words recalled by students at 
this school. a=0.01@ = 0.01, PLAN: Paired t test for diff . Random: The treatments were 
assigned in a random order. Normal/Large Sample: ndiff= 30 = 30. DO: x diff= 
1.57, sdiff= 2.70, and ndiff= 30; t=3.18 
; df = 29; P-value = 0.0034. CONCLUDE: Because the P-value of 0.0034 < a=0.01 
, we reject HO . We have convincing evidence that the true mean 

difference (Music-Silence) in the average number 
of words recalled by students at this school is different from 0. (c) Because we rejected HO 
, we may have made a Type I error. 
10.89 (a) STATE: pdiff= the true mean difference (Music—Silence) 


in the average number of words recalled by 
students at this school. PLAN: One-sample t interval for diff . Random: The treatments were 
assigned in a random order. Normal/Large Sample: ndiff=30>30 
.DO: x diff=1.57 
, sdiff=2.70 , ndiff=30 ; df= 29 
, (0.211, 2.929) . CONCLUDE: We are 
99% confident that the interval from 0.211 to 2.929 captures the true mean difference (Music— Silence) 
in the number of words recalled by students at this 
school. (b) The confidence interval gives the values of diff that are plausible based on the 


sample data. The two-sided test only allows us to reject (or fail to reject) a difference of 0, where a 
confidence interval provides a set of plausible values for the true mean difference. 

10.91 (a) Two-sample t test; the data are being produced using two distinct groups of cars in a 
randomized experiment. (b) Paired t test; this is a matched pairs experimental design where both 
treatments are applied to each subject in a random order. (c) Paired t test; the data were collected from the 
male and female partners in 40 couples. 


10.93 (a) The table shows two values for each individual (each student). (b) Most of the differences 


(Baseball—Softball) are positive, meaning the 
students tend to throw the baseball farther than the softball. (c) HO: pdiff=0, Ha: pdiff>0 


where diff=the true mean difference (Baseball—Softball) 
in distance thrown. (d) ndiff=24<30 
and the boxplot of the differences has an outlier. (e) A 

mean difference of 6.54 or larger happened in 0 of the 100 simulation trials, so the estimated P-value is 0. 
Because the P-value<a=0.05 , we reject HO 

. These data provide convincing evidence that students like these can throw the baseball 
farther, on average, than the softball. 
10.95 a 
10.97 e 


10.99 (a) One-sample z interval for a proportion. (b) Paired t test for the mean difference. (c) Two-sample 
z interval for the difference in proportions. (d) Two-sample t test for a difference in means. 


10.101 (a) Paired t test; we have paired data (two scores for each student). (b) STATE: HO : pdiff=0 


, Ha: pdiff>0 
, where pdiff=the true mean 
increase in SAT verbal scores of students who were coached, using a=0.05 PLAN: 
Paired t test for pdiff . Random: Random sample of 427 students. 10%: ndiff=427 


is less than 10% of students who are coached. Normal/Large Sample: ndiff=427>30 
naitt = 427 > 30. Ho: t=10.16t = 10.16; df=426; df = 426; p_valuewoP-value ~ 0, CONCLUDE: 
Because the P-value of approximately 0<a=0.050 < a = 0.05, we reject Hoo. There is convincing 
evidence that students who are coached increase their scores on the SAT verbal test, on average. 


Answers to Chapter 10 Review Exercises 
R10.1 (a) Approximately Normal because 80, 20, 42, and 28 are >102 10, (b) 

7 _anofs.._¢. = 0.80 — 0.60 = 0.20 100 < 10% 
uupAN-pAK=0.80-0.60=0.20" Px PK (c) Because 100<10%, ~ 40/0; of all 


registered cars in Nathan’s state and 70<10% of all registered cars in Kyle’s state, 
op\N-pAK=0.8(0.2)100+0.6(0.4)70=0.071 


. The difference (Nathan—Kyle) in the sample 
proportions of registered cars that are made by American manufacturers typically varies by about 0.071 
from the true difference in proportions of 0.20. 


R10.2 STATE: pT=true proportion of teens who use Facebook 
and pA=true proportion of adults who use Facebook. PLAN: Two-sample z 
interval for p[—pA . Random: Independent random samples. 10%: 


nT=1060<10%, of all U.S. teens and 


nA=2003<10% , of all U.S. adults. Large 
Counts: 753, 307, 1162, and 841 are =10 . DO:(0.084, 0.176) 
. CONCLUDE: We are 95% confident 

that the interval from 0.084 to 0.176 captures pT—pA=the 
difference in the true proportions of all U.S. teens and all U.S. adults who use Facebook. 
R10.3 (a) STATE: HO: p1-—p2=0 , 
Ha : p1l—p2<0 , where p1=the 

true proportion of patients like these who take AZT and develop AIDS and p2=the 


true proportion of patients like these who take placebo and develop AIDS, using 


a=0.05 . PLAN: Two-sample z test for pl—p2 . Random: The 
subjects were assigned at random to take AZT or a placebo. Large Counts: 17, 418, 38, and 397 are >10 
. DO:z=-2.93 ; P-value=0.0017 


.CONCLUDE: Because the P-value of 0.0017<a=0.05 


, we reject HO . We have convincing 
evidence that taking AZT lowers the proportion of patients like these who develop AIDS compared to a 
placebo. (b) Type I: Finding convincing evidence that AZT lowers the risk of developing AIDS, when in 
reality it does not. Consequence: Patients will pay for a drug that doesn’t help. Type I: Not finding 
convincing evidence that AZT lowers the risk of developing AIDS, when in reality it does. Consequence: 
Patients won’t take the drug when it could delay the onset of AIDS. 
R10.4 Shape: Because n1=49>30"1 = 49 = 30 and n2=49>30"2 = 49 = 30 the shape of the 
distribution of x 1-x 21 — ©2 is approximately Normal. Center: px 1-x 2=15-15=0 
Ha-ay  -Is = cm. Variability: Because 49<10%49 < 10% of all candles made by Machine 1 


and 49<10% of all candles made by Machine 2, 
ox 1-x 2=0.15249+0.10249=0.026 


cm. 
R10.5 (a) STATE: p11=the true mean NAEP quantitative skills test score for young 
men and p.2=the true mean NAEP quantitative skills test score for young women. 
PLAN: Two-sample ¢ interval for p1—p2 . Random: It is reasonable to 


consider the two samples independent because knowing the response of a male shouldn’t help us predict 
the response of a female. 10%: n1=840<10%, 


of all young men and n2=1077<10% 


of all young women. Normal/Large Sample: 


n1=840230 and n2=1077230 
. DO: x 1=272.40 
, $1=59.2 , n1=840, x 2=274.73, s2=57.5 


, n2=1077 


nz = 1077 Using df=1777.52df = 1777.52, (6.76, 2.10)\ 6-76, 2.10). using df=839, (-6.76, 2.10) 
df = 839, (—6.76, 2.10) -.CONCLUDE: We are 90% confident that the interval from —6.76 to 2.10 
captures 1.1-p12=the/#1 ~ #2 = the true difference in the mean NAEP quantitative skills test score for 
young men and the mean NAEP quantitative skills test score for young women. (b) Because 0 is in the 
interval, it is plausible that the true difference is 0. That is, we do not have convincing evidence of a 
difference in mean score for male and female young adults. 


R10.6 (a) The difference data come from two groups in a randomized experiment. (b) The distribution of 
differences for the control group is slightly skewed to the right while the distribution of differences for the 
treatment group is roughly symmetric. Neither distribution has any clear outliers. The center for the 
treatment group is greater than the center for the control group. The differences in the control group are 
more variable (based on the JQR) than the differences in the treatment group. Overall, it appears that 
students in the treatment group had bigger improvements, on average. (c) STATE: HO: p1—-p2=0 

Ho: a ~ He = 0 Ha: p1-p2>o!a * Hi ~ He > O where 1=the/1 = the true mean difference in test 
scores for students like these who get the treatment message and p.2=the true mean 
difference in test scores for students like these who get the neutral message, using a=0.05 


. PLAN: Two-sample t test for p1—-p2 . Random: The 
students were assigned at random to the treatment and control groups. Normal/Large Sample: The 
boxplots show no strong skewness and no outliers. DO: x 1=11.4, s1=3.169, n1=10,x 2=8.25, s2=3.69 


, n2=8 ; t=1.91 . Using df=13.919 , P-value=0.0382 
; using df=7 , P-value=0.0489 
. CONCLUDE: Because the P-value of 0.0382<a=0.05 
, we reject HO . There is convincing 

evidence that the true mean difference in test scores for students like these who get the treatment message 
is greater than the true mean difference in test scores for students like these who get the neutral message. 
(d) We cannot generalize to all students who failed the test because our sample was not a random sample 
of all students who failed the test. It was a group of students who agreed to participate in the experiment. 
R10.7 (a) 


(b) x diff=0.131 ; sdiff=0.119 . For the 8 sprinters 
in the sample, the 50-meter dash time averages 0.131 second longer with the standing start than with the 
blocks. This gives some evidence that the 50-meter dash takes longer with a standing start, on average, 


than with the blocks. (c) STATE: HO : pdiff=0 , Ha: pdiff>0 
, where udiff= the true mean difference 
(Standing — Blocks) in 50-meter run time, using a=0.05 . PLAN: Paired t test for diff 
. Random: The order was randomized. Normal/Large Sample: The dotplot shows no strong 
skewness and no outliers. DO: x diff=0.131 , sdiff=0.119 
, and ndiff=8 5 t=3.11 . Using df=7df = 7, P- 


value=0.0085P-value = 0.0085, CONCLUDE: Because the P-value of 0.0085<a=0.05 
0.0085 < a = 0.05, we reject HO/0. We have convincing evidence that the true mean difference 
(Standing—Blocks)( Standing ~ Blocks) jn 50-meter run time for sprinters like these is greater than 0. 
(d) STATE: diff=the true/aitt = the true mean difference (Standing—Blocks) 
in 50-meter run time. PLAN: One-sample t 
interval for diff . Conditions were checked in part (c). DO: (0.0513, 0.2112) 
. CONCLUDE: We are 90% confident that the 


interval from 0.051 to 0.211 captures pdiff=the true mean difference 


(Standing—Blocks) in 50-meter run time for 


sprinters like these. The confidence interval gives the values of diff that are plausible based 
on the sample data. The test allows us to reject (or fail to reject) a difference of 0, where a confidence 


interval provides a set of plausible values for the true mean difference. 


Answers to Chapter 10 AP® Practice test 
T10.1e 
T10.2 b 
T10.3 a 
T10.4a 
T10.5 e 
T10.6 e 
T10.7 ¢ 
T10.8 c 
Wig GS 
T10.10 a 


T10.11 (a) STATE: 95% CI for p1-p2 , where p1=the 
true mean hospital stay for patients like these who get heating blankets during 


surgery and ,12=the true true mean hospital stay for patients like these whose 
temperatures were reduced during surgery. PLAN: Two-sample t interval for p1-p2 


. Random: Two groups in a randomized experiment. Normal/Large 


Sample: n1=104>30 and n2=96>30 
.DO: x 1=12.1 ; 

51=4.4, n1=104 ,X 2=14.7 

, s2=6.5, n2=96 . Using 
df=165.12 , (-4.16, -1.04) 

; using df=95 : 

(-4.17, -1.03) CONCLUDE: We 
are 95% confident that the interval from —4.16 to —1.04 captures 
i1—p2=the true true difference in mean length of hospital 


stay for patients like these who get heating blankets during surgery and those who have their core 
temperatures reduced during surgery. (b) Yes; because 0 is not in the interval, we have convincing 
evidence that the true mean hospital stay for patients like these who get heating blankets during surgery is 
different from the true mean hospital stay for patients like these who have core temperatures reduced 
during surgery. (c) If we were to repeat this experiment many times and calculate 95% confidence 
intervals for the difference in mean length of hospital stay each time, about 95% of the intervals would 
capture the true difference in mean hospital stay for patients like these who get heating blankets during 
surgery and mean hospital stay for patients like these who have core temperatures reduced during surgery. 


T10.12 (a) STATE: HO: p1—p2=0 , 
Ha : p1-p2>0 , where p1=the?1 = the trye 
proportion of cars that have the brake defect in last year’s model and p2=theP?2 ~ the tre proportion of 
cars with the defect in this year’s model, using a=0.05@ = 0.05, PLAN: Two-sample z test for p1—p2 
P1 ~ P2, Random: Independent random samples. 10%: n1=100<10%"1 = 100 < 10% of last year’s 
model and n2=350<10%"2 = 390 < 10% of this year’s model. Large Counts: 20, 80, 50, and 300 are 
>102 10. Do: DO:z=1.39D0:z = 1.39; P-value=0.0822P-value = 0.0822 CONCLUDE: Because the 
P-value of 0.0822>a=0.05 , we fail to reject HO 
. We do not have convincing evidence that the true proportion of brake defects is smaller 
in this year’s model compared to last year’s model. (b) Because we failed to reject the null hypothesis, we 
may have made a Type II error. The automaker may take other action to improve the production process, 
even though it is working better, which could be costly. 
T10.13 (a) Students may improve from Monday to Wednesday just because they have already done the 
task once. Then we wouldn’t know if the experience with the test or the caffeine is the cause of the 
difference in scores. A better way to run the experiment would be to randomly assign half the students to 
get 1 cup of coffee on Monday and the other half to get no coffee on Monday. Then have each person do 
the opposite treatment on Wednesday. (b) STATE: HO: pdiff=0 
, Ha : pdiff<0 \ 
where ,diff=the true mean difference (No coffee—Coffee) 
in the number of words recalled by students 


like the ones in this study without coffee and with coffee, using a=0.05 . PLAN: Paired t 


test for pdiff . Random: The treatments were assigned in a random order. Normal/Large 
Sample: The dotplot shows no strong skewness and no outliers. 


DO: x diff=—1 , sdiff=0.816 , ndiff=10 

; t=-3.87t =—3.87; df=odf = 9; P-value=0.0019P-value = 0.0019, CONCLUDE: 
Because the P-value of 0.0019<a=0.050-0019 < a = 0.05, we reject HO/40. We have convincing 
evidence that the true mean difference (No coffee—Coffee) 


in word recall for students like the ones in this 
study is less than 0. 


Answers to Cumulative AP® Practice Test 3 
AP3.1e 
AP3.2 b 
AP3.3 d 
AP3.4 ¢ 
AP3.5 d 
AP3.6 d 
AP3.7 c 
AP3.8 a 
AP3.9 d 
AP3.10c¢ 


AP3.11b 
AP3.12 c 
APS,13 ¢ 
AP3.14d 
AP3.15 d 
AP3.16 c 
AP3.17 b 
AP3.18 b 
AP3.19 ¢ 
AP3.20 c 
AP3.21 a 
AP3.22 d 
AP3.23 b 
AP3.24 e 
AP3.25 a 
AP3.26 b 
AP3.27 c 
AP3.28 d 
AP3.29 a 
AP3.30 b 


AP3.31 STATE: HO : pdiff=0 , Ha : pdiff<0 

, where idiff=the true/aitt = the true tye mean change in 
weight (After—Before)\ After ~ Before) jn pounds for people like these who follow a five-week crash 
diet, using a=0.05a = 0.05, PLAN: Paired t test for pdiffHdiff. Random: Random sample of dieters. 
10%: 15<10%15 < 10% of all dieters. Normal/Large Sample: The boxplot shows no strong skewness 
and no outliers. 


-25 -20 -15 -10 -5 0 5 10 15 
Difference (After — Before) in weight (Ib) 


DO: x” diff=-3.67 ditt =—3-6, sqiff=11.538dit = 11-53 ndiff=15 5 t=-1.21 
; df=14 ; P-value=0.1232 
-CONCLUDE: Because the P-value of 0.1232>a=0.05 
, we fail to reject HO . We do not have convincing evidence that the true mean change in 
weight (After—Before) for people like these who 


follow a five-week crash diet is less than 0. 


AP3.32 (a) This is an observational study. No treatments were imposed. (b) STATE: HO: p1—p2=0 
, Ha: p1-p2<0 
, where p1=the true 


proportion of VLBW babies who graduate from high school by age 20 and p2=the 
true proportion of non-VLBW babies who graduate from high school by age 20, using a=0.05 


. PLAN: Two-sample z test for p1—p2 . Random: 
Independent random samples. 10%: n1=242<10% 
of all VLBW babies and n2=233<10% of all non- 
VLBW babies. Large Counts: 179, 63, 193, and 40 are >10 . DO: z=-2.33 
; P-value=0.0095 . CONCLUDE: 
Because the P-value of 0.0095<a=0.05 , we reject HO 


. We have convincing evidence that the true proportion of VLBW babies who graduate 
from high school by age 20 is less than the true proportion of non- VLBW babies who graduate from high 
school by age 20. 

AP3.33 (a) Because there is no leftover curved pattern in the residual plot. The residuals look randomly 
scattered around the residual=0 line. (b) yA=-73.64+5.7188x 
, where y/=predicted distance 
and x=temperature (°C) 
. (c) The predicted distance from the nearest fish to the 


outflow pipe increases by 5.7188 meters for each additional 1-degree-Celsius increase in water discharge 
temperature. (d) y\=—73.64+5.7188(29)=92.21 


meters; the residual=78—-92.21=-14.21 


meters. The actual distance from 
the nearest fish to the outflow pipe on this afternoon was 14.21 meters closer than the distance predicted 


by the regression line with x=29°C 


AP3.34 Define W=theW = the weight of a randomly selected gift box. Then, pW=27HW = 27 and 


oW=2.019W = 2-01, (b) The distribution of W will also be Normal. We want to find P(W>30) 
P(W > 30) 


30—27 


(i) =30-272.01=1.497 ~ 201 ~ 149. pzs1.49)=0.0681P (2 > 1-49) = 0.0681 (i) 

P(W>30)=normalcdf (lower: 30, upper: 1000, mean: 27 
, SD: 2.01)=0.0678. There is a 

0.0678 probability of randomly selecting a box that weighs more than 30 ounces. (c) P (at least one 


box is greater than 30 ounces) 1—(1—0.0678)5=0.2960 


; there is a 0.2960 probability of selecting a random sample of 5 boxes and having at least one box weigh 


more than 30 ounces. (d) The distribution of W will also 


be Normal, with pW =p)W=27 
and oW =2.015= 0.899 


want 

to find P(W >30) 

(i) z2=30-270.899=3.34 ; 
P(z>3.34)=0.0004 

(ii) P(W >30)=normalcdf (lower: 30, upper: 1000, 


mean: 27, SD: 0.899)=0.0004. 


There is a 0.0004 probability of randomly 
selecting 5 boxes that have a mean weight of more than 30 ounces. 


AP3.35 (a) Assuming that the true difference in the mean annualized daily return for Stock A and Stock B 
is 0, there is a 0.042 probability that we would observe a difference in sample means of 4.7 or greater by 
chance alone. CONCLUDE: Because the P-value of 0.042<a=0.05 


, We reject Hof0. We have convincing evidence that the 
true mean annualized return for stock A is different from the true mean annualized return for stock B. (b) 


2 a ee oe 
Ho : odA2-0B2=0, Ha: oA2-0B2>0He oe laa: oe 0, Ha : VA “Sa 0 where oA2=the true 
Co 
a the true variance of returns for stock A and oB2=the?B — the true variance of returns for stock 
(12.9)? 


= .806 
B. (c) F=(12.9)2(9.6)2=1.806 (9.6)" . Values of F that are greater than 1 indicate 


that the price volatility for stock A is higher than that for stock B. The statistic provides some evidence for 
the alternative hypothesis because 1.806>11.806 > 1. (d) A test statistic of 1.806 or greater occurred in 


only 6 out of the 200 trials. Thus, the approximate P-value is 6/200=0.030/ 200 = 9.03 Because 


0.03<a=0.05 , we reject HO . There is 
convincing evidence that the true variance of returns for stock A is greater than the true variance of 
returns for stock B. 


Chapter 11 
Section 11.1 


Answers to Check Your Understanding 


page 714: 1. HO : The company’s claimed color distribution for its Peanut M&M’S® is 
correct. Ha : The company’s claimed color distribution is not correct. 

2. There were 65 candies in the sample from the bag. The expected count of blue, orange, green, and 
yellow candies is 65(0.20)=13 , and the expected 
count of red and brown is 65(0.10)=6.5 

3. x2=(14-13)213+(9-13)2 13+ ---=2.3847 


page 717: 1. The expected counts (13, 13, 13, 13, 6.5, 6.5) are all at least 5. We should use df=6-1=5 


2. Table C: The P-value is greater than 0.25; Tech: P-value=y2 
cdf(lower: 2.3847, upper: 10000, df: 5) =0.7938 


3. Because the P-value of 0.7938>0a=0.05 , we fail to 


reject HO . There is not convincing evidence that the color distribution of M&M’S Peanut 
Chocolate Candies is different from what the company claims. 


page 721: 1. STATE: HO : The distribution of car colors in Oro Valley is the same as the 
distribution of car colors across North America. 


Ha : The distribution of car colors in Oro Valley is not the same as the distribution of car 
colors across North America. We’ll use a=0.050 = 0.05, PLAN: Chi-square test for goodness of fit. 
Random: The data come from a random sample of 300 cars in Oro Valley. 10%: n=3007 = 300 is less 
than 10% of all cars in Oro Valley. Large Counts: All expected counts (69, 54, 48, 45, 30, 27, 6, 21) are at 
least 5. 


2 


>» (84-69) (38—54)* 
: Cnn, a = x" = = +--+ = 29,921 
DO: Test statistic: y2=(84—69)269+(38-54)254+---=29.921 69 54 
P-value: df=8-1=74f = 8-1 = 7. Table C: P-value<0.0005P-value < 0.0005; Tech: yox” cdf(lower: 
29.921, upper: 10000, df: 7) ~0 CONCLUDE: Because the P-value of approximately 
0<a=0.05 , we reject HO . We have 


convincing evidence that the distribution of car colors in Oro Valley is not the same as the distribution of 
car colors across North America. 


2. The table of chi-square contributions is shown below. 


Color Observed Expected O-E (O - E)@/E 
White 84 69 15 3.2609 
Black 38 54 —16 4.7407 
Gray 31 48 —17 6.0208 
Silver 46 45 1 0.0222 
Red 27 30 —3 0.3 

Blue 29 27 2 0.1481 
Green 6 6 0 0 

Other 39 21 18 15.429 


The two biggest contributions to the chi-square statistic came from gray and other colored cars. There 
were fewer gray cars than expected and more other-colored cars than expected. As for Cass’s question: It 
does seem that drivers in Oro Valley prefer lighter- colored cars as there were more white cars than 
expected and fewer black cars than expected, so it seems Cass might be onto something! 


Answers to Odd-Numbered Section 11.1 Exercises 


11.1 (a) HO : The company’s claimed distribution for its deluxe mixed nuts is correct. Ha 
: The company’s claimed distribution is not correct. (b) Cashews: 150(0.52)=78 
, Almonds: 150(0.27)=40.5 
, Macadamia nuts: 150(0.13)=19.5 
, and Brazil nuts: 150(0.08)=12 


(c) y2=(83-78)278+(29-40.5)240.5+---=6.599 


11.3 (a) The P-value (0.061) is between 0.05 and 0.10. (b) The P-value (0.0003) is less than 0.0005. 
11.5 (a) All expected counts (see Exercise 11.1) are >5; df=3 


(b) The P-value (0.0858) is between 0.05 and 0.10. (c) Because the P-value of 0.0858>a=0.05 


, we fail to reject HO . We do not have 
convincing evidence that the company’s claimed distribution for its deluxe mixed nuts is not correct. 


11.7 Time spent doing homework is quantitative. Chi-square tests for goodness of fit should only be used 
for distributions of categorical data. 
11.9 STATE: HO : Froot Loops contain an equal proportion of each color. Ha 


Froot Loops do not contain an equal proportion of each color. a=0.05 PLAN: Chi- 
square test for goodness of fit. Random: Random sample of Froot Loops. 10%: n=120<10% 


of all Froot Loops. Large Counts: All expected counts = 


120(1/6) = 20 = 5. DO: y2=7.9 ; df=5 . The P-value (0.1618) is 
between 0.15 and 0.20. CONCLUDE: Because the P-value of 0.1618>a=0.05 


, we fail to reject HO . We do not have 
convincing evidence that Froot Loops do not contain an equal proportion of each color. 
11.11 STATE: HO : The distribution of eye color and wing shape is the same as what the 
biologists predict. Ha : The distribution of eye color and wing shape is not the same as what 
the biologists predict. 
a=0.05 PLAN: Chi-square test for goodness of fit. The conditions are met. DO: 
X2=6.187 ; df=3 . The P-value (0.1029) is between 0.10 and 0.15. 
CONCLUDE: Because the P-value of 0.1029>a=0.05 P 


we fail to reject HO . We do not have convincing evidence that the distribution of eye color 
and wing shape is different from what the biologists predict. 

11.13 (a) STATE: Ho/40: Nuthatches do not prefer particular types of trees when searching for seeds and 
insects. Ha/4a: Nuthatches do prefer particular types of trees. a=0.05@ = 0.05 PLAN: Chi-square test for 
goodness of fit. Random: Random sample of 156 red-breasted nuthatches. 10%: n=156<10%, 


n = 156 < 10, of all nuthatches. Large Counts: 84.24, 62.4, and 9.36 are all > 5. DO: y2=7.418 
= 7.418. df=2df = 2. The P-value (0.0245) is between 0.02 and 0.025. CONCLUDE: Because the P- 
value of 0.0245<a=0.050.0245 < a = 0.05, we reject HO/40. There is convincing evidence that 
nuthatches prefer particular types of trees when they are searching for seeds and insects. (b) The 
breakdown of the chi-square statistic is y2=2.407+4.416+0.595 

. The largest contributors are the 
Douglas firs and the ponderosa pines. There are fewer red-breasted nuthatches observed in Douglas firs 
(70-84.24=-14.24) and more red- 
breasted nuthatches observed in ponderosa pines (79-62.4=16.6) 


than we would expect, so the nuthatches 
seem to prefer the ponderosa pines the most and the Douglas firs the least. 


11.15 (a) STATE: HO : Mendel’s 3:1 genetic model is correct. Ha : Mendel’s 
3:1 genetic model is not correct. a=0.05 PLAN: Chi-square test for goodness of fit. The 
conditions are met. DO: y2=0.3453 ; df 1 . The P-value (0.5568) 


is greater than 0.25. CONCLUDE: Because the P-value of 0.5568>0=0.05 


, we fail to reject HO . We do not have 
convincing evidence that Mendel’s 3:1 genetic model is not correct. (b) HO: p=0.75 


, Ha: p40.75 where p= the 
true proportion of peas that will be smooth. a=0.05 ; Z=0.5876 ; P- 
value=0.5568 . CONCLUDE: Because the P-value of 
0.5568>0=0.05 , we fail to reject HO 


We do not have convincing evidence that the true proportion of peas that are smooth is different than 
0.75. In both cases, we fail to reject the null hypothesis. The P-value for each test has the same value, and 


Z2=x2 I 
11.17 (a) HO : The true distribution of flavors for Skittles candies is the same as the 


company’s claim. Ha : The true distribution of flavors for Skittles candies is not the same as 
the company’s claim. 


(b) Each expected count=60(0.2)=12>5 


(c) Using df=4 and Table C, the value for a=0.05@ = 0.05 js 9.49 and for a=0.01 

a = 0.01 is 13.28. So yox” Statistics greater than 9.49 would provide significant evidence at the a=0.05 

a = 0.05 level and yox” values greater than 13.28 would provide significant evidence at the a=0.01 
level. (d) One possibility is 6 lemon, 6 green apple, 16 orange, 16 strawberry, and 16 

grape. x2=10 , which is between 9.49 and 13.28. 

11,79 

M21-c 

11.23 (a) 


(b) There is an association between age group and playing video games for the subjects in the study. As 
age increases, the proportion of adults that play video games decreases. 

11.25 (a) The conditions are met. Random: It is reasonable to consider the two samples independent 
because knowing the response of a heavy reader shouldn’t help us predict the response of a light reader. 


10%: n1=47<10% of heavy readers and 
n2=32<10% of light readers at this large school. 
Normal/Large Sample: n1=47230 and 
n2=32>30 . (b) STATE: 95% CI for p1-p2 
where p1= the true mean English grade of heavy 
readers and p2= the true mean English grade of light readers. PLAN: Two-sample t 
interval for p1-p2 . DO: df=31 ; Table B and df=30 


, (0.1163, 0.4517); Tech: (0.1197, 0.4483) with df=59.46 


CONCLUDE: We are 95% confident that the interval from 0.1197 to 0.4483 captures the true difference 
in the mean English grade of heavy and light readers. (c) No; even though 0 is not in the confidence 
interval, this was an observational study so no conclusion of cause and effect can be made. 


Section 11.2 
Answers to Check Your Understanding 


page 732: 1. HO : There is no difference in the true distributions of superpower preference 


for all survey takers from the United Kingdom and from the United States. Ha : There is a 
difference in the true distributions of superpower preference. 


2. The expected count for the U.K./Fly cell is: (99)(200)415=47.711 


. The rest of the expected counts are shown in the table: 


Country 
U.K. US: Total 
Superpower preference Fly 99 
Freeze time 96 


Invisibility 32.289 34.711 67 


Super strength 20.723 22.211 43 
Telepathy 53.012 56.988 110 
Total 200 215 415 


3. y2=(54-47.711)247.7114(45-51.289)251.289+:--=6.29 


page 737: 1. Random: Independent random samples of 200 survey takers from the U.K. and 215 survey 
takers from the U.S. 10%: n=200 is less than 10% of all U.K. survey takers and n=215 

is less than 10% of all U.S. survey takers. Large Counts: All expected counts (47.7, 51.3, 
46.3, 49.7, 32.3, 34.7, 20.7, 22.3, 53.0, 57.0) are at least 5. 


2. The test statistic is y2=6.29 ; P-value: df=(5-1)(2-1)=4 


Table C: The P-value is between 0.15 and 0.20. 
Tech: y2 cdf(lower: 6.29, upper: 10000, df: 4) =0.1785 


3. Assuming that the distribution of superpower preference is the same for all U.K. and U.S. survey 
takers, there is a 0.1785 probability of observing differences in the distributions as large as or larger than 


the ones in this study by chance alone. 


4. Because the P-value of 0.1785>a=0.05 , we fail to 
reject HO . We do not have convincing evidence that there is a difference in the true 
distributions of superpower preference for all survey takers from the United Kingdom and the United 
States. 

page 740: STATE: HO : There is no difference in the distribution of quality of life for 
patients who have suffered a heart attack in Canada and the U.S. Ha : There is a difference in 
the distribution of quality of life. We will use a=0.01 . PLAN: Chi-square test for 


homogeneity. Random: The data came from independent random samples from Canada and the United 


States. 10%: n1=311 is less than 10% of all Canadian heart attack patients and 


n2=2165 is less than 10% of all U.S. heart attack patients. Large Counts: All 
expected counts (77.37, 538.63, 71.47, 497.53, 109.91, 765.09, 41.70, 290.30, 10.55, 73.45) are at least 5. 
DO:\2=(75-77.37)277.37+:+:+(65-73.45)273.45=11.725 


P-value: df=(5—1)(2-1)=4 


; Table C: The P-value is between 0.01 and 0.02. Tech: y2 cdf(lower: 11.725, upper: 10000, 
df: 4) =0.0195 CONCLUDE: Because the P-value of 0.0195>a=0.01 
, we fail to reject HO . There is not 


convincing evidence that there is a difference in the distribution of quality of life for heart attack patients 
in Canada and the United States. 

page 746: STATE: HO : There is no association between gender and perceived body image 
in the population of U.S. college students. Ha : There is an association between gender and 


perceived body image in the population of U.S. college students. We will use a=0.01 
PLAN: Chi-square test for independence. Random: Random sample of 1200 U.S. college students. 10%: 


n=1200 is less than 10% of the population of all college students. Large Counts: All 
expected counts (541.50, 313.50, 148.83, 86.17, 69.67, 40.33) are at least 5. DO:y2=(560-541.5)2541.5+ 
BOit = (560—541.5)" ,_ (295 313.50)? |... = 47.176 


(295-313.50)2313.50+:--=47.176 541.5 313.50 


P-value: df=(3- 1)(2-1)=244 =(9-—1) 3 = 1) = 2. Table C: The P-value is less than 0.0005. Tech: y2 
Xx” cdf(lower: 47.176, upper: 10000, df: 2) x0 0 
CONCLUDE: Because the P-value of approximately 0<a=0.010 < a@ = 0.01, we reject HO/0. There is 


convincing evidence that there is an association between gender and perceived body image in the 
population of U.S. college students. 


Answers to Odd-Numbered Section 11.2 Exercises 

11.27 Those who completed the red survey were more likely to choose a chocolate candy in a red 
wrapper, those who completed the blue survey were more likely to choose a chocolate candy in a blue 
wrapper, and those who completed the control survey had a slight preference for chocolate candy in a blue 


wrapper. 
11.29 (a) HO : The distribution of candy choice is the same for subjects like these who 
receive the red survey, the blue survey, and the control survey. Ha : The distribution of candy 


choice is not the same. 

(b) Expected count = (20)(26)/60 = 8.67. Remaining expected counts are 8.67, 8.67, 11.33, 11.33, 11.33. 
(c) x2=6.65 

11.31 (a) Random: Treatments were randomly assigned. 10%: Not needed because the volunteers were 
not randomly selected from some population. Large Counts: All expected counts 8.67, 8.67, 8.67, 11.33, 


11.33, and 11.33 are > 5. (b) y2=6.65 . Using df=(2-1)(3-1)=2 


, the P-value (0.0359) is between 0.025 and 0.05. (c) Assuming the distribution of candy choice is the 
same for subjects like these who receive the red survey, the blue survey, and the control survey, there is a 
0.0359 probability of observing differences in the distributions as large as or larger than the ones in this 
study by chance alone. (d) Because the P-value of 0.0359>a=0.01 


, we fail to reject HO . We do not have 
convincing evidence that the distribution of candy choice is different for subjects like these who receive 
the red survey, the blue survey, and the control survey 


11.33 Because we do not have the actual counts of the travelers in each category. We also do not know if 
the sample was taken randomly or if the samples are independent. 


11.35 STATE: HO : There is no difference in the distribution of color for name-brand and 
store-brand gummy bears. Ha : There is a difference in the distribution of color. a=0.05 

PLAN: Chi-square test for homogeneity. Random: Independent random samples. 10%: 
n1=373 is less than 10% of all name-brand gummy bears and n2=622 


is less than 10% of store-brand gummy bears. Large Counts: All expected counts 
130.83, 218.17, 58.86, 98.14, 50.61, 84.39, 77.97, 130.03, 54.73, and 91.27 are > 5. DO: y2=1.81 


; Using df=(5-1)(2-1)=4 


, the P-value (0.7698) is greater than 0.25. CONCLUDE: Because the P-value of 0.7698>a=0.05 


, we fail to reject HO . There is not 
convincing evidence of a difference in the distribution of color for name-brand and store-brand gummy 
bears. 

11.37 (a) 


Treatment 


Outcome Success 
Failure 
Total 


(b) STATE: HO : The true proportions of smokers like these who quit for a year are the 
same for the four treatments. Ha : The true proportions of smokers like these are not the same 


for the four treatments. a=0.05 PLAN: Chi-square test for homogeneity. Random: 4 
groups in a randomized experiment. Large Counts: All expected counts 61.75, 61.75, 62, 40.49, 182.25, 


182.25, 183, and 119.51 are > 5. DO:y2=34.937 ; Using df=(2-1) 
(4-1)=3 


, the P-value<0.0005(#0) 


CONCLUDE: Because the P-value of approximately 0<a=0.05 

, we reject HO . There is convincing evidence 
that the true proportions of smokers like these who are able to quit for a year are not the same for the four 
treatments. 
11.39 The chi-square test statistic is calculated: y2=7.662+ 
2.596 + 2.430 + 0.823 + 10.076 + 3.414 + 5.928 + 2.008 = 34.937. The largest component comes from 


those who had success using both the patch and the drug. Far more people fell in this group than would 


have been expected (87—62=25) , making it the 
most effective treatment. The next largest component comes from those who had success using just the 
patch. Far fewer people were in this group than would have been expected (40-61.75=—21.75) 


, making it the least effective 
treatment. 
11.41 (a) STATE: HO : There is no association between weekly sauna frequency and 
suffering from sudden cardiac death in the population of middle-aged men from eastern Finland. Ha 


: There is an association between weekly sauna frequency and sudden cardiac death in this 
population. (b) The expected counts are 49.326, 124.177, 16.497, 551.674, 1388.823, 184.503. (c) 


¥2=6.032 _ Using df=(2-1)(3-1)=2 


, the P-value (0.0490) is between 0.025 and 0.05. (d) CONCLUDE: Because the P-value of 


0.0490<a=0.05 , we reject HO . There is 
convincing evidence that there is an association between weekly sauna frequency and suffering from 
sudden cardiac death in the population of middle-aged men from eastern Finland. 


11.43 STATE: HO : There is no association between gender and relative finger length in the 
population of U.S. high school students who completed the survey. Ha : There is an 
association between gender and relative finger length in this population. a=0.10 PLAN: 


Chi-square test for independence. Random: Random sample of U.S. high school students. 10%: n=460 


is <10, of the population of all U.S. high school students who 
completed the survey. Large Counts: All expected counts 77.97, 80.03, 42.44, 43.56, 106.59, and 109.41 
are > 5. DO: The calculator’s y2-Test gives y2=2.065 
and P-value=0.3561 using df=2 . CONCLUDE: 


Because the P-value of 0.3561>a=0.100.3561 > a = 0.10, we fail to reject Ho/Z0. There is not 
convincing evidence of an association between gender and relative finger length in the population of U.S. 
high school students who completed the survey. 

11.45 STATE: HO/#0: There is no association between age and opinion about loan-free tuition in the 
population of U.S. adults. Ha/a: There is an association between age and opinion in this population. 


a=0.05a = 0.05 
PLAN: Chi-square test for independence. Random: Random sample of U.S. adults. 10%: n=1509<10% 


n = 1509 < 10% of all US. adults. Large Counts: All expected counts (68.07, 140.68, 285.33, 361.91, 
44.93, 92.86, 188.33, 238.88, 7.00, 14.46, 29.33, 37.21) are > 5. DO: The calculator’s y2-Test 


gives x2=39.755 and P-value~0 
using df=6 . CONCLUDE: Because the P-value 
of approximately 0<a=0.05 , we reject HO 


We have convincing evidence that there is an association between age and opinion about loan-free tuition 
in the population of U.S. adults. 


11.47 (a) Chi-square test for homogeneity; the data came from two independent random samples. (b) Chi- 
square test for independence; the data came from a single random sample (n=1480 US. 
adults), with the individuals classified according to two categorical variables (age and whether or not they 
are vegan/vegetarian). 
11.49 (a) Chi-square test for independence; the data came from a single random sample (n=4854 

young adults aged 19 to 25 years), with the individuals classified according to two 
categorical variables (gender and “Where do you live now?”). (b) HO : There is no 


association between gender and where people live in the population of young adults. Ha : 
There is an association between gender and where people live in this population. (c) Random: Random 


sample of young adults. 10%: n=4854<10% of all young 
adults. Large Counts: The expected counts in the Minitab output are all > 5. (d) Interpretation: Assuming 
there is no association between gender and where people live in the population of young adults, there is a 
0.012 probability of getting a random sample of 4854 young adults with an association as strong as or 
stronger than the one found in this study by chance alone. Conclusion: Because the P-value of 
0.012<a=0.05 , we reject HO . There is 
convincing evidence that there is an association between gender and where people live in the population 
of young adults. 

11.51 y2=3.147+3.309+0.045+0.048+2.181+2.294+ 


0.007+0.007=11.038 ; the largest component comes from males who 
currently live in their parents’ homes. There were more males living in their parents’ homes than would 
have been expected (986-930.51=55.49) 

. The next largest component comes from females who currently live in their parents’ homes. There were 
fewer females living in their parents’ homes than would have been expected (923-978.49=-55.49) 


11.53 (a) HO : There is no difference in the improvement rates for patients like these who 


receive gastric freezing and those who receive the placebo. Ha : There is a difference in the 
improvement rates for patients like these. (b) Assuming there is no difference in the improvement rates 
between patients like these who receive gastric freezing and those who receive the placebo, there is a 
0.570 probability of observing a difference in improvement rates as large as or larger than the difference 
observed in the study by chance alone. CONCLUDE: Because the P-value of 0.570>a=0.05 


, we fail to reject HO . There is not 


convincing evidence of a difference in the improvement rates for patients like these who receive gastric 
freezing and those who receive the placebo. (c) The P-value for this test is identical to the P-value for the 
test in part (a). Also, z2=(—0.57)2=0.3249*y2=0.322 


11.55 e 

Sy a 

11,59 '¢ 

11.61 (a) One-sample t interval for p __ (b) Two-sample z test for the difference between two 
proportions 


11.63 (a) This was an experiment because a treatment (type of rating scale) was deliberately imposed on 
the students who took part in the study. (b) Several of the expected counts are less than 5. 


Answers to Chapter 11 Review Exercises 
R11.1 STATE: HO : The proposed 1:2:1 genetic model is correct. 


Ha : The proposed 1:2:1 genetic model is not correct. a=0.01 
PLAN: Chi-square test for goodness of fit. Random: Random sample of 84 pairs of yellow-green parent 
plants. 10%: n=84<10% of all yellow-green parent plants. 
Large Counts: The expected counts 21, 42, and 21 are all > 5. DO:y2=6.476 
. Using df=3-1=2 

, the P-value (0.0392) is between 0.025 and 0.05. 
CONCLUDE: Because the P-value of 0.0392>a=0.01 ; 
we fail to reject HO . We do not have convincing evidence that the proposed 1:2:1 genetic 
model is not correct. 


R11.2 The result is not valid because several of the expected counts are less than 5. The expected counts 
are 12.5, 0.5, 4.5, 1.5, 5, 12.5, 0.5, 4.5, 1.5, and 5. 


R11.3 (a) Chi-square test for homogeneity. The data came from two independent random samples. (b) 
The value 252 is the expected count for the “2009/None” cell. It is calculated (366)(14,557)/21,177. The 
value 14.113 is the component of the chi-square test statistic that comes from the cell 2009/None. It is 
calculated (192-251.587)2251.587=14.113 

. (c) The 
three cells that contribute most to the chi-square test statistic are: 2015/None, which had more households 
with no TV than expected (174-114=60) : 
2015/One, which had more households with one TV than expected (1680-1494=186) 


; and 2015/Five or more, which had fewer 
households with five or more TVs than expected (392—511=—119) 


R11.4 (a) 


Treatment 


Cardiac event? Yes 


Total 33 34 AO 107 


(b) The success rate was highest for stress management (30/33=0.909) 
, followed by exercise (27/34=0.794) 
and usual care (28/40=0.70) 
.(c) STATE: HO : The true 
success rates for patients like these are the same for all three treatments. Ha : The true success 
rates for patients like these are not the same. a=0.05 


PLAN: Chi-square test for homogeneity. Random: 3 groups in a randomized experiment. Large Counts: 
The expected counts (6.79, 6.99, 8.22, 26.21, 27.01, 31.78) are all => 5. DO:y2=4.84 


. Using df=(2-1)(3-1)=2 


, the P-value (0.0889) is between 0.05 and 0.10. CONCLUDE: Because the P-value of 0.0889>a=0.05 


, we fail to reject HO . We do not have 
convincing evidence that the true success rates for patients like these are not the same for all three 


treatments. 
R11.5 STATE: HO : There is no association between gender and goals for 4th, 5th, and 6th 
grade students in Michigan. Ha : There is an association between gender and goals for these 


students. a=0.05 
PLAN: Chi-square test for independence. Random: Random sample of students. 10%: n=478<10% 


of all 4th, 5th, and 6th grade students in Michigan. Large 
Counts: All of the expected counts 129.7, 117.3, 74.04, 66.96, 47.26, and 42.74 > 5. DO: The calculator’s 


x2-Test gives y2=21.455 and P-value~0.00002 
using df=2 . CONCLUDE: Because the P- 
value of 0.00002<a=0.05 , we reject HO 


There is convincing evidence that there is an association between gender and goals for 4th, 5th, and 6th 
grade students in Michigan. 


Answers to Chapter 11 AP® Practice Test 
T11.1¢ 
T11.2e 
T11.3d 
T11.4c¢c 
T11.5d 


TiGd 

T11.7b 

T11.8a 

T11L3d 

TILI0d 

T11.11 STATE: HO : The distribution of gas types among customers at these service 
Stations is the same as the distributor’s claim. Ha : The distribution at these service stations is 
not the same. a=0.05 

PLAN: Chi-square test for goodness of fit. Random: Random sample of drivers. 10%: n=400<10% 


of all customers at this distributor’s service stations. 
Large Counts: The expected counts 240, 80, and 80 > 5. DO:y2=13.15 


. Using df=3-1=2 
, the P-value (0.0014) is between 0.001 and 
0.0025. CONCLUDE: Because the P-value of 0.0014<a=0.05 
, we reject HO . There is convincing 
evidence that the distribution of gas type among drivers at these service stations is not the same as the 
distributor claims. 
T11.12 (a) Random assignment was used to create three roughly equivalent groups at the beginning of the 


study. (b) HO : The true distributions of subsequent arrest status are the same for all three 


police responses for subjects like these. Ha : The true distributions of subsequent arrest status 
are not the same for all three police responses for subjects like these. (c) If the true distributions of 
subsequent arrest status are the same for all three police responses for subjects like these, there is a 0.0796 
probability of getting differences between the three distributions as large as or larger than the ones 


observed by chance alone. (d) Because the P-value of 0.0796 is larger than a=0.05 , we 
fail to reject HO . There is not convincing evidence that the true distributions of subsequent 


arrest status are not the same for all three police responses for subjects like these. 


T11.13 STATE: HO : There is no association between smoking status and educational level 
among French men aged 20 to 60 years. 

Ha : There is an association between smoking status and educational level among these men. 
a=0.05 PLAN: Chi-square test for independence. Random: Random sample of French 


men. 10%: n=459<10% of all French men aged 20 to 60 
years. Large Counts: The expected counts (59.48, 44.21, 42.31, 50.93, 37.85, 36.22, 42.37, 31.49, 30.14, 


34.22, 25.44, 24.34) are all => 5. DO: The calculator’s y2-Test gives y2=13.305 
and P-value=0.0384 using df=6 
. CONCLUDE: Because the P-value of 0.0384<a=0.05 


, we reject HO . There is convincing 
evidence of an association between smoking status and educational level among French men aged 20 to 


60 years. 


Chapter 12 
Section 12.1 


Answers to Check Your Understanding 
page 781: STATE: B1=the slope of the true regression line relating y=score 


to x=row for students like these. PLAN: t interval for the slope. Linear: The 
scatterplot shows a linear pattern, and the residual plot shows no leftover curved patterns. Independent: 
Knowing one student’s score shouldn’t help predict the score of another student. Normal: There is no 
strong skewness or outliers in the dotplot of the residuals. Equal SD: The amount of scatter of points 


around the residual =0 line varies somewhat, but it is still plausible that o is the same 
for all x-values and that the differences are due to sampling variability. Random: Students were randomly 
assigned a seat location within the classroom. DO: With df=30-2=28, t*=2.048 


-1.117+2.048(0.947)=(-3.056,0.822) 


. Tech: The calculator gives (—3.057, 0.823) 


using df=28 . CONCLUDE: 

We are 95% confident that the interval from —3.056 to 0.822 captures the slope of the true 
regression line relating y=score to X=row for students like these. 
page 786: 1. STATE: We want to test HO: B1=0 , Ha: B1<0 

, where B1= the slope of the 
true regression line relating y=score to x=row for students like these using 
a=0.05 . PLAN: t test for the slope. The conditions for regression inference are met. 
DO: According to the output, the standardized test statistic is t=-1.18¢ = 1.18. Because the computer 


output includes a two-sided P-value and the sample slope is consistent with Ha,P-value=0.248/2=0.124 


CONCLUDE: Because the P-value of 0.124>a=0.05 F 


we fail to reject HO . There is not convincing evidence of a negative linear relationship 
between row and score for students like these. 


2. Type I error: The teacher finds convincing evidence of a negative linear relationship between row and 
score, when the true slope is 0. Type II error: The teacher does not find convincing evidence of a negative 


linear relationship between row and score, when the true slope is <0 . Because we failed to 
reject the null hypothesis, it is possible that we could have made a Type II error. 


Answers to Odd-Numbered Section 12.1 Exercises 
12.1 (a) py=105+4.2(15)=168 


cm 


(b) Because the conditions were met, the distribution of heights for 15-year-old students follows a Normal 
distribution with a mean of 168 cm and a standard deviation of 7 cm. Let X=the height 
of 15-year-old students at this school. We want to find P(X>180) 
. (i) z=180-1687=1.71 
; the proportion of z-scores above 
1.71 is 0.0436. (ii) normalcdf (lower: 180, upper: 1000, mean: 168, SD: 7) =0.0432 . About 
4.32% of 15-year-old students are taller than 180 cm. (c) Probably not; the slope of the sample regression 


line would almost certainly differ from 4.2 due to sampling variability. We would, however, expect the 
slope of the sample regression line to be close to 4.2. 


12.3 The Equal SD condition is not met because the standard deviation of the residuals clearly increases 
as the laboratory measurement (x) increases. 

12.5 Linear: There is no leftover curved pattern in the residual plot, indicating that a linear model is 
appropriate. Independent: Knowing the BAC for one subject should not help us predict the BAC for 
another subject. Normal: The histogram of the residuals shows no strong skewness or outliers. Equal SD: 


The residual plot shows a similar amount of scatter about the residual=0 line for each 


x=number of beers. Random: The students were randomly assigned a certain number of 
cans of beer to drink. 


12.7 (a) bO=-0.012701 . If a student drinks 0 beers, the BAC will be 
—0.012701 , on average. Of course, with 0 beers, BAC should be 0. (b) b1=0.017964 

; for each increase of 1 beer consumed, the average BAC increases by about 
0.017964. (c) s=0.0204 ; the actual BAC amounts typically vary by about 0.0204 from the 
amounts predicted with the least-squares regression line using x=number of beers. (d) 
SEb1=0.0024 ; if we repeated the random assignment many times, the 


slope of the sample regression line would typically vary by about 0.0024 from the slope of the true 
regression line for predicting BAC from number of beers. 


12.9 (a) df=14 , t#=2.977 ; (0.011, 0.025) 
. (b) We are 99% confident that the interval from 
0.011 to 0.025 captures the slope of the true regression line relating y=BAC to x=number 


of beers consumed. (c) If we repeated the experiment many times and computed a 
confidence interval for the slope each time, about 99% of the resulting intervals would contain the slope 


of the true regression line relating y=BAC to x=number of beers 

consumed. 

12.11 STATE: B1=the slope of the true regression line relating y=amount 
expelled to x=tapping time. PLAN: t interval for the slope. Linear: The 


scatterplot shows a linear pattern. Also, the residual plot shows no leftover curved patterns. Independent: 
Knowing the amount expelled by one can does not help us predict the amount expelled by another can. 
Normal: There is no strong skewness or outliers in the histogram of the residuals. Equal SD: The amount 


of scatter of points around the residual=0 line appears to be about the same at all x 


values. Random: Cans were randomly assigned a tapping time. DO: df=38 , using df=30 
, t*=2.042 + (-2.9962, —2.2738) 
. CONCLUDE: We are 95% 
confident that the interval from —2.9962 to —2.2738 captures the slope of 


the true regression line relating y=amount expelled to x=tapping time. 


12.13 STATE: B1= the slope of the population regression line relating y=number 
of clusters of beetle larvae to x=number of stumps. PLAN: t interval 
for the slope. The conditions are met. DO: df=21 , te=2.831 


. (8.678,15.11) 
. CONCLUDE: We are 99% confident that the interval from 8.678 to 15.11 captures the slope of the 


population regression line relating y=number of clusters of beetle larvae to x=number 
of stumps. 
12.15 STATE: HO: B1=0 , Ha: B1<0 
, where B1= the slope of the true 
regression line relating y=com yield to x=weeds per meter; a=0.05 


. PLAN: t test for slope. Linear: There is no leftover curved pattern in the residual plot. 
Independent: Knowing the yield for one meter does not help us predict the yield for another meter. 
Normal: The dotplot of the residuals shows no strong skewness or outliers. Equal SD: The residual plot 


shows a similar amount of scatter about the residual=0 line for all x values. Random: 
The plots were randomly assigned how many weeds were allowed to grow. DO: t=-1.92 


; P-value=0.075/2=0.0375 
. CONCLUDE: 


Because the P-value of 0.0375<a=0.05 , we reject Hoo. 
There is convincing evidence that the slope of the true regression line relating corn yield to weeds per 
meter is negative. In other words, we have convincing evidence that having more weeds reduces corn 
yield. 

12.17 STATE: HO: 61=0/40 : 61 = 9 Ha: B1<0/e * 81 < 9, where B1="1 = the slope of the 
population regression line relating y=heartY ~ heart disease death rate to x=winet = wine consumption 
in the population of countries; a=0.05@ = 0.05, PLAN: t test for the slope. Linear: There is no leftover 
curved pattern in the residual plot. Independent: The sample size n=19<10%" = 19 < 10% of all 
countries. Normal: The histogram of residuals shows no strong skewness or outliers. Equal SD: The 
residual plot shows that the standard deviation of the death rates might be a little smaller for large values 
of wine consumption x, but it is hard to tell with so few data values. Random: The data come from a 
random sample. Because the Equal SD condition is questionable, we will proceed with caution. 
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Do: Using tech, t=-6.46¢ = - 6.46 | df=17df = 17, and P-value ~~ 0. CONCLUDE: Because the P- 


value of approximately 0<a=0.050 < a@ = 0.05, we reject HO . There is convincing 
evidence of a negative linear relationship between wine consumption and heart disease death rate in the 
population of countries. 


12.19 STATE: HO: B1=0 , Ha: B1<0 
, where B1=the slope of the 
true regression line relating y= performance on the 10-question math test to x= volume 
of the music for students like these; a=0.05 . PLAN: t test for the slope. The conditions 
are met. DO: t=-4.163 ; P-value=0.0003/2=0.00015 
. CONCLUDE: 

Because the P-value of 0.00015<a=0.05 we reject HO 

. There is convincing evidence of a negative linear relationship relating y=performance 


on the 10-question math test to x= volume of the music for students like 
these. 


12.21 (a) With df=19 , tk=2.093 ; (9016.4, 14,244.8) 


. (b) Because the automotive group 
claims that people drive 15,000 miles per year, it says that for every increase of 1 year, the mileage would 


increase by 15,000 miles. (c) t=11,630.6—15,0001249=—2.70 


df=19 , the P-value is between 2(0.005)=0.01 
and 2(0.01)=0.02 
. Tech: P-value=0.0142 
. Because the P-value of 0.0142<a=0.05 


, we reject HO . We have convincing 
evidence that the slope of the population regression line relating miles to years is not equal to 15,000. (d) 
Yes; because the interval in part (a) does not include the value 15,000, the interval also provides 
convincing evidence that the slope of the population regression line relating miles to years is not equal to 
15,000. 
1225.¢ 
12.25 a 
27 b 
12.29 (a) This was an experiment because the two treatments (say the color of the printed word and read 
the word) were deliberately assigned to the students. (b) He used a randomized block design where each 
student was a block. He did this to help account for the different abilities of students to read the words or 
to say the color they were printed in. (c) The random assignment was used to help average out the effects 
of the order in which people did the two treatments. For example, if every subject said the color of the 
printed word first and was frustrated by this task, the times for the second treatment (reading the word) 
might be worse. Then we wouldn’t know the reason the times were longer for the second treatment— 
because of frustration or because the second method actually takes longer. 


12.31 (a) There appears to be a moderately strong, positive linear association between the length of time 
to read the word and length of time to identify the color. 


(b) yA=4.887+1.1321x , where y/=the predicted time 


to identify the color and x=the amount of time 


to read the words. (c) y4=4.887+ 1.1321(9)=15.076 


seconds, so the residual is y-y4=13-15.076=—2.076 


seconds. The actual time 
to complete the color task was 2.076 seconds less than the time predicted by the regression line with x=9 


seconds to complete the word task. (d) If the true slope of the regression line relating time 
to identify the colors and time to read the words is 0 and this experiment were repeated many times, there 
is a 0.0215 probability of getting an observed slope of 1.1321 or larger by chance alone. 


Section 12.2 
Answers to Check Your Understanding 
page 810: 1. Option 1: premium/=—343+8.63(58)=$157.54 


Option 2: In(premium)/=—12.98+4.416(In 58)=4.9509 — y=e4.9509=$141.30 
Option 3: In(premium)/=—0.063+0.0859(58)=4.9192 — yA=e4.9192=$136.89 


2. The exponential model (Option 3) best describes the relationship because the scatterplot showing 
In(premium) versus age was the most linear and this model had the most randomly scattered residual plot. 


Answers to Odd-Numbered Section 12.2 Exercises 


12.33 (a) y4=—-0.08594+0.21x , where y is 
the period and x is the length. (b) y\=—0.08594+0.2180=1.792 

seconds 
12.35 (a) yA2=-0.15465+0.0428x , where y is the 


period and x is the length. (b) yA2=—0.15465+0.0428(80)=3.269 


, 80 yA=3.269=1.808 seconds. 
12.37 (a) The scatterplot of log(period) versus log(length) is roughly linear. Also, the residual plot shows 


no obvious 


leftover curved patterns. (b) log y=—0.73675+0.51701 log(x) 


’ 


where y is the period and x is the length. (c) log y=-0.73675+0.51701 log(80)=0.24717 


, 80 yA=100.24717=1.77 seconds 
12.39 log y=1.01+0.72 log(127)=2.525 


y=102.525=334.97 grams is the 
predicted brain weight of Bigfoot. 


12.41 (a) In distance=—3.1099+2.0361-In(speed) 
(b) In distance=—3.1099+2.0361-In(48)=4.772 


, sO distance=e4.772=118.16 feet. 
Residual=110.33-118.16=—7.83 


feet. Mr. Waggoner 


traveled a distance of 7.83 feet less than the distance predicted by the power model with x=48 

mph. 

12.43 (a) Because the scatterplot of In(count) versus time is fairly linear, an exponential model would be 
reasonable. (b) In yA=5.97316-0.218425x 


, where y is the count of 
surviving bacteria (in hundreds) and x is time in minutes. (c) In y4=5.97316-0.218425(17)=2.26 


, SO y\=e2.26=9.58Y = 748 = 9.58 or 958 bacteria. 
12.45 (a) In population’=4.7885+0.0059(years)@ population = 4.7885 + 0.0059 (years) 
(b) In population\=4.7885+0.0059(320)=6.6765!2 population = 4.7885 + 0.0059 (320) = 6.6765 <5 


population\=e(6.6765)=793.54Population = e697) — 793.54 million people. 
12.47 (a) The exponential model because the scatterplot of In(percent) versus distance is roughly linear. A 
power model would not work as well because the scatterplot of In(percent) versus In(distance) is clearly 
curved. (b) In y\=4,6649-0.1091x1 Y = 4.6649 — 0.10912, where y=percent¥ = Percent made and 
x=distance (feet). (c) In y4=4.6649-0.1091 
(21)=2.3738 
, 80 yA=e2.3738=10.738% 


. (d) Because the last few residuals 
(those farthest to the right) were all positive, we might expect that our prediction, which was based on 


x=21 , might also be too small. 
12.49 (a) 


A scatter plot marking length versus weight is shown. 


There is a strong, positive curved relationship between heart weight and length of left ventricle for 


mammals. 


(b) 


Because the relationship between In(weight) and In(length) is roughly linear, heart weight and length 
seem to follow a power model. An exponential model would not be appropriate because the relationship 
between In(weight) and length is clearly curved. (c) In yA=—0.314+3.1387 In x 


, where y is the 
weight of the heart and x is the length of the cavity of the left ventricle. (d) 
In yA=-0.314+3.1387 In(6.8)=5.703 


, 80 yA=e5.703=299.77 grams. 
Sic 
12,53:C 
12.55 (a) (i) z=3-4.50.9=- 1.67 
and z=6-4.50.9=1.67 
; the proportion of z-scores between 
—1.67 and 1.67 is 0.9050. (ii) normalcdf (lower: 3, upper: 6, mean: 4.5, SD: 0.9)=0.9044 


. There is a 0.9044 probability that Marcela’s shower lasts between 3 
and 6 minutes. (b) (i) Solving -0.67=Q1-4.50.9 


gives Q1=3.897 
minutes. Solving 0.67=Q3-4.50.9 gives Q3=5.103 


minutes. (ii) Tech: invNorm(area: 0.25, mean 4.5, SD: 0.9) gives Q1=3.893 
minutes and invNorm(area: 0.75, mean: 4.5, SD: 0.9) gives Q3=5.107 


minutes. Thus, an outlier is any value above 
5.107+1.515.107—3.8932=6.928 


. Because 7>6.928 


, a Shower of 7 minutes would be considered an outlier for Marcela. (c) (i) 
P(X>7)=P(z>7-4.50.9)=0.0027 


; (ii) normalcdf(lower: 7, upper: 1000, mean: 4.5, SD: 0.9)=0.0027 Let Y=the number of 
days that Marcela’s shower is 7 minutes or higher. Y is a binomial 
random variable with n=10 and p=0.0027 . P(Y22)=1 


— binomcdf(trials: 10, p: 0.0027, x value: 1) =0.0003 


. There is a 0.0003 probability that Marcela’s shower time would be 7 minutes or more on 
at least 2 of the 10 days. (d) Because the distribution of X is Normal, the sampling distribution of x — 


is also Normal; px =p=4.5 . Because 
10<10, of days that Marcela showers, the standard deviation is ox =0.910=0.285 
. We want 
P(x >5) . (i) 2=5-4.50.285=1.75 


; P(z>1.75)=0.0401 
(ii) normalcdf(lower: 5, upper: 1000, mean: 4.5, SD: 0.285) =0.0397 . There is a 0.0397 probability 
that the mean length of Marcela’s showers on these 10 days exceeds 5 minutes. 
12.57 (a) STATE: p= the true proportion of all AP® Statistics teachers attending this workshop 
who would say they have a tattoo. PLAN: One-sample z interval for p. Random: Random sample of 
teachers at the institute. 10%: n=98<10 % of the population of 
teachers at this workshop. Large Counts: 23 and 75 are > 10. DO: (0.151, 0.319). CONCLUDE: We are 
95% confident that the interval from 0.151 to 0.319 captures p=the true proportion of 
AP® Statistics teachers at this workshop who would say they have a tattoo. (b) No; because the value 0.29 
is included in the interval of plausible values, we do not have convincing evidence that the true proportion 
of teachers at the workshop who would say they have a tattoo is different from 0.29. (c) If we had two 
more successes, p\=0.25 and the interval would be (0.165, 0.335) 
. If we had two more failures, p\=0.23 
and the interval would be (0.148, 0.312). In both cases, the value 0.29 is included in the 
interval, so even if both of these teachers had responded, the answer to part (b) would not change. 


Answers to Chapter 12 Review Exercises 


R12.1 (a) The scatterplot reveals that there is a negative, moderately strong, linear relationship between 


height (inches) and number of steps. (b) yA=113.57-0.9211x , 


where y/ is the predicted number of steps and x is the height of the student (in inches). 


(c) (i) bO=113.57 ; if a student has a height of 0 inches, the number of steps to 
walk the length of the school hallway is 113.57 steps, on average. Of course, a student with a height of 0 
will not be able to walk anywhere, so this is unreasonable. (ii) b1=—0.9211 


; for each increase of 1 inch in height, the average number of steps to 


walk the length of the school hallway decreases by 0.9211 steps. (iii) s=3.50429 ; the 
actual number of steps needed to walk the length of the school hallway typically varies by about 3.50429 


from the amount predicted with the least-squares regression line using x=height . (iv) 


SEb1=0.1938 ; if we repeated the random selection many times, the slope 
of the sample regression line would typically vary by about 0.1938 from the slope of the population 
regression line for predicting number of steps from height. 

R12.2 Linear: The residual plot shows no leftover curved pattern. Independent: Knowing the number of 
steps needed for one student should not help us predict the number of steps needed for another student. 


Also, the sample size n=36<10% of all students at the high 
school. Normal: The histogram of the residuals does not show strong skewness or outliers. Equal SD: The 
residual plot shows roughly equal amounts of scatter for all x values. Random: The data come from a 


random sample. 


R12.3 STATE: HO: B1=0 , Ha: B1<0 
, where $ 1=the slope of the 
population regression line relating y=number of steps to x=height using 
a=0.05 . PLAN: t test for the slope. The conditions are met. DO:t=—-4.75 
. With df=34 , using df=30 , the P-value (0) is 


less than 0.0005. CONCLUDE: Because the P-value of approximately 0<a=0.05 

, we reject HO . There is convincing evidence 
that the slope of the true regression line relating height to number of steps is negative. In other words, we 
have convincing evidence that taller students require fewer steps to walk the length of the hallway. 


R12.4 STATE: B1= the slope of the population regression line relating y=number 
of steps to x=height . PLAN: t interval for the slope. The conditions are 
met. DO: With df=34 , using df=30 , t®=2.042 


; (-1.317, -0.525) 
. CONCLUDE: We are 95% 
confident that the interval from —1.317 to -0.525 captures the slope of the 


population regression line relating y=number of steps to x=height . The 
interval provides more information because it gives an interval of plausible values for the population 
slope. 


R12.5 (a) The transformation did achieve linearity because there is no leftover curved pattern in the 
residual plot. (b) y4=—0.000595+0.3(1x2) 


where y=intensity and x=distance (c) y4=—0.000595+0.3(1( 
2.1)2)=0.0674 


candelas 


R12.6 (a) A linear model is not appropriate because the scatterplot clearly shows that the relationship 
between weight and price is curved. (b) A power model is more appropriate in this case because the 
scatterplot showing In(price) versus In(weight) is fairly linear, whereas the scatterplot showing In(price) 
versus weight is curved. (c) For the power model, In yA=9.7062+2.2913 In(2)=11.294 


Iny _ 9.7062 + 2.2913 In (2) — 11.294 and y\=e11.294=80,338.16Y = ell.294 = 80, 338.16 dollars. 


For the exponential model, In y=8.2709+1.3791(2)=11.0291! y = 8.2709 + 1.3791 (2) = 11.0291 ang 


* — 211.0291 _ ; 
y\=e11.0291=61,642.08Y = © n8! = 61, 642.08 dollars. Based on my answer to part (b), I think the 
power model will give a better prediction. 


Answers to Chapter 12 AP® Practice Test 

T12.1¢ 

TI2.2b 

Ti23d 

T12.4d 

T12.5d 

T12.6d 

T12,7 ¢ 

T12.8d 

Ti29d 

TH10c 

T12.11 (a) (i) b1=4.8323 ; for each increase of 1 milligram in growth hormone, 
the average weight gain increases by 4.8323 ounces. (ii) bO=4.5459 ; if a chicken 
is given no growth hormone (x=0) , the predicted weight 


gain is 4.546 ounces, on average. (iii) s=3.135 ; the actual weight gain will typically vary 
by about 3.135 ounces from the weight gain predicted with the least-squares regression line using x=dose 


of growth hormone. (iv) SEb1=1.0164 ; if we repeated the 
random assignment many times, the slope of the sample regression line would typically vary by about 
1.0164 from the slope of the true regression line for predicting weight gain from dose of growth hormone. 
(b) STATE: HO: B1=0 , Ha: 8140 
, where B1=the slope of the 
true regression line relating y=weight gain to x=dose of growth hormone; 


a=0.05 . PLAN: t test for the slope. The conditions are met. DO: t=4.75 


and P-value=0.0004 . CONCLUDE: Because the P-value of 


0.0004<a=0.05 , we reject HO . There is 
convincing evidence of a linear relationship between the dose of growth hormone and weight gain for 


chickens like these. (c) STATE: 81=the slope of the true regression line 
relating y=weight gain to x=dose of growth hormone. PLAN: t interval for 
the slope. The conditions are met. DO: df=13 , (2.6369, 7.0277) 


. CONCLUDE: We are 95% confident that the 
interval from 2.6369 to 7.0277 captures the slope of the true regression line relating y=weight 


gain to x=dose of growth hormone for chickens like these. 


T12.12 (a) There is clear curvature evident in both the scatterplot and the residual plot. (b) Option 1: 
y\=2.078+0.0042597(30)3= 


117.09 board feet; Option 2: In yA=1.2319+0.113417(30)= 


4.63441 and y\=e4.63441=102.967 


board feet. (c) The residual plot for 
Option 1 is much more scattered, while the residual plot for Option 2 is curved, meaning that the model 
relating the amount of usable lumber to cube of the diameter is more appropriate. Thus, the prediction of 
117.09 board feet seems more reliable. 


Answers to Cumulative AP® Practice Test 4 
AP4.1 e 
AP4.2 ¢ 
AP4.3 d 
AP4.4a 
AP4.5 b 
AP4.6 e 
AP4.7 d 
AP4.8 a 
AP4.9 e 
AP4.10 a 
AP4.11 b 
AP4.12 d 
AP4.13 e 
AP4.14 d 
AP4.15 b 
AP4.16 a 


AP4.17 e 
AP4.18 c 
AP4.19 a 
AP4.20 b 
AP4.21.e 
AP4.22 c 
AP4.23 b 
AP4.24 c 
AP4.25 d 
AP4.26e 
AP4.27 d 
AP4.28 c 
AP4.29 b 
AP4.30 b 
AP4.31 d 
AP4.32 a 
AP4.33 e 
AP4.34 b 
AP4.35 a 
AP4.36 b 
AP4.37 d 
AP4.38 e 
AP4.39 d 
AP4.40 d 
AP4.41 STATE: HO: p1—-p2=0, Ha: p1-p240 


, where p11= true mean difference in electrical potential for diabetic mice and 2= 


true mean difference in electrical potential for normal mice; a=0.05 


. PLAN: Two-sample ¢ test for p1-p2 . Random: Independent random 
samples. 10%: n1=24<10 % of all diabetic mice and 
n2=18<10 % of all normal mice. Normal/Large Sample: 


The graphs of the data reveal no outliers or strong skewness so a two-sample t procedure is appropriate. 
DO:t=2.55 . Using df=38.46 , P-value=0.0149 
. CONCLUDE: Because the P-value of 0.0149<a=0.05 


, we reject HO . There is convincing 
evidence that the true mean electric potential for diabetic mice differs from that for normal mice. 


AP4.42 (a) HO: p1—p2=0, Ha: p1—p2<0 


, where p1= the true proportion of women like the ones in the study who were physically 


active as teens who would suffer a cognitive decline and p2= the true proportion of 
women like these who were not physically active as teens who would suffer a cognitive decline. (b) A 
two-sample z test for p1—p2P1 ~ P2. (c) No; because the participants were mostly white women from 
only four states, the findings may not be generalizable to women in other racial and ethnic groups or who 
live in other states. (d) Two variables are confounded when their effects on the response variable 
(measure of cognitive decline) cannot be distinguished from one another. For example, women who were 
physically active as teens might have also done other things differently as well, such as eating a healthier 
diet. If healthier diets lead to less cognitive decline, we would be unable to determine if it was their 
physically active youth or their healthier diet that slowed their level of cognitive decline. 


AP4.43 (a) Because the first question called it a “fat tax,” people may have reacted negatively because 
they believe this is a tax on those who are overweight. The second question provides extra information 
that gets people thinking about the obesity problem in the U.S. and the increased health care that could be 
provided as a benefit with the tax money, which might make them respond more positively to the 
proposed tax. The question should be worded in a more straightforward manner. (b) This method samples 
only people at fast-food restaurants. They may go to these restaurants because they like the sugary drinks 
and wouldn’t want to pay a tax on their favorite beverages. Thus, it is likely that the proportion of those 
who would oppose such a tax will be overestimated with this method. A random sample of all New York 
State residents should be taken to provide a better estimate of the level of support for such a tax. (c) Use a 
stratified random sampling method in which each state is a stratum. Select a random sample from each 
state to obtain results for each state and combine the random samples to obtain an overall estimate for the 
nation as a whole. 

AP4.44 Let W=Mr.W = Mr. Worcester arrives first, L=Mr.L = Mr. Legacy arrives first, C=Dr. 


C = Dr. Currier arrives first, and S=the coffee is strong. The tree diagram below 


organizes the given information. 


A tree diagram is shown. 


(a) P(S)=0.16 ; there is a 0.16 probability that the coffee will be strong on a 
randomly selected morning. (b) P(C|S)=0.05/0.16= 

0.3125. Given that the 
coffee is strong, there is a 0.3125 probability that it was brewed by Dr. Currier. 
AP4.45 (a) The scatterplot reveals a strong, negative curved relationship between seed count and seed 
weight. (b) Model B is better because the scatterplot shows a much more linear pattern and its residual 
plot shows no leftover curved pattern. The scatterplot for Model A still has a curved pattern and the 
residual plot 
has a leftover U-shaped pattern. (c) In(weight)4=15.491—1.5222 In(3700)=2.984 


, So weight“=e2.984=19.77mg. 


AP4.46 (a) Let X= diameter of a randomly selected lid. Shape: Normal distribution. Center: 
ux =4 inches. Variability: Because 25 is less than 10% of all lids produced 
that hour, ox =0.004 inches. (b) We want to find P(x <3.99 or 

x >4.01) 


(i) z=3.99-40.004=-2.50 


and 


z=4,01-40.004=2.50 ; the 
proportion of z-scores less than —2.50 or greater than 2.50 is 0.0062+0.0062=0.0124 

. (ii) 1 — normalcdf(lower: 3.99, upper: 4.01, mean: 4, SD: 0.004) 
=0.0124 . Assuming that the machine is working properly, there is a 0.0124 probability that 


the mean diameter of a sample of 25 lids is less than 3.99 inches or greater than 4.01 inches. (c) We want 


to find P(4<x <4.01) 


(i) z=4-40.004=0 and 
z=4,01-40.004=2.50 ; the 
proportion of z-scores between 0 and 2.50 is 0.9938-0.5000=0.4938 

(ii) normalcdf(lower: 4, upper: 4.01, mean: 4, SD: 0.004) =0.4938 . Assuming that the machine is 
working properly, there is a 0.4938 probability that the mean diameter of a sample of 25 lids is between 
4.00 and 4.01 inches. (d) Let Y= the number of samples (out of 5) in which the sample mean 
is between 4.00 and 4.01. The random variable Y has a binomial distribution with n=5 and 
p=0.4938 . Tech: P(X>4)=1 = 
binomcdf(trials: 5, p: 0.4938, x value: 3) = 0.1798 . Assuming that the manufacturing process is 


working correctly, there is a 0.1798 probability that in 5 consecutive samples, 4 or 5 of the sample means 
will be above the desired mean of 4.00 but below the upper boundary of 4.01. (e) Because the probability 
found in part (b) is less than the probability found in part (d), getting a sample mean below 3.99 or above 
4.01 is more convincing evidence that the machine should be shut down. This event is much less likely to 
happen by chance when the machine is working correctly. 


Glossary/Glosario 


1.5xIQR1-5 X IQR rule for outliers 

An observation is called an outlier if it falls more than 1.5xIQR 
1.5 x IQR above the third quartile or below the first quartile. (p. 
64) 


10% condition 


When taking an SRS of size n” from a population of size N,V ’ 
check that n<0.10N.2 < 0.10.N. (p. 419) 


68—-95-99.7 rule 

(also known as the empirical rule) In the Normal distribution with 
mean up and standard deviation o,7> (a) approximately 68% of the 
observations fall within o7 of the mean yp,» (b) approximately 
95% of the observations fall within 2020 of 1, and (c) 
approximately 99.7% of the observations fall within 3030 of p./ 
(p. 117) 


addition rule for mutually exclusive events 
If A and B are mutually exclusive events, P(A or B)=P(A)+P(B). 


alternative hypothesis 


Ha/Za The claim that we are trying to find evidence for ina 
significance test. (p. 554) 


anonymity 
The names of individuals participating in a study are not known 
even to the director of the study. (p. 279) 


association 

A relationship between two variables in which knowing the value 
of one variable helps predict the value of the other. If knowing the 
value of one variable does not help predict the value of the other, 
there is no association between the variables. (p. 19) 


Espanol 


regla 1.5xlal.5 x la gama e 
Se le dice valor atipico a una c 
gama entre cuartiles por encir 
cuartil. (pag. 64) 


condicion del 10% 
Cuando se toma una muestra a 


poblacion de tamafio nV »se 
419) 


regla 68—95-99.7 

A la que también se le dice la ' 
con media pi y desviacién esi 
las observaciones caen dentro 
el 95% de las observaciones ci 
aproximadamente el 99.7% de 
H- (pag. 117) 


regla de suma para eventos ¢ 
Si A y B son eventos que se e) 
P(A or B) = P(A) +P(B). 


hipotesis 
Ha/Za alternativa La proposic 
estadistica estamos tratando de 


anonimato 
Cuando se desconocen los nor 
estudio; inclusive el director d 


asociacion 

Relacion entre dos variables e1 
la prediccion del valor de la ot 
facilita la prediccion del valor 
asociacion entre las variables. 


back-to-back stemplot (also called back-to-back stem-and-leaf 
plot) 

Plot used to compare the distribution of a quantitative variable for 
two groups. Each observation in both groups is separated into a 
stem, consisting of all but the final digit, and a leaf, the final digit. 
The stems are arranged in a vertical column with the smallest at the 
top. The values from one group are plotted on the left side of the 
stem and the values from the other group are plotted on the right 
side of the stem. Each leaf is written in the row next to its stem, 
with the leaves arranged in increasing order out from the stem. (p. 
39) 


bar graph 

Graph used to display the distribution of a categorical variable or to 
compare the sizes of different quantities. The horizontal axis of a 
bar graph identifies the categories or quantities being compared. 
The heights of the bars show the frequency or relative frequency for 
each value of the categorical variable. The graph is drawn with 
blank spaces between the bars to separate the items being 
compared. (p. 10) 


bias 

The design of a statistical study shows bias if it is very likely to 
underestimate or very likely to overestimate the value you want to 
know. (p. 224) 


biased estimator 

A Statistic used to estimate a parameter is biased if the mean of its 
sampling distribution is not equal to the true value of the parameter 
being estimated. (p. 448) 


bimodal 
A graph of quantitative data with two clear peaks. (p. 33) 


binomial coefficient 
The number of ways of arranging kk successes among nt 
observations is given by the binomial coefficient (nk)=n!k!(n—k)! 

n! 
iliac for k=0,1,2,... nk = 0,1,2,..- where 
nl=n(n-1)(n-2)-....3: 2-1 = 1(n — 1) (mn — 2) +...°3+ 2-1 ang 
0!=1.0! = 1. (p. 407) 


ae” 


binomial distribution 
In a binomial setting, suppose we let X=the number of successes. 


diagrama de tallos contiguos 
(también se le dice diagrama d 
comparar la distribuci6n de un 
observacion efectuada en amb 
de todos los digitos salvo el ul 
digito. Los tallos se organizan 
pequefias arriba. Los valores d 
del tallo y los valores del otro 
Cada hoja se coloca en el reng 
dispuestas en orden ascendient 


(pag. 39) 


grafico de barras 

Se usa para ilustrar la distribuc 
comparar el tamanio de diferen 
de barras identifica las categor 
La altura de las barras muestra 
valor de la variable categ6rica. 
entre las barras a fin de separa 
comparar. (pag. 10) 


sesgo 
El disefio de un estudio estadis 
probabilidad de subestimar o s 


calculador sesgado 

La estadistica que se usa para | 
media de la distribucién de su 
parametro que se esta compute 


bimodal 
Grafico de datos cuantitativos 


coeficiente binomial 
La cantidad de maneras de org 


representa con el coeficiente b 
para k=0,1,2,... nk = 0,1,2,. 
3-2-1 y O!=1.m!=n(n-1)(n—-2)-. 


distribucion binomial 
En un entorno binomial, supor 


X = the number of successes. The probability distribution of X 
X is a binomial distribution with parameters n” and p,P> where nn 
is the number of trials of the chance process and pP is the 
probability of a success on each trial. (p. 406) 


binomial probability formula 
If XX has the binomial distribution with n” trials and probability p 
P of success on each trial, the possible values of XX are 0,1,2,...,n. 


0,1,2,...,7. If kk is any one of these values, P(X=k)= 
P(X =k) = (7) p*(1— 


(nk)pk(1—p)n-k. k 


binomial random variable 
The count XX of successes in a binomial setting. The possible 


values of XX are 0,1,2,...,n.9; 1, 2,-++5M (p. 406) 


binomial setting 

Arises when we perform several independent trials of the same 
chance process and record the number of times that a particular 
outcome occurs. The four conditions for a binomial setting: 


e Binary? The possible outcomes of each trial can be classified as 
“success” or “failure.” 


Independent? Trials must be independent; that is, knowing the 
result of one trial must not tell us anything about the result of any 
other trial. 


Number? The number of trials n7 of the chance process must be 
fixed in advance. 


Same probability? There is the same probability pP? of success on 
each trial. (p. 404) 


block 

Group of experimental units that are known before the experiment 
to be similar in some way that is expected to affect the response to 
the treatments. (p. 257) 


boxplot 

A visual representation of the five-number summary. The box spans 
the quartiles and shows the variability of the central half of the 
distribution. The median is marked within the box. Lines extend 
from the box to the smallest and largest observations that are not 
outliers. Outliers are marked with a special symbol such as an 
asterisk (*). (p. 68) 


aciertos.X = la cantidad de aciertos. 
una distribucién binomial con 
cantidad de ensayos del proce: 
un acierto en cualquiera de los 


formula de probabilidad bin 
Si XX tiene la distribucién bi) 
de acierto en cada ensayo, los 
0,1,2,.--,7 Si kk es cualqu 


P(X =k) = ( 
(nk)pk(1—p)n-k. 


variable aleatoria binomial 
La cuenta XX de aciertos ent 


XX son 0,1,2,...,n.9; 1; 2;--- 


entorno binomial 

Surge cuando se realizan varic 
de probabilidad y se anota lac 
dado. Las cuatro condiciones 


e :Binario? Los resultados po 
como “acierto” o “fracaso”. 


e Independiente? Los ensayc 
el resultado de un ensayo nc 
otro ensayo. 


e «Numero? La cantidad de e1 
tiene que fijar con anticipac 


e ¢Misma probabilidad? Exist 
acierto en cada ensayo. (pag 


bloque 

Grupo de unidades experiment 
similares de alguna manera pr‘ 
tratamientos. (pag. 257) 


diagrama de caja y bigotes 
Representacion visual del resu 
cuartiles y muestra la variabili 
Dentro de la caja se marca la r 
caja a las observaciones mas p 
atipicos. Los valores atipicos s 
un asterisco (*). (pag. 68) 


categorical variable 
A variable that assigns labels that place each individual into a 
particular group, called a category. (p. 3) 


census 
Study that attempts to collect data from every individual in the 
population. (p. 221) 


central limit theorem (CLT) 

In an SRS of size n’ from any population with mean yp and finite 
standard deviation 0,7» when n’? is large, the sampling distribution 
of the sample mean x © is approximately Normal. (p. 476) 


Chebyshev’s inequality 


In any distribution, the proportion of observations falling within kk 
1 


standard deviations of the mean is at least 1-1k2.. _ & (p. 119) 


chi-square distribution 

A distribution that is defined by a density curve that takes only non- 
negative values and is skewed to the right. A particular chi-square 
distribution is specified by giving its degrees of freedom. (p. 715) 


chi-square test statistic 
Measure of how far the observed counts are from the expected 
counts. The formula is 


variable categorizada 
Variable que asigna una etique 
grupo particular, conocido con 


censo 
Un estudio en el que se trata d 
la poblacion. (pag. 221) 


teorema del limite central 
Traza una muestra aleatoria se 
con la media p y una desviac 
limite central manifiesta que c 
muestreo de la media de la mu 
476) 


desigualdad de Chebychov 
En cualquier distribucion, la p 


de kk desviaciones estandar di 
119) 


distribuci6n de ji cuadrado 

Distribuci6n que se define por 
valores no negativos y esta ses 
distribucion de ji cuadrado dac 


prueba estadistica de ji cuad 
Una medicion de la distancia € 
previstas. La formula es 


¥, (Observed Count-Expected Count)2Expected Count ¥, (Cuenta Observadas—Cuenta 


. ’ é 9 
> (Cuenta Observadas—Cuenta Previstas)* 


x2 


font Cuenta Previstas 


where the sum is over all possible values of the categorical variable 3 


or all cells in the two-way table. (p. 712) en la que la suma est sobre to 


categorizada o sobre todas las 


chi-square test for goodness of fit 

A test of the null hypothesis that a categorical variable has a 
specified distribution. (p. 718) For more details, see the inference 
summary on page T-16. 


prueba de ji cuadrado para « 
Prueba de la hipotesis nula en 
distribucion especificada. (pag 
de inferencia en la pagina T-1¢ 


chi-square test for homogeneity 

A test of the null hypothesis that the distribution of a categorical 
variable is the same for two or more populations/treatments. (p. 
737) For more details, see the inference summary on page T-16. 


prueba de ji cuadrado de ho 
Prueba de la hipotesis nula en 
categorica por la misma en do: 
Para mas informacion, ver el r 


chi-square test for independence 

A test of the null hypothesis that there is no association between 
two categorical variables in the population of interest. (p. 744) For 
more details, see the inference summary on page T-16. 


cluster sample 

Sample obtained by classifying the population into groups of 
individuals that are located near each other, called clusters, and 
then choosing an SRS of the clusters. All individuals in the chosen 
clusters are included in the sample. (p. 230) 


coefficient of determination r27” 

A measure of the percent reduction in the sum of squared residuals 
when using the least-squares regression line to make predictions, 
rather than the mean value of y.¥: In other words, r2r” measures 
the percent of the variability in the response variable that is 
accounted for by the least-squares regression line. (p. 190) 


comparison 
Experimental design principle. Use a design that compares two or 
more treatments. (p. 247) 


complement 


C 
The complement of event A, written as ACA > is the event that A 
does not occur. (p. 316) 


complement rule 
The probability that an event does not occur is 1 minus the 
probability that the event does occur. In symbols, P(AC)=1-P(A). 


P(A°) =1~ P(A). @ 316) 


completely randomized design 
Design in which the experimental units are assigned to the 
treatments completely by chance. (p. 255) 


components 
Individual terms that are added together to produce the test statistic 


y2eF 
ponent=(Observed Count-Expected Count)2Expected Count 


(p. 752) 


componente = 


prueba de ji cuadrado de inc 
Prueba de la hipotesis nula en 
categoricas dentro de la poblac 
informacion, ver el resumen di 


muestra de cluster 

Muestra que se obtiene clasific 
que estan ubicados uno cerca ¢ 
escogiendo una muestra aleato 
individuos en los clusters esco 


coeficiente de determinaci6n 
Medida del porcentaje de redu 
cuando se hacen predicciones 


minimos cuadrados, en lugar c 
mide el porcentaje de variabili 
a través de la linea de regresid 


comparacion 
Principio de disefo experimen 
tratamientos. (pag. 247) 


complemento 


El] complemento del evento A, 
A no ocurre. (pag. 316) 


regla del complemento 
La probabilidad de que no suc 
que el evento si suceda. En ref 


P(A°) =1-P(A).- pig 


diseno completamente aleato 
Cuando las unidades experime 
manera completamente al azar 


componentes 
Los términos individuales que 


. 
prueba y2:X * 


componente=(Cuenta Observadas-— 


~ , 9 
(Cuenta Observadas— Cuenta Previstas)* 


Cuenta Previstas 


(pag. 752) 


conditional distribution 

Describes how the values of one variable vary among individuals 
who have a specific value of another variable. There is a separate 
conditional distribution for each value of the other variable. (p. 18) 


conditional probability 

Probability that one event happens given that another event is 
already known to have happened. Suppose we know that event A 
has happened. Then the probability that event B happens given that 
event A has happened is denoted by P(BIA). (B\A). To find the 


conditional probability P(BIA),2 (BiA) 
P(ANMB) 


B|A)=P(AnB)P(A) P(A) 


> use the formula 


P(BiA) = 


(p. 331) 


conditional relative frequency 

Gives the percent or proportion of individuals that have a specific 
value for one categorical variable among individuals who share the 
same value of another categorical variable (the condition). (p. 17) 


confidence interval Gives 

an interval of plausible values for a parameter. The interval is 
calculated from the data and has the form 

int estimatetmargin of errorPOint estimate + margin of error 


or, alternatively, 
tistict(critical value)-(standard deviation of statistic) 


(p. 497) 


confidence level CC 

Success rate of the method for calculating the confidence interval. 
In CC'% of all possible samples, the method would yield an interval 
that captures the true parameter value. (p. 497) 


confidential 


A basic principle of data ethics that requires that an individual’s 
data be kept private. (p. 279) 


confounding 

When two variables are associated in such a way that their effects 
on a response variable cannot be distinguished from each other. (p. 
243) 


distribucién condicional 

Describe como varian los valo 
tienen un valor especifico de o 
condicional separada para cadi 


probabilidad condicional 

La probabilidad de que un eve 
evento ya sucedi6. Supongam« 
Entonces la probabilidad de qu 


evento A ya sucedio, se denote 


probabilidad condicional P(B|. 
P (BIA) ee) 
BIA)=P(ANB)P(A) P(A) 


(pag. 331) 


frecuencia relativa condicior 
Ofrece el porcentaje 0 proporc 
especifico para una variable ce 
mismo valor de otra variable c 


intervalo de confianza 
Ofrece un intervalo de valores 
computa a partir de los datos y 


esl 


gtimado de punto+margen de erro 
o alternativamente, 


estadistica+(valor critico)-(desviaci 


estadAstica + (valor ctAtico) (desviaci&h estA ndar de la estadAst ica) 


(pag. 497) 


nivel de confianza CC 

La tasa de aciertos del método 
confianza. En el CC'% de toda 
un intervalo que capta el valor 


confidencial 
Principio basico de la ética de 
un individuo se mantengan en 


confuso 
Cuando dos variables se asocii 
variable de respuesta no se puc 


continuous random variable 

Variable that can take any value in an interval on the number line. 
The probability distribution of a continuous random variable is 
described by a density curve. The probability of any event is the 
area under the density curve and above the values of the variable 
that make up the event. (p. 361) 


control 

Experimental design principle that mandates keeping other 
variables that might affect the response the same for all 
experimental units. (p. 252) 


control group 

Experimental group whose primary purpose is to provide a baseline 
for comparing the effects of the other treatments. Depending on the 
purpose of the experiment, a control group may be given a placebo 

or an active treatment. (p. 248) 


convenience sample 
Sample selected by taking from the population individuals that are 
easy to reach. (p. 223) 


correlation 

r? Measures the direction and strength of the linear relationship 
between two quantitative variables. Correlation is usually written as 
r.r. We can calculate 1” using the formula r=1n-1)\(xi-x sx) 


¥ (2) (#2). 
Sz Sy (pp. 153, 156) 


(yi-y sy). 


critical value 

Multiplier that makes a confidence interval wide enough to have the 
stated capture rate. The critical value depends on both the 
confidence level CC’ and the sampling distribution of the statistic. 
(p. 504) 


cumulative relative frequency graph 

A cumulative relative frequency graph plots a point corresponding 
to the cumulative relative frequency in each interval at the smallest 
value of the next interval, starting with a point at a height of 0% at 
the smallest value of the first interval. Consecutive points are then 

connected with a line segment to form the graph. (p. 93) 


variable aleatoria continua 
Emplea cualquier valor en un | 
probabilidad de una variable a 
de densidad. La probabilidad c 
curva de densidad y encima de 
evento. (pag. 361) 


control 

Principio del disefio experime! 
afectar la respuesta iguales pat 
252) 


grupo de control 

Grupo experimental cuyo fin 5 
la cual se comparan los efecto: 
experimento, a un grupo de co 
tratamiento activo. (pag. 248) 


muestra de conveniencia 
Muestra escogida de individuc 
contacto. (pag. 223) 


correlacion 

1? Mide el sentido y la fuerza 
cuantitativas. La correlacién g 
Calculamos la r con la formule 


_», ri—T yi-Y 
r=— ——_} |——— }. 
n-l 8s Sy 


valor critico 

Multiplicador que amplia el in 
la tasa de captacion indicada. ] 
confianza CC’ como de la dist 
504) 


grafico de la frecuencia relat 
El grafico de frecuencia relatiy 
a la frecuencia relativa acumul 
intervalo que le sigue, comenz 
valor mas pequefo del primer 
los puntos consecutivos con ul 


(pag. 93) 


data analysis 
Process of describing data using graphs and numerical summaries. 


(p. 4) 


density curve 

Curve that (a) is always on or above the horizontal axis and (b) has 
area exactly 1 underneath it. A density curve describes the overall 
pattern of a distribution. The area under the curve and above any 
interval of values on the horizontal axis is the proportion of all 
observations that fall in that interval. (p. 110) 


discrete random variable 

Variable that takes a fixed set of possible values with gaps between. 
The probability distribution of a discrete random variable gives its 
possible values and their probabilities. The probability of any event 
is the sum of the probabilities for the values of the variable that 
make up the event. (p. 362) 


distribution 
Tells what values a variable takes and how often it takes these 
values. (p. 4) 


dotplot 

A graph that displays the distribution of a quantitative variable by 
plotting each data value as a dot above its location on a number 
line. (p. 30) 


double-blind 

An experiment in which neither the subjects nor those who interact 
with them and measure the response variable know which treatment 
a subject received. (p. 249) 


event 

Any collection of outcomes from some chance process. An event is 
a subset of the sample space. Events are usually designated by 
capital letters, like A, B, C, and so on. (p. 315) 


expected counts 
Expected numbers of individuals in the sample that would fall in 


each cell of the one-way or two-way table if Hoo were true. (p. 
711) 


analisis de los datos 
Proceso que describe los datos 
numeé€ricos. (pag. 4) 


curva de densidad 

Curva que (a) siempre esta sot 
1 area exactamente debajo. La 
de una distribucion. El area de 
de valores en el eje horizontal 
que caen en dicho intervalo. (f 


variable aleatoria discreta 

Variable que emplea un conju 
hay brechas. La distribuci6n d 
discreta arroja valores posible: 
cualquier evento es la suma de 
variable que compone el event 


distribucién 
Indica qué valores adopta una 
valores. (pag. 4) 


grafico de puntos 
Grafico que muestra la distribu 
valor de cada dato encima de s 


(pag. 30) 


doble ciego 

Experimento en el que ningun: 
con los sujetos y que miden la 
recibio el sujeto. (pag. 249) 


evento 

Cualquier coleccion de los res 
decir, un evento es un subconj 
generalmente se designan con 
sucesivamente. (pag. 315) 


cuentas previstas 
Las cantidades previstas de inc 


celda en la tabla, sea de una vi 
711) 


experiment 
A study in which researchers deliberately impose treatments on 
individuals to measure their responses. (p. 241) 


experimental unit 

The object to which a treatment is randomly assigned. When the 
experimental units are human beings, they are often called subjects. 
(p. 245) 


explanatory variable 
Variable that may help explain or predict changes in a response 
variable. (p. 153) 


exponential model 

Relationship of the form y=abx.¥ = ab". If the relationship 
between two variables follows an exponential model and we plot 
the logarithm (base 10 or base e€) of y¥ against x,”» we should 
observe a straight-line pattern in the transformed data. (p. 803) 


extrapolation 

Use of a regression model for prediction far outside the interval of 
values of the explanatory variable x® used to obtain the model. 
Such predictions are often not accurate. (p. 178) 


factor 
Explanatory variable in an experiment. (p. 246) 


factorial 

For any positive whole number n,’» its factorial n!! is 
nl=n(a-I(n-2)-...3:24M! = (m= 1) (m= 2)... BDA 
In addition, we define 0!=1.9! = 1. (p. 407) 


fail to reject 


Hoo If the observed result is not unlikely to occur when the null 


hypothesis is true, we should fail to reject Hoo and say that we do 
not have convincing evidence for HaHa: (p. 558) 


first quartile 
qi9i If the observations in a data set are ordered from smallest to 


largest, the first quartile Qq1@i is the median of the observations 
whose position is to the left of the median. (p. 64) 


experimento 
Estudio en el que los investiga 
tratamientos a individuos, con 


unidad experimental 

Objeto al cual se le asigna un t 
experimentales son seres humé 
sujetos. (pag. 245) 


variable explicativa 
Variable que puede ayudar a e 
respuesta. (pag. 153) 


modelo exponencial 

Relacién de la forma y=abx..... 
un modelo exponencial, y traz 
de y¥ con respecto a x,» sed 
datos transformados. (pag. 80: 


extrapolacion 

Uso de un modelo de regresi61 
intervalo de valores de la varie 
el modelo. Tales predicciones 


factor 
La variable explicativa en un € 


factorial 
Para cualquier numero entero | 
n!=n-(n-1)-(n-2):...°3-2°-L..... 


no rechazar 

Hoo si no es improbable qu 
verdad la hipotesis nula, no se 
no contamos con evidencia co! 


primer cuartil 
QqiQi Si las observaciones de 


ascendiente (del mas pequefio 
media de las observaciones cu 


five-number summary 
Smallest observation, first quartile, median, third quartile, and 


largest observation, written in order from smallest to largest. In 
symbols: 
nimumQ1MedianQ3Maximum Minimum Q; Median Qs Maximum 


(p. 68) 


frequency table 
Table that displays the count (frequency) of observations in each 
category or interval. (p. 9) 


general addition rule 

If A and B are two events resulting from a chance process, then the 
probability that event A or event B (or both) occur is P(A or 
B)=P(AUB)=P(A)+P(B)-P(AnB). 

P(A orB) = P(AUB) = P(A) + P(B) — P(ANMB). (p. 


320) 


general multiplication rule 
The probability that events A and B both occur can be found using 


the formula P(ANB)=P(A)-P(BIA)P (AB) = P(A) - P (BIA) 


where P(BIAY? (B\A) is the conditional probability that event B 
occurs given that event A has already occurred. (p. 338) 


geometric distribution 

In a geometric setting, suppose we let Y=the number of trials 

Y = the number of trials jt takes to get a success. The probability 
distribution of YY is a geometric distribution with parameter p,?» 
the probability of a success on any trial. The possible values of Y 


are 1,2,3,....4) 2) 3)++++ (p. 423) 


geometric probability formula 

If YY has the geometric distribution with probability pP? of success 
on each trial, the possible values of YY are 1,2,3... A, 2,3,.65+ Tf 
kk is any one of these values, P(Y=k)=(1—p)k-1p. 


P(Y =k) =(1—p)*"'p. (p. 423) 


geometric random variable 


media. (pag. 64) 


resumen de cinco cifras 

Consta de la observacion mas 
cuartil y la observacién mas gi 
desde la mas pequenia hasta la 
simbolica, el resumen de cincc 


MinimoQ1MediaQ3Maximo........- 


(pag. 68) 


tabla de frecuencias 
Muestra la cuenta (frecuencia) 
intervalo. (pag. 9) 


regla general de adicion 

Si A y B son dos eventos cual 
probabilidad, la probabilidad c 
suceda es P(A 0 B)=P(AUB)= 


P(AoB) = P(AUB) = P(A) + P(B) — P(ANB). 


regla general de multiplicaci 
La probabilidad de que suceda 
utilizando la formula P(A NB): 


en la que P(BIAY? (B\A) es 
evento B a la luz de que el eve 


distribucién geométrica 

En un entorno geométrico, suf 
cantidad de ensayos que se pre 
de la probabilidad de YY es u 
p,?» la probabilidad de lograr 


posibles de YY son 1,2,3,.. J 


formula de probabilidad geo 
Si Y tiene una distribuci6n gec 


cada ensayo, los posibles valoi 
uno cualquiera de estos valore 


P(Y =k)=(1—p)" "pg 


variable aleatoria geométric: 


The number of trials YY that it takes to get a success in a La cantidad de ensayos YY qu 
geometric setting. (p. 423) entorno geométrico. (pag. 423 


entorno geométrico 
Surge un entorno geométrico ¢ 
mismo proceso de probabilida 
precisan para lograr un acierto 
lograr un acierto tiene que ser 


geometric setting 
Arises when we perform independent trials of the same chance 
process and record the number of trials it takes to get one success. 
On each trial, the probability p? of success must be the same. (p. 
422) 


histograma 
Muestra la distribucion de una 
intervalo de valores en forma ( 
frecuencias o las frecuencias r 
40) 


histogram 
Graph that displays the distribution of a quantitative variable by 
showing each interval of values as a bar. The heights of the bars 
show the frequencies or relative frequencies of values in each 

interval. (p. 40) 


eventos independientes 
Dos eventos son independient 
probabilidad de que el otro suc 
independientes si 

A |B)=P(A|BC)=P(A)?a =? a9 = 2) 


independent events 
Two events are independent if the occurrence of one event does not 
change the probability that the other event will happen. In other 
words, events A and B are independent if 


1B)=P(AIBC)=P(a)? (AIB) = P (A|BS) = P(A) P( 


and y 


P( B |A)=P(B|AC)=P(B)’e=7@4°) = PB) 
P(B\A) = P (Blac) = P(B) 
| (pag. 335) 
(p. 335) 
independent random variables variables aleatorias indepent 
If knowing the value of XX does not help us predict the value of <i eanacerlos valores de xe 


y,¥; then XX and YY are independent random variables. (p. 390) entonces XX y YY son varial 


individual individuo 
An object described by a set of data. Individuals may be people, Un objeto descrito por un conj 
animals, or things. (p. 2) personas, animales 0 cosas. (p: 


inference inferencia 
Drawing conclusions that go beyond the data at hand. (pp. 6, 270) Llegar a conclusiones que van 
(pags. 6, 270) 


inference about cause and effect 

Conclusion from the results of an experiment that the treatments 
caused the difference in responses. Requires a well-designed 
experiment in which the treatments are randomly assigned to the 
experimental units. (p. 280) 


inference about a population 

Conclusion about the larger population based on sample data. 
Requires that the individuals taking part in a study be randomly 
selected from the population of interest. (p. 280) 


influential observation 

An observation is influential for a statistical calculation if removing 
it would markedly change the result of the calculation. Points that 
are outliers in the x® direction of a scatterplot are often influential. 
(p. 200) 


informed consent 

Basic principle of data ethics that states that individuals must be 
informed in advance about the nature of a study and any risk of 
harm it may bring. Participating individuals must then consent in 
writing. (p. 279) 


institutional review board 
Board charged with protecting the safety and well-being of the 


participants in advance of a planned study and with monitoring the 
study itself. (p. 279) 


interquartile range 


IQR=Q3-Q1. JOR = Q3 — Qi- (p. 64) 


intersection 

The event “A and B” is called the intersection of events A and B. It 
consists of all outcomes that are common to both events, as is 
denoted by ANB.AM B. (p. 323) 


joint relative frequency 

Gives the percent or proportion of individuals that have a specific 
value for one categorical variable and a specific value for another 
categorical variable. (p. 15) 


inferencia sobre causa y efec 
Uso de los resultados de un ex 
son los tratamientos los que m 
un experimento bien disefiado 
manera aleatoria a las unidade 


inferencia sobre una poblaci 
Conclusion sobre una poblacic¢ 
Se precisa que los participante 
aleatoria a partir de la poblacic 


observacion influyente 

La observacion es influyente e 
notaria un cambio sustancial e 
son valores atipicos en el senti 
son influyentes. (pag. 200) 


autorizacion informada 

Principio basico de ética en la 
ha de informar con antelacién 
riesgos 0 perjuicios que podria 
luego tendran que dar su autor 


junta de revision institucion< 
Principio basico de la ética de 
planificados tienen que contar 
contar con un monitoreo por u 
funcion consiste en salvaguarc 
participantes. (pag. 279) 


gama entre cuartiles 


IQR=Q3-Q1.JQR = Q3 — Qi. 


interseccion 

Al evento “A y B” se le conoc 
Consiste en todos los resultadc 
como lo expresa ANB.A MB. 


frecuencia relativa conjunta 
Ofrece el porcentaje 0 proporc 
especifico para una variable cé 
variable categ6rica. (pag. 15) 


Large Counts condition 
Suppose XX is the number of successes and p\p is the proportion 
of successes in a random sample of size n” from a population with 
proportion of successes p.?: The Large Counts condition says that 
the distribution of XX and the distribution of pp will be 
approximately Normal if np>10"? = 10 and n(1-p)=10. 
n(1—p) > 10. (p. 420) 


Large Counts condition for a chi-square test 
It is safe to use a chi-square distribution to perform calculations if 
all expected counts are at least 5. (p. 717) 


law of large numbers 

If we observe more and more repetitions of any chance process, the 
proportion of times that a specific outcome occurs approaches a 
single value, which we call the probability of that outcome. (p. 301) 


least-squares regression line 
The line that makes the sum of the squared residuals as small as 
possible. (p. 183) 


level 
Specific value of an explanatory variable (factor) in an experiment. 
(p. 246) 


margin of error 
The difference between the point estimate and the true parameter 

value will be less than the margin of error in CC'% of all samples, 
where CC is the confidence level. (p. 271) 


marginal relative frequency 


Gives the percent or proportion of individuals that have a specific 
value for one categorical variable. (p. 14) 


matched pairs design 

Common form of blocking for comparing just two treatments. In 
some matched pairs designs, each subject receives both treatments 
in arandom order. In others, the subjects are matched in pairs as 


condicion de cuentas grande: 


Supongamos que XX represe1 
proporcion de aciertos en una_ 
una poblaci6on con una propor 


grandes establece que la distril 
aproximadamente normal si n| 


condicion de cuentas grande: 
Se puede utilizar sin problema 
computos si todas las cuentas | 


ley de las cifras grandes 

Si se observan mds y mas repe 
probabilidad, la proporci6n de 
aproxima a un valor sencillo, < 
dicho resultado. (pag. 301) 


linea de regresion de minimc 
La linea que reduce al minimo 


(pag. 183) 


nivel 
Valor especifico de una variab 


(pag. 246) 


margen de error 

La diferencia entre el estimadc 
menor que el margen de error 
es el nivel de confianza. (pag. 


frecuencia relativa marginal 
Ofrece el porcentaje 0 proporc 
especifico para una variable ce 


diseno de pares coincidentes 
Forma comun de crear bloque: 
tratamientos. En algunos disefi 
somete a ambos tratamientos e€ 


closely as possible, and each subject in a pair is randomly assigned ponen en pares que coincidan - 
to receive one of the treatments. (p. 260) asigna de manera aleatoria a fi 
260) 


mean media 

The average of all the individual data values in a distribution of El] promedio de todos los valo1 
quantitative data. To find the mean, add all the values and divide by de datos cuantitativos. Para ob 
the total number of observations. If the data come from a sample, se divide el resultado entre el 1 
use x © to denote the sample mean. If the data come from a forman parte de una muestra, 1 
census, use p to denote the population mean. (p. 54) Si los datos fueron obtenidos < 
media de la poblacion. (pag. 5. 


mean of a density curve media de una curva de densi 
Point at which a density curve would balance if made of solid E] punto en el cual la curva se 
material. (p. 112) material macizo. (pag. 112) 


mean (expected value) of a discrete random variable media (valor previsto) de un. 
Describes the variable’s long-run average value over many, many Describe el valor promedio de 
repetitions of the same chance process. To find the mean (expected muchas repeticiones del mism: 
value) of xx > multiply each possible value by its probability, then valor previsto) de x,x > se mu 
add all the products: probabilidad y luego se suman 
.=E(X)=x1 p1+x2p2+x3 p3+...=). xi pi LAX=E(X)=x1 pl+x2 p2+x3 p3+...= 
px = E(X)=2, p, +22 po +73 p3+...= 
(p. 366) 
(pag. 366) 
median media 
The midpoint of a distribution; the number such that about half the E] punto intermedio de una dis 
observations are smaller and about half are larger. To find the aproximadamente la mitad de 
median of a distribution: (1) Arrange all observations in order of mitad son mas grandes. Para h 
size, from smallest to largest. (2) If the number of observations n/t organizan todas las observacio 
is odd, the median is the middle observation in the ordered list. (3) pequefias a las mas grandes. (2 
If the number of observations n’t is even, the median is the average impar, la media es el la observ 
of the two middle observations in the ordered list. (p. 57) Si la cantidad de observacione 


dos observaciones en el medio 


median of a density curve media de una curva de densi 
The point with half the area under the curve to its left and the El punto en el que la mitad del 
remaining half of the area to its right. (p. 112) izquierda y la otra mitad del a1 


mode modo 
Value in a distribution having the greatest frequency. En una distribucion, el valor gq 


multiple comparisons comparaciones multiples 
Problem of how to do many comparisons at once with an overall E] problema de cémo hacer mi 
measure of confidence in all our conclusions. (p. 728) de confianza general en todas - 


multiplication rule for independent events 

If A and B are independent events, then the probability that A and 
B both occur is P(ANB)=P(A):P(B)./ (AB) = P(A): P(B). 
(p. 344) 


mutually exclusive (disjoint) events 
Two events A and B that have no outcomes in common and so can 


(A and B) = 0. (p. 


never occur together. That is, P(A and B)=0.P 
317) 


negative association 
When above-average values of one variable tend to accompany 
below-average values of the other. (p. 157) 


nonresponse 
Occurs when an individual chosen for the sample can’t be contacted 
or refuses to participate. (p. 233) 


Normal approximation to a binomial distribution 

Suppose that a count XX of successes has the binomial distribution 
with n’ trials and success probability p.P’ When n’ is large, the 
distribution of XX is approximately Normal with mean np”? and 


standard deviation np(1—p). V "? (1 — p). We use this Normal 
approximation when np>10"P = 10 and n(1—p)>10. 


Normal curve 
Important kind of density curve that is symmetric, single-peaked, 
and bell-shaped. (p. 114) 


Normal distribution 

Distribution described by a Normal density curve. Any particular 
Normal distribution is completely specified by two numbers, its 
mean pu and standard deviation 0.7. The mean of a Normal 
distribution is at the center of the symmetric Normal curve. The 
standard deviation is the distance from the center to the change-of- 
curvature points on either side. (p. 114) 


Normal/Large Sample condition 
A condition for performing inference about a mean, which requires 


regla de multiplicaci6n de ev 
Si A y B son eventos indepenc 
ambos, tanto A como B es P(/ 
P(ANMB) = P(A)-P(B). 


exclusivos mutuamente (dese 
Dos eventos A y B que no tien 


pueden suceder a la vez. Es de 


asociacion negativa 
Cuando los valores por encimé 
acompanar a los valores por de 


no respondi6 
Sucede cuando a un individuo 
contactar 0 el sujeto se niega a 


aproximacion Normal hacia 
Supongamos que una cuenta > 
con n?t ensayos y una probabi 
distribucién de XX es aproxir 


desviacion estandar np(1—p). \ 
Normal cuando np>10"P = 1 


curva Normal 
Tipo importante de curva de d 
con la forma de curva de camy 


distribucién Normal 

Segun la describe una curva de 
Normal dada se especifica con 
desviacion estandar o.7- Lam 
centro de la curva Normal sim 
del centro a los puntos a ambo 


condicion de muestra 
Normal/Grande Condicion par 


that the data come from a Normally distributed population or that 


the sample size is large (n>30),(" > 30). When the sample size is 
small and the shape of the population distribution is unknown, a 
graph of the sample data shows no strong skewness or outliers. 
When performing inference about a difference between two means, 
check that this condition is met for both samples. (p. 532) 


Normal probability 

plot Plot used to assess whether a data set follows a Normal 
distribution. If the points on a Normal probability plot lie close to a 
straight line, the plot indicates that the data are approximately 
Normally distributed. (p. 133) 


null hypothesis Hoo 
Claim we weigh evidence against in a significance test. Often the 
null hypothesis is a statement of “no difference.” (p. 554) 


observational study 
Study that observes individuals and measures variables of interest 
but does not attempt to influence the responses. (p. 242) 


observed counts 
Actual numbers of individuals in the sample that fall in each cell of 
the one-way or two-way table. (p. 711) 


one-sample té interval for a mean difference 
Confidence interval used to estimate a population mean. (p. 679) 
For more details, see the inference summary on page T-16. 


one-sample tf test for a mean 

A test of the null hypothesis that a population mean is equal to a 
specified value. (p. 529) For more details, see the inference 
summary on page T-16. 


one-sample z2 interval for a proportion 
Confidence interval used to estimate a population proportion. (p. 
511) For more details, see the inference summary on page T-16. 


one-sample z2 test for a proportion 
A test of the null hypothesis that a population proportion is equal to 


requiere que los datos proceda 


o que el tamafio de la muestra 
es pequefia y se desconoce la f 
grafico de la muestra no tendré 
inferencia sobre la diferencia ¢ 
cumplan con esta condicion. (| 


grafico de probabilidad Nor 
Se usa para evaluar si un conjt 
normal. Si los puntos en un gri 
una linea recta, el grafico indic 
distribuido Normalmente. (pag 


hipotesis nula Hoo 
Contrapeso de la evidencia en 
hipotesis nula es una declaraci 


estudio de observacion 
Se observan los individuos y s 
trata de influir en las respuesta 


cuentas observadas 
Las cifras reales que correspor 
cada celda de la tabla de una v 


intervalo té de una sola mue 
Intervalo de confianza que se | 
(pag. 679) Para mas informaci 
T-16. 


intervalo té de una sola mue 
Prueba de la hipotesis nula en 
valor especificado. (pag. 529) 
inferencia en la pagina T-16. 


intervalo z2 de una sola mue 
Intervalo de confianza que se | 
poblacion. (pag. 511) Para ma: 
en la pagina T-16. 


prueba z2 de una sola muest 
Prueba de la hipotesis nula en 


a specified value. (p. 573) For more details, see the inference 
summary on page T-16. 


one-sided alternative hypothesis 

An alternative hypothesis that states that a parameter is larger than 
the null hypothesis value or that states that the parameter is smaller 
than the null value. (p. 555) 


one-way table 
Table used to display the distribution of a single categorical 
variable. (p. 709) 


outlier 

Individual value that falls outside the overall pattern of a 
distribution. Call an observation an outlier if it falls more than 1.5 
IQR above the third quartile or more than 1.5 IQR below the first 
quartile. (pp. 34, 99) 


outlier in regression 

Observation that lies outside the overall pattern of the other 
observations. Points that are outliers in the y¥ direction but not the 
x direction of a scatterplot have large residuals. Other outliers 
may not have large residuals. (p. 157) 


PP-value 
The probability of getting evidence for the alternative hypothesis 


Hatta as strong or stronger than the observed evidence when the 
null hypothesis HO/#0 is true. The smaller the PP-value, the 


stronger the evidence against Hoo and in favor of Ha!ta 
provided by the data. (p. 558) 


paired data 

The result of recording two values of the same quantitative variable 
for each individual or for each pair of similar individuals. To 
analyze paired data, start by computing the difference for each pair. 
Then make a graph of the differences. Use the mean difference and 
the standard deviation of the differences as summary statistics. (p. 
673) 


paired té interval for a mean difference 

Confidence interval used to estimate a population (true) mean 
difference. (p. 677) For more details, see the inference summary on 
page T-16. 


igual a un valor especificado. | 
resumen de inferencia en la pa 


hipotesis alternativa unilater 
Hipotesis alternativa que indic 
valor de la hip6dtesis nula o qui 
el valor nulo. (pag. 555) 


tabla de una via 
Se usa para mostrar la distribu 
709) 


valor atipico 

Un valor individual que cae pc 
Una observacion se considera 
intercuartil (IQR) 1.5 veces mz 
intercuartil 1.5 veces menor qu 


valor atipico en regresion 

Observaci6on que yace por fue! 
observaciones. Los puntos que 
en el sentido x® de un grafico 
posible que otros valores atipic 


valor PP 


La probabilidad de obtener evi 
tan fuerte o mas fuerte que la ¢ 


Ho es verdadera. Cuanto meni 


evidencia contra Hoo yen fi 
(pag. 558) 


datos apareados 

El resultado de registrar dos vi 
cada individuo o por cada par 
apareados, calcula primero la « 
grafico de las diferencias. Usa 
de las diferencias como estadi: 


intervalo t€ apareado para li 
Intervalo de confianza que se | 
una poblacion. (pag. 677) Pare 
inferencia en la pagina T-16. 


paired t? test for a mean difference 

A test of the null hypothesis that a population (true) mean 
difference is equal to a specified value, usually 0. (p. 680) For more 
details, see the inference summary on page T-16. 


parameter 
A number that describes some characteristic of the population. (p. 
442) 


percentile 
The pth percentile of a distribution is the value with p? percent of 
the observations less than it. (p. 91) 


pie chart 
Chart that shows the distribution of a categorical variable as a “pie” 
whose slices are sized by the counts or percents for the categories. 


A pie chart must include all the categories that make up a whole. (p. 


10) 


placebo 
A treatment that has no active ingredient but is otherwise like other 
treatments. (p. 244) 


placebo effect 
Describes the fact that some subjects respond favorably to any 
treatment, even an inactive one (placebo). (p. 249) 


point estimate 
Specific value of a point estimator from a sample. (p. 494) 


point estimator 
Statistic that provides an estimate of a population parameter. (p. 
494) 


pooled or combined sample proportion 
The overall proportion of successes in the two samples is 


=count of successes in both samples combinedcount of individuals in p 


h samples combined=X1+X2n1+n2 


(p. 632) 


prueba tf apareada para la « 
Prueba de la hipotesis nula en 
poblacion es igual a un valor e 
mas informacion, ver el resum 


parametro 
Numero que describe algunas 
442) 


percentil 
El percentil pavo de una distril 
observaciones es menor que la 


grafico circular 

Muestra la distribuci6n de una 
circulo subdividido segun las « 
grafico circular tiene que inclu 
totalidad. (pag. 10) 


placebo 
Tratamiento que, mas alla de r 
los tratamientos. (pag. 244) 


efecto placebo 
Describe el hecho de que algu 
cualquier tratamiento, incluso 


estimado de punto 
El valor especifico de un estir 
494) 


estimador de punto 
Estadistica que nos da un estin 
494) 


proporcion combinada de la 
La proporcion total de aciertos 


cuenta de aciertos enambas mut 
eniambas muestras combinadas=X1 


cuenta de aciertos en 


ambas muestras combinadas X 


cuenta de individuos en n 


ambas muestras combinadas 


population 
In a Statistical study, the entire group of individuals we want 
information about. (p. 221) 


population (true) regression line 


Regression line py=B0+B1xMy ~ Po + Aix based on the entire 
population of data. (p. 770) 


positive association 

When above-average values of one variable tend to accompany 
above-average values of the other and below-average values also 
tend to occur together. (p. 157) 


power 
The probability that a test will reject Hoo at a chosen significance 
level a@ when a specified alternative value of the parameter is true. 
The power of a test against any alternative is 1 minus the 
probability of a Type II error for that alternative; that is, 


power=1-B.Power = 1 - B. (p. 598) 


power model 


Relationship of the form y=axp.¥ = @2”- When experience or 
theory suggests that the relationship between two variables is 
described by a power model, you can transform the data to achieve 
linearity in two ways: (1) raise the values of the explanatory 


variable x® to the pP power and plot the points (xp,y),(@"» Y)> or 
(2) take the pth root of the values of the response variable y¥ and 


plot the points (x,yp). (2, 9): If you don’t know what power to 
use, taking the logarithms of both variables should produce a linear 
pattern. (p. 796) 


predicted value 


yy (read “y¥ hat”) is the predicted value of the response variable 
yY for a given value of the explanatory variable x.2- (p. 181) 


probability 

A number between 0 and 1 that describes the proportion of times an 
outcome of a chance process would occur in a very long series of 
repetitions. (p. 301) 


probability distribution 


(pag. 632) 


poblaciéon 
En un estudio estadistico, la pc 
sobre el cual deseamos contar 


linea de regresion (real) de I 


La linea de regresi6n pry=B0+{ 
la poblacion de datos. (pag. 77 


asociacion positiva 

Cuando los valores por encim< 
acompanar a los valores por et 
por debajo del promedio tamb: 


poder 


La probabilidad de que una pri 
significancia dado cuando un \ 
es verdadero. El poder de una 
1 menos la probabilidad de un 
poder=1-p,Poder = 1 — 8. 


modelo de poder 

Relacion de la forma y=axp..... 
que la relacién entre dos varial 
pueden transformar los datos f£ 
(1) elevar los valores de la var 


los puntos (xp,y),(2”: Ys 0 (2 


de respuesta yY y trazar los pu 
potencia se ha de usar, se tome 
producir un patron lineal. (pag 


valor proyectado 


yy es el valor proyectado de 
dado de la viariable explicativ. 


probabilidad 
Cifra entre 0 y 1 que describe. 
proceso aleatorio sucederia en 


(pag. 301) 


distribuci6n de la probabilid 


Gives the possible values of a random variable and their 
probabilities. (p. 362) 


probability model 
Description of some chance process that consists of two parts: a 

sample space S that lists all possible outcomes and a probability for 
each outcome. (p. 314) 


quantitative variable 
Variable that takes number values that are quantities—counts or 
measurements. (p. 3) 


quartiles 
The quartiles of a distribution divide the ordered data set into four 
groups having roughly the same number of values. (p. 64) 


random assignment 
Experimental design principle. Use chance to assign experimental 


of experimental units by balancing the effects of other variables 
among the treatment groups. (p. 251) 


Random condition 

A condition for performing inference, which requires that the data 
come from a random sample from the population of interest or from 
a randomized experiment. When comparing two or more 
populations or treatments, check that the data come from 
independent random samples from the populations of interest or 
from groups in a randomized experiment. (p. 512) 


random sampling 
Using a chance process to determine which members of a 
population are chosen for the sample. (p. 225) 


random variable 
Variable that takes numerical values that describe the outcomes of 
some chance process. (p. 362) 


randomization distribution 


units to treatments. Doing so helps create roughly equivalent groups 


Presenta los valores posibles d 
probabilidades. (pag. 362) 


modelo de probabilidad 

Descripcion de un proceso de 
espacio de muestra S que enur 
probabilidad para cada resulta 


variable cuantitativa 
Variable que toma valores nun 
o medidas. (pag. 3) 


cuartiles 
Los cuartiles de una distribuci 
cuatro grupos que tienen apro> 


(pag. 64) 


asignacion aleatoria 
Principio de disefo experimen 
experimentales a los tratamien 
unidades experimentales mas ¢ 
de otras variables entre los gru 


condicion Aleatoria 

Condicion para hacer inferenc 
tomados de una muestra aleatc 
experimento aleatorio. Al com 
verifica que los datos hayan si: 
independientes de la poblaciér 
aleatorio. (pag. 512) 


muestreo aleatorio 
Uso de un proceso de probabil 
poblacion se han de elegido er 


variable aleatoria 
Toma valores numéricos que c 
azar y probabilidad. (pag. 362: 


distribucion de la aleatoriedi 


Distribution of a statistic (like p\1—pA2P1 ~ P2 or x\1-x/A2 

©, — 22 ) in repeated random assignments of experimental units to 
treatment groups assuming that the specific treatment received 
doesn’t affect individual responses. When the conditions are met, 
usual inference procedures based on the sampling distribution of 
the statistic will be approximately correct. (p. 636) 


randomized block design 

Experimental design begun by forming blocks consisting of 
individuals that are similar in some way that is important to the 
response. Random assignment of treatments is then carried out 
separately within each block. (p. 257) 


range 
A measure of variability equal to the distance between the 
minimum value and the maximum value of a distribution. That is, 


range=maximum-minimum.'@nge = Maximum — minimum. (p, 


60) 


regression line 

Line that describes how a response variable y¥ changes as an 
explanatory variable x2 changes. We often use a regression line to 
predict the value of y¥ for a given value of x.%- (p. 176) 


reject 
Ho/Zo If the observed result is too unlikely to occur just by chance 


when the null hypothesis is true, we can reject Hoo and say that 


there is convincing evidence for Hafa- (p. 558) 


relative frequency table 
Table that shows the percents (relative frequencies) of observations 
in each category or interval. (p. 9) 


replication 

Experimental design principle. Use enough experimental units in 
each group so that any differences in the effects of the treatments 
can be distinguished from chance differences between the groups. 
(p. 253) 


residual 
Difference between an actual value of the response variable and the 
value predicted by the regression line: 


La distribucion de una estadist 


©1 ~ £2) en designaciones ale 
experimentales a grupos de tra 
especifico no afecte las respue 
condiciones, los procedimient« 
distribucion del muestreo de le 
correctos. (pag. 636) 


diseno de bloques aleatorios 
Se comienza con la formacién 
son similares de alguna maner 
asignacion aleatoria de tratami 
de cada bloque. (pag. 257) 


gama 
Medida de variabilidad que eq 


el valor maximo de una distrib 
minimo.¢ valor mAximo — valor mAnimo. 


linea de regresion 

Una linea que describe cémo 
que cambia una variable explic 
regresiOn para predecir el valo 


rechazar 

Hoo si es demasiado impro 
simple azar cuando la hipotesi 
decir que existe evidencia com 


tabla de frecuencia relativa 
Permite apreciar los porcentaj« 
en cada categoria o intervalo. | 


replicacion 

Principio de disefo experimen 
experimentales en cada grupo 
efectos de los tratamientos se | 
entre los grupos. (pag. 253) 


residual 
La diferencia entre un valor re 
proyectado por la linea de regi 


sidual=actual y—predicted y=y—y/residual = actual y — predicted y = y — ¥ regidual=real y—proyectado y=y—y/ 


(p. 179) 


residual plot 

A scatterplot that plots the residuals on the vertical axis and the 
explanatory variable on the horizontal axis. Residual plots help us 
assess whether a regression model is appropriate. (p. 185) 


resistant measure 
Statistic that is not affected very much by extreme observations. (p. 
56) 


response bias 
Occurs when there is a consistent pattern of inaccurate responses to 
a survey question. (p. 234) 


response variable 
Variable that measures an outcome of a study. (p. 153) 


roundoff error 
Difference between the calculated approximation of a number and 
its exact mathematical value. (p. 16) 


sample 
Subset of individuals in the population from which we collect data. 
(p. 221) 


sample regression line (estimated regression line) 


Least-squares regression line yA=b0+b1xY = bo + bx computed 
from the sample data. (p. 770) 


sample space S 
Set of all possible outcomes of a chance process. (p. 314) 


sample survey 

Study that uses an organized plan to choose a sample that 
represents some specific population. We base conclusions about the 
population on data from the sample. (p. 222) 


sampling distribution 
The distribution of values taken by a statistic in all possible samples 


(pag. 179) 


grafico residual 

Grafico de dispersion que traz. 
variable explicativa sobre el ej 
ayudan a evaluar si el modelo 


medida resistente 
Estadistica que no se ve muy ¢ 
56) 


sesgo de la respuesta 
Sucede cuando hay un patron « 
pregunta de una encuesta. (pag 


variable de respuesta 
Variable que mide un resultad: 


error de redondeo 
La diferencia entre la aproxim 
exacto. (pag. 16) 


muestra 
Subconjunto de individuos en 
datos. (pag. 221) 


linea de regresion de la mues 


La linea de regresion de minin 
computada a partir de los dato: 


espacios S de la muestra 
El conjunto de todos los result 


(pag. 314) 


valoracién de la muestra 

Estudio que usa un plan organ 
represente una poblacién espe: 
poblacion en datos tomados de 


distribucion del muestreo 
La distribucion de valores tom 


of the same size from the same population. (p. 444) 


sampling distribution of a sample mean 

x ® The distribution of values taken by the sample mean x = in 
all possible samples of the same size from the same population. (p. 
468) 


sampling distribution of a sample proportion 


p»P The distribution of values taken by the sample proportion p\P 
in all possible samples of the same size from the same population. 
(p. 459) 


sampling distribution of a slope 


b191 The distribution of values taken by the sample slope b151 in 
all possible samples of the same size from the same population. (p. 
772) 


sampling distribution of 


p\1-p\2P1 ~ P2 The distribution of values taken by the statistic 
pA1—p\2P1 ~ P2 in all possible samples of size n1”1 from 


population 1 and all possible samples of size n2’2 from population 
2. (p. 623) 


sampling distribution of 

x 1-x 2©1 — ©2 The distribution of values taken by the statistic 
x 1-x 2%1 ~ ©2 in all possible samples of size n1™ from 
population 1 and all possible samples of size n2’2 from population 
2. (p. 647) 


sampling variability 
The fact that different random samples of the same size from the 
same population produce different estimates. (pp. 270, 443) 


scatterplot 

Plot that shows the relationship between two quantitative variables 
measured on the same individuals. The values of one variable 
appear on the horizontal axis, and the values of the other variable 
appear on the vertical axis. Each individual in the data appears as a 
point in the graph. (p. 154) 


segmented bar graph 

Graph used to compare the distribution of a categorical variable in 
each of several groups. For each group, there is a single bar with 
“segments” that correspond to the different values of the categorical 


posibles del mismo tamafio to1 


distribucion de muestreo de 
x ® Distribucion de valores t 
las posibles muestras del misn 


distribucion del muestreo de 


pp Distribucién de valores tc 
todas las posibles muestras del 
459) 


distribucion del muestreo de 


b191 Distribucion de valores 1 
todas las posibles muestras del 
772) 


distribucioén del muestreo 
p\1-p\2P1 ~ P2 Distribucid 
p\1-p\2P1 ~ P2 de todas las 
poblacion 1 y de todas las mue 
poblacion 2. (pag. 623) 


distribucion de muestreo de 
x 1-x 2©1 — ®2 pistribucic 
x 1-x 21 — £2 de todas las 
poblacion 1 y de todas las mue 
poblacion 2. (pag. 647) 


variabilidad del muestreo 
El hecho de que muestras alea 
misma poblacion producen est 


grafico de dispersion 

Permite apreciar la relaci6n en 
mismos individuos. Los valore 
y los valores de la otra variabl: 
los datos figura como un punt 


grafico de barras segmentad 
Se usa para comparar la distrit 
uno de varios grupos. Para cad 
“segmentos” que corresponde! 


variable. The height of each segment is determined by the percent 
of individuals in the group with that value. Each bar has a total 
height of 100%. (p. 19) 


side-by-side bar graph 

Graph used to compare the distribution of a categorical variable in 
each of several groups. For each value of the categorical variable, 
there is a bar corresponding to each group. The height of each bar is 
determined by the count or percent of individuals in the group with 
that value. (p. 19) 


significance level 

Fixed value a@ that we use as a cutoff for deciding whether an 
observed result is too unlikely to happen by chance alone when the 
null hypothesis is true. The significance level gives the probability 
of a Type I error. (p. 558) 


significance test 

Procedure for using observed data to decide between two 
competing claims (the null hypothesis and the alternative 
hypothesis). The claims are often statements about a parameter. (p. 
552) 


simple random sample (SRS) 

Sample chosen in such a way that every group of n?t individuals in 
the population has an equal chance to be selected as the sample. (p. 
226) 


simulation 
Imitation of chance behavior, based on a model that accurately 
reflects the situation. (p. 304) 


single-blind 

An experiment in which either the subjects or those who interact 
with them and measure the response variable, but not both, know 
which treatment a subject received. (p. 249) 


skewness 

A distribution is skewed to the right if the right side of the graph 
(containing the half of the observations with larger values) is much 
longer than the left side. It is skewed to the left if the left side of the 
graph is much longer than the right side. (p. 32) 


slope 


categorizada. La altura de cad 
individuos en el grupo que ten 
total del 100%. (pag. 19) 


grafico de barras contiguas 

Se usa para comparar la distrit 
uno de varios grupos. Para cad 
barra que corresponde a cada § 
cuenteo o el porcentaje de indi 


(pag. 19) 


nivel de significancia Valor f 
a@ que se usa como punto de 
es demasiado improbable para 
es verdad. El nivel de significe 
1. (pag. 558) 


prueba de significancia 

Procedimiento en el que se use 
opciones que compiten entre s 
Las opciones a menudo son en 


muestra aleatoria sencilla 
Muestra tomada de tal manera 
poblacion tenga la misma opo! 


(pag. 226) 


simulacion 
Imitacion de conducta de azar, 
con precision. (pag. 304) 


ciego sencillo 

Experimento en el que ya sea | 
con los sujetos y miden la vari 
fue el tratamiento que recibi6 | 


asimetria 

Distribuci6n que esta sesgada 
(que contiene la mitad de las o 
mucho mas larga que el lado i: 
el lado izquierdo del grafico e: 
32) 


pendiente 


Suppose that y¥ is a response variable (plotted on the vertical axis) Supongamos que y¥ es una va 


and x is an explanatory variable (plotted on the horizontal axis). A y x© es una variable explicatin 
regression line relating y¥ to x2 has an equation of the form regresion que relacione y¥ cor 
y=b0+b1x. ¥ = bo + 612. In this equation, b151 is the slope, the y=b0+b1x. y = bo + biz. Ep 
amount by which y¥ is predicted to change when x® increases by cantidad mediante la cual se p! 
one unit. (p. 181) aumento de una unidad. (pag. 


standard deviation desviacion estandar 

sx$« Statistic that measures the typical distance of the values in a sx8« Estadistica que mide la c 
distribution from the mean. It is calculated by finding an “average” distribucion a partir de la medi 
of the squared distances and then taking the square root. In distancias al cuadrado a las qu 
symbols, simbolos se representa 


1 5 —\45s; = jA_ a; — 2) 
8, =4/—— S- (a; —#)? sxk1n-1¥ (xi-x 2” 7 Va & -®) 
1n-1)(xi-x )2 Vn : 
(pag. 61) 
(p. 61) 
standard deviation of a discrete random variable desviaci6n estandar de una v 
2 
Square root of the variance of a discrete random variable 0X2.7 X° La raiz cuadrada de la variaci¢ 
The standard deviation measures the typical distance of the values La desviacion estandar mide lé 
in a distribution from their mean. In symbols, distribucion a partir de su med 
—- °° °&4«»x° ‘ ee — ny)? 
oma aj — px) pi OX=Y. (xi-pX)2pi7* = VL a)" 
=F (xi-px)2pi7> >| i—Hx)'P >» (xi-pX)2p 
(pag. 368) 
(p. 368) Ee 
standard deviation of the residuals (s)(S) desviacion estandar de las re 
If we use a least-squares line to predict the values of a response Si se hace uso de la linea de ct 
variable y¥ from an explanatory variable x,"> the standard de una variable de respuesta y: 
deviation of the residuals (s$) is given by desviacion estandar de las resi 
| +e residuals” 
s = (|| = a a 
», residuals2n—2=)> (yi-y/i)2n-2 \ n-2 s=) residuals2n—2=) (yi-y/i)2n- 
This value measures the size of a typical residual. That is, s$ Este valor mide el tamafio det 
measures the typical distance between the actual y¥ values and the distancia tipica entre los valor 
predicted y¥ values. (p. 189) (pag. 189) 
standard error error estandar 
When the standard deviation of a statistic is estimated from data, Cuando se computa la desviac 
the result is the standard error of the statistic. The standard error datos, el resultado es el error e 
estimates how far the value of the statistic typically varies from the estima cuanto varia tipicament 
value it is trying to estimate. (p. 513) busca estimar. (pag. 513) 
standard Normal distribution distribuci6n normal estanda: 


Normal distribution with mean 0 and standard deviation 1. (p. 120) La distribuci6n normal con me 


standard Normal table (Table A) 
Table of areas under the standard Normal curve. The table entry for 
each value z2 is the area under the curve to the left of z.2- (p. 120) 


standardized score (z2-score) 
If x@ is an observation from a distribution that has known mean 


and standard deviation, the standardized value of x# is 
value—mean 


. 7 z re RETR ER REO 
z=value—meanstandard deviation. standard deviation” A 


standardized value is often called a z2-score. (p. 95) 


Standardized test statistic 
Calculation that measures how far a sample statistic diverges from 


what we would expect if the null hypothesis Hof were true, in 
standardized units. That is, 


tabla normal estandar (Tabl 
Tabla de areas debajo de la cu: 
cada valor z2 es el area debajc 


puntuacion estandarizada 
Si x@ es una observacion a pai 
desviacion estandar conocidas 


z=valor—mediadesviacion esta 
estandarizado a menudo se le « 


estadistica de prueba estand 
Mide la divergencia entre la es 
la hipdtesis nula Hoo fuera: 
estandarizadas. Es decir, 


ndardized test statistic=statistic-parameterstandard deviation of 


Statistic parameter 


estadistica de prueba estandarizada: 


la estadistica*™***" "cain atete noe 


standardized test statistic 


tistic standard deviation of statistic 


statistic 
Number that describes some characteristic of a sample. (p. 442) 


statistically significant 

(1) When the observed results of a study are too unusual to be 
explained by chance alone, the results are called statistically 
significant. (p. 272) 


(2) If the PP-value is smaller than alpha, we say that the results of 
a Statistical study are significant at level a.. In that case, we reject 


the null hypothesis Ho40 and conclude that there is convincing 


evidence in favor of the alternative hypothesis HaHa: (p. 559) 


statistics 
The science and art of collecting, analyzing, and drawing 
conclusions from data. (p. 2) 


stemplot (also called stem-and-leaf plot) 

Simple graphical display for fairly small quantitative data sets that 
gives a quick picture of the shape of a distribution while including 
the actual numerical values in the graph. Each observation is 
separated into a stem, consisting of all but the final digit, and a leaf, 
the final digit. The stems are arranged in a vertical column with the 
smallest at the top. Each leaf is written in the row to the right of its 
stem, with the leaves arranged in increasing order out from the 


(pag. 570) 


dato estadistico 
Numero que describe alguna c 


estadisticamente significativt 
(1) Cuando los resultados obse 
para poder decir que son conse 
llama estadisticamente signific 


(2) Si el valor PP es menor qu 
estudio estadistico son signific 


hipotesis nula Hoo y se com 
de la hipotesis Hatta alternati 


estadistica 
La ciencia y arte de coleccion< 
un conjunto de datos. (pag. 2) 


grafico de tallos al que tambi 
Representacion grafica sencill. 
relativamente pequefios que dé 
distribucion, al tiempo que inc 
grafico. Cada observacion se s 
el ultimo digito, y una hoja, qu 
en una columna vertical en la 
hoja se escribe en el rengl6n a 


stem. (p. 37) 


stratified random sample 

Sample obtained by classifying the population into groups of 
similar individuals, called strata, then choosing a separate SRS in 
each stratum and combining these SRSs to form the sample. (p. 
229) 


subjects 
Experimental units that are human beings. (p. 245) 


symmetric 
A graph in which the right and left sides are approximately mirror 
images of each other. (p. 32) 


t€ distribution 
Draw an SRS of size n?’t from a large population that has a Normal 
distribution with mean yp and standard deviation 0.7. The statistic 
a 
== 


K —ysxn yn 


has the té distribution with degrees of freedomdf=n-1.4£ =n-l. 
This statistic will have approximately a tn— 1!n—1 distribution if the 
sample size is large enough. té distributions are symmetric, single- 
peaked, bell-shaped density curves. (p. 588) 


tt interval for the slope B? 

Confidence interval used to estimate the slope of a population (true) 
regression line. (p. 778) For more details, see the inference 
summary on page T-16. 


tt test for the slope 

A test of the null hypothesis that there is no linear association 
between two quantitative variables. (p. 783) For more details, see 
the inference summary on page T-16. 


third quartile Q3@s 
In a data set in which the observations are ordered from smallest to 
largest, the median of the observations whose position is to the right 


en orden ascendiente comenza 


muestra aleatoria estratifica 
Muestra que se obtiene clasific 
parecidos, llamados estratos. I 
sencilla separada en cada estra 
sencillas para conformar la mv 


sujetos 
Unidades experimentales que : 


simétrico 
Si los lados derecho e izquierd 
del otro, se dice que son simét 


distribuci6n 
t€ Se grafica una muestra alea 
poblacion grande que tiene un. 
desviacion estandar o.7- La es 
T— Hp 
= 


= sxn vn 


tiene la distribucion té con grc 
estadistica tendra una distribuc 
muestra es suficientemente gre 
densidad de un solo pico, son: 
588) 


intervalo té para la pendient 
Intervalo de confianza que se | 
regresion de una poblacion (re 
resumen de inferencia en la pa 


prueba té para la pendiente 
Prueba de la hipotesis nula en 
variables cuantitativas. (pag. 7 
inferencia en la pagina T-16. 


tercer cuartil Q3Qs 
Si las observaciones en un con 


pequefio a la mas grande, el te 


of the median. (p. 64) 


transforming 
Applying a function such as the logarithm or square root to a 
quantitative variable. (p. 795) 


treatment 

Specific condition applied to the individuals in an experiment. If an 
experiment has several explanatory variables, a treatment is a 
combination of specific values of these variables. (p. 245) 


tree diagram 

A diagram that shows the sample space of a chance process 
involving multiple stages. The probability of each outcome is 
shown on the corresponding branch of the tree. All probabilities 
after the first stage are conditional probabilities. (p. 339) 


two-sample tt interval for a difference between two means 
Confidence interval used to estimate a difference in the means of 
two populations/treatments. (p. 651) For more details, see the 
inference summary on page T-16. 


two-sample té test for the difference between two means 

A test of the null hypothesis that the means of two 
populations/treatments are equal. (p. 658) For more details, see the 
inference summary on page T-16. 


two-sample z2 interval for a difference between two 
proportions 

Confidence interval used to estimate a difference in the proportions 
of successes in two populations/treatments. (p. 626) For more 
details, see the inference summary on page T-16. 


two-sample zz test for the difference between two proportions 
A test of the null hypothesis that the proportions of successes in two 
populations/treatments are equal. (p. 634) For more details, see the 
inference summary on page T-16. 


two-sided alternative hypothesis 

The alternative hypothesis is two-sided if it states that the parameter 
is different from the null value (it could be either smaller or larger). 
(p. 555) 


observaciones cuya posicion e 


transformacion 
La aplicacion de una funci6n t 
variable cuantitativa se denom 


tratamiento 

Una condicion especifica que : 
experimento. Si un experimen! 
tratamiento es una combinacid 
variables. (pag. 245) 


diagrama de arbol 

Diagrama que presenta el espa 
multiples etapas. La probabilic 
correspondiente del arbol. Tod 
primera etapa son probabilidac 


intervalo tt de dos muestras 
medias 

Intervalo de confianza que se | 
medios de dos poblaciones/tra 
ver el resumen de inferencia e! 


prueba té de dos muestras p. 
Prueba de la hipotesis nula en 
poblaciones/tratamientos son i 
el resumen de inferencia en la 


intervalo z2 de dos muestras 
proporciones 

Intervalo de confianza que se | 
proporciones de éxitos en dos 
mas informacion, ver el resum 


prueba z2 de dos muestras p 
proporciones 

Prueba de la hipotesis nula en 
poblaciones/tratamientos es ig 
resumen de inferencia en la pa 


hipotesis alternativa bilatera 
La hipotesis alternativa es bila 
del valor nulo (podria ser mas 


two-way table tabla de doble via 
Table of counts that organizes data about two categorical variables. Una tabla de doble via de cuer 
(p. 14) categorizadas. (pag. 14) 


Type I error error Tipo I 
An error that occurs if a test rejects Ho/f0 when HO/o is true. Error que sucede cuando una f 


That is, the test finds convincing evidence that Ha!fa is true when verdadera. Es decir, la prueba 
it really isn’t. (p. 560) es verdadera cuando en realide 


Type II error error Tipo II 
An error that occurs if a test fails to reject Ho/Z0 when Ha/Za is 


true. That is, the test does not find convincing evidence that Hata 
is true when it really is. (p. 560) 


Error que sucede cuando una f 


Aa es verdadera. Es decir, la y 


que Ha/Za es verdadera cuand 


estimador sin sesgo 

La estadistica que se usa para | 
sesgo si la media de distribuci: 
del parametro que se esta com 


unbiased estimator 
A statistic used for estimating a parameter is unbiased if the mean 
of its sampling distribution is equal to the true value of the 

parameter being estimated. (p. 448) 


undercoverage subcobertura 
Occurs when some members of the population are less likely to be Sucede cuando algunos miemlt 


chosen or cannot be chosen in a sample. (p. 233) probablidad de ser elegidos 0 1 
(pag. 233) 

uniform distribution distribucion uniforme 

A distribution where the relative frequency of each possible value is Distribucion en la cual la frect 

the same. (p. 33) misma. (pag. 33) 

unimodal unimodal 

A graph of quantitative data with a single peak. (p. 33) Grafico de datos cuantitativos 

union union 


The union of events A and B, denoted by AUB,A UB, consists of 
all outcomes in A or B or both. (p. 323) 


La union de los eventos A y B 
todos los resultados en A o B, 


variability of a statistic variabilidad de una estadisti 
Describes the variation in a statistic’s sampling distribution. Describe la variabilidad del mi 


Statistics from larger samples have less variability. (p. 451) 


variable 
Any characteristic of an individual. A variable can take different 
values for different individuals. (p. 3) 


variance of a discrete random variable oX27x 
Weighted average of the squared deviations of the values of the 
variable from their mean. In symbols, 


=)0 (ei — ux)*p: 


(=F (xi- ux2pi?X 


(p. 368) 


: 2 
variance sx2%z 
“Average” squared deviation of the observations in a data set from 
their mean. In symbols, 


1 
D=1n-1¥ (xi-x )2 n—-1 


(p. 62) 


Venn diagrams 

A diagram that consists of one or more circles surrounded by a 

rectangle. Each circle represents an event. The region inside the 
rectangle represents the sample space of the chance process. (p. 
322) 


voluntary response sample 
A sample that consists of people who choose to be in the sample by 
responding to a general invitation. Voluntary response samples are 
sometimes called self-selected samples. (p. 224) 


wording of questions 
An important influence on the answers given in a survey. 
Confusing or leading questions can introduce strong bias, and 
changes in wording can greatly change a survey’s outcome. Even 
the order in which questions are asked matters. (p. 234) 


se describe por la amplitud de 
estadisticas de muestras mas ¢g 


variable 
Toda caracteristica de un indi, 
valores para diferentes individ 


variacion de una variable ale 
Promedio sopesado de las des 
variable a partir de su media. I 


oX2=¥(xi-pX)2pi7e = 22 — ax)'m 


(pag. 368) 


be eee 2 
variacion sx2%= 
La desviacion cuadratica “pro! 
de datos seguin su separacion c 


En simbolos se expresa sx2=1n-1 


2 1 J 
En sAmbolos se expresa_ 5? =A y. (aj - z)" 


(pag. 62) 


diagramas 

Venn Diagrama que consiste e 
rectangulo. Cada circulo repre 
representa el espacio muestral 


muestra de respuesta volunt: 
Muestra compuesta por persor 
eligen formar parte de la mues 
conocen a veces como muestré 


terminologia de las pregunta 
Influyen en gran medida sobre 
Toda pregunta confusa, capcic 
introducirle al proceso un sesg 
pueden modificar de manera n 
los resultados se pueden ver af 
preguntas. (pag. 234) 


yY intercept 

Suppose that y¥ is a response variable (plotted on the vertical axis 
of a graph) and x® is an explanatory variable (plotted on the 
horizontal axis). A regression line relating y¥ to x® has an equation 


of the form yA=b0+b1x.¥ = bo + bi 2. Ip this equation, the number 


b020 is the yY intercept, the predicted value of y¥ when x=0. 
x = 0. (p. 181) 


interceptaci6n 

yY¥ Supongamos que y¥ es uni 
vertical) y x es una variable 
La linea de regresion que relac 
yA=b0+b1x.Y = 40 + biz. se 
interceptacién y,¥> el valor pre 
181) 


Index 


A 


Addition rule, 317 
general addition rule, 318-321 
usage, 470 
Additive, impact (matched pairs design), 260 
Alternative distribution, 600 
Alternative hypothesis, 554, 710, 727 
evidence, strength (measurement), 559-560 


favoring, 558 
Alzheimer’s disease, treatment, 597-598 
American Community Survey (ACS) 

household data, 2 

nonresponse rate, 233 
Apartment cost, conditions (checking), 649-650 
Apgar scores 

discrete random variables, 363 


mean, finding/interpretation, 367 


probability distribution, histogram, 364f 


variation, standard deviation (finding/interpretation), 369 
Apgar, Virginia, 363 
Approximate sampling distribution, 444 
Areas 

calculation, 796 


determination, 122-127 


normal distribution values, 125-126 

values, finding, 127-131 
Association, 19 

causation, implication (absence), 21—22 

conditional relative frequencies (Titanic disaster), 20-21 

negative linear association, 177 

strength/consistency, 277 

term, usage, 168 
Astronomical units (AUs), measurement, 807 
Attendance audit, SRS selection (Table D usage), 228 
Auto care, CLT (usage), 477-478 
Average income/fertility rate, relationship, 185f 


B 


Background music 

analysis, 727 

types, 730, 733 
Backpack, sampling distribution, 462-463 
Back-to-back stemplot, 39 

resting pulse rates, 39f 


Bar graphs, 10 
axes, drawing/labeling/scaling, 10 
communication method, 5f, 10f 
creation, 11 

process, 10 

histograms, contrast, 45 
interpretation, 11 
Titanic travel classes, 21f 


Baseball teams 


monetary expenditures, 155-156 
payroll, relationship, 166, 167f, 168f, 200-201 


wins, Mean number, 167 


Bathwater, transformations (effect), 387 


Batteries 
boxplot, 586f 


conclusions, making, 559 


conditions, checking, 586 
histogram, 586f 
lifetimes 

probability plot, 586f 


test power, interpretation, 598 


mean lifetime, 585 


point estimators, usage, 495—496 
power, 599-600 
P-value, 588f 
stemplot, 586f 
Batteries, standards, 440 
Bayes, Thomas, 340 
Bayes’s theorem, 340 
Beads 
conditions, checking, 513-514 


population proportion, 510 
Beta, 770 

significance test, 783-784 
Bias, 224 

avoidance, 271 


biased sampling methods, 224—225 
estimator usage, 453f 
response bias, 234 

Biased estimators, 448—450 


Binomial coefficient, 407-408 


calculation, 408 


Binomial distributions, 406 


computation, 418 

description, 412-418 
histogram, 420f 

normal approximation, 420-422 


sampling, relationship, 419-420 


Binomial mean/Vvariance formulas, origin, 417 
Binomial probabilities 
calculation, 406-412 
distribution, graphing, 413 
finding, process, 411 
formula, 408 
Binomial random variables, 361, 402, 403-406 
mean/standard deviation, 415-41 


probability distribution 
appearance, 412-413 
histogram, 413f 
probability histogram, 414f 
standard deviation, 416 
Binomial settings, 403-406 
conditions, 404 
BINS (Binary Independent Number Same), 404 
Biological populations, characteristics, 116 
Blinding, 249-250 
Block, 257 
Blocking, usage, 259 


Blood pressure 
means, difference (significance test), 659-660 
treatment groups, simulated random reassignments (dotplot), 662f 
Blood type inheritance, binomial probability (calculation), 409 
Blue M&M’S®, avoidance (probability rules), 317-318 
Boaty McBoatface, name (vote), 224—225 
Bones (health), P-value interpretation, 557 
Book of Odds, The, 302 
Books (reading) 


critical value/confidence interval, calculation, 516 


[op) 


H, one-sample t interval, 535-5 


Bottled water, tap water (contrast), 360 
binomial distribution, description, 414, 416 


Boundary value, 120 
standardization, 123, 125 


usage, 558 
Boxplots 
creation process, 68, 69—70, 73 
creation, visual illustration, 68f 
five-number summary, determination, 69 
interpretation, 68, 69—70 
median, marking, 69 
outliers, identification, 69 
pumpkin selection, 69—70 
usage, 71—73 
whiskers, drawing, 69 
Brand names 
importance, 627 
preference, conditions (checking), 625 
Breakfast 
breakfast cereal calories, dotplots/summary statistics, 132f 


consumption, standardized test statistic/P-value (calculation), 632—633 


C 


Caffeine 
comparison, completely randomized design (outline), 255f 


dependence (reality), mean difference (significance test), 680-681 
experiment, 247—249 

analysis, 273 
pulse rate 

change, 272 


relationship, 251 
Calcium 
means, difference (significance test), 659-660 
P-value, interpretation, 557 
treatment groups, simulated random reassignments (dotplot), 662f 
Cancer, causes, 277 
Candy color, distribution, 708—709 


Candy grab 
prediction, 178-179 
scatterplot, display, 154f 


slope/y intercept, interpretation, 182 


s/r2 interpretation, 191—192 


values, 152 


Candy machine, sampling distribution, 459-460 


CareerStart, effectiveness, 276 
Cars, value (loss), 176, 177f 
Categorical data 


analysis, 9 
display, 10-12 
Categorical variables, 3, 152 


data analysis, 13-16 
relationships, 17—22, 740—741 


Causation 


correlation, connection, 163 
establishment, challenges, 277-278 


implication 
absence, 21—22 


association, relationship, 201 


Cause and effect 
inference, 682 
questions, 277 
understanding, 244 
Cause, plausibility, 278 
Census, 221 
Census At School, 3-4 
Random Data Selector, 336 


biased/unbiased estimators, 448-449 
comparison, 461, 469 
measurement, 54-57, 366-367 
skewness/outliers, effect, 58 
Central limit theorem (CLT), 474-478, 646 
usage, 477—478 
Challenger disaster, multiplication rule, 345 
Chance, 298 
process, 335, 403 
Chance behavior, predictability, 301 


Chebyshev’s inequality, 119 

Children (hunger) 
hypotheses (stating)/conditions (checking), 631 
SRS, 631 

Chi-square distributions, 715 
density curves, 715f 


P-values, relationship, 714—717 
Chi-square test statistic, 711—714 
calculation, expected count division (reason), 713 
expected counts, relationship, 728—732 
values, dotplot, 714f 
Chi-square tests, 708, 709 
execution, 717—721 


expected counts (calculation), two-way table (data basis), 730 


performing, 735-736 

P-values, finding, 716 

selection, 747—748 

usage, 746—751 
Chocolate 

correlation, 165-166 


milk, concussions, relationship, 256 
Cholesterol 
levels, impact, 127 


P1 — P2 significance test, performing, 635-636 


Church attendance, life expectancy (relationship), 277 


Claim (evaluation), sampling distribution (usage), 445 


Cluster sampling, 230 


strata, difference, 231 
Clusters, 230 
display, absence, 70 


Coefficient of determination r2, 188-192 


interpretation, 191—192 

s value, relationship, 191 
Coffee, drinkers (identification), 302 
Coin toss 

fairness, 299-301 


proportion, 301f 


randomness, investigation, 302-303 
scenario, 335 
College 
attendance, p sampling distribution calculations, 464—465 
cost, constant (multiplication/division effect), 385 
Communication method, distribution 
bar graph, usage, 5f 
histogram, 45 
Comparison 
experiment design, 247—249, 253 


number, execution, 728 


Complement, 316, 323 
rule, 316 

Completely randomized designs, 255-256 
outline, 255f 


Computer regression output 


interpretation, 192-194 
t values, 779 
Conclusions, making, 558-560 


Concussions 
chocolate milk, relationship, 256 
stemplots, creation/interpretation, 38 
Conditional probability, 342, 406, 598 
calculation, 332-333 
defining, 330-334 
independence, relationship, 330, 334-337 


tree diagrams, relationship, 338—343 


usage, 332 

Conditional relative frequency, 17 
association, Titanic disaster, 20—21 
Titanic disaster, 17-18 

Confidence intervals, 495, 496—499, 552 
applet, 499-500, 502f, 503-504 
calculation, 516, 650 
construction, 511-518, 534-538 

conditions, 513, 625 

four-step process, 517-519 


ee) 


information, increase, 579 
interpretation, 498, 499, 517-51 


90% confidence interval, 653 


one-sided tests, connection, 597 
P1 — P2, 625-630 
two-sided tests, relationship, 595-597 
usage, 595-596 

Confidence level, 497—498 
adjustment, 503 


critical value, 538 


decrease, 504 


interpretation, 499-502 
Confidentiality, 279 
Confounding, 243 

remedy, 248 


Constant 
addition/subtraction, 382 
effect, 98-99, 382-384 
multiplication/division, 382 
effect, 100—101, 384-386 


Consumers Union, experiment, 682 


Continuous random variables, 361, 371—374 


probabilities, finding, 372 
probability distribution, 371, 372f 
Control, 252—253, 259 
groups, 248-249 
Convenience sampling, 223-224 
Coronary heart disease (CHD) 
development, anger category (bar graph), 741f 
status, 742 
Correlation, 153 
calculation, 166—168 
causation, connection, 163 
cautions, 163-166 


coefficient, 160 


estimation, 161 

explanatory variables/response variables, contrast, 168 
facts, 168-169 

impact, 164-165 

interpretation, 162 

knowledge, 197—201 


linear association, 160-162 


negative correlation, 168 


regression applet, usage, 163—164 


resistance, absence, 197—201 


usage, 161f, 164, 197-201 


Correlation r, 160 


calculation, 166 
Craft stick problem, 447-448 
Critical value, 511 
calculation, 515, 516, 529-531 
dependence, 504—505 
finding, 515f 
Crying/IQ (relationship), beta (significance test), 783-784 
Crying peaks, 784 


Cumulative frequency, 93 
Cumulative relative frequency graphs, 92—95 


Current Population Survey (CPS), U.S. Census Bureau, 232, 269 
income questions, 441 
interviews, 440 


Customer satisfaction, sample size (determination), 520 


D 


Data 
analysis, 4—5 
categorical variables, 13—16 
collection, 746 
display, usefulness, 46 
dotplot, five-number summary (inclusion), 68f 
ethics, 278-279 
graph, 132 
histogram, 132 
limitations, 575-576 


organization, 2—5 

plotting, 109 

set 
measurement units, change, 97—98 
range, 60 

skewness, 131 


summary Statistics, 132 


tests, performing, 728 
transformation, 97-103, 795-796 


unrepresentative data, production, 223-224 


values 
inaccuracy, 68 
plotting, 155 
Degrees of freedom (df), 529, 588f, 746 
Density curves, 90, 109-110 
description, 111-113 


dotplot modeling, 110 

mean, balance point, 112f 

median, 112 

modeling, 110f 

randomness, 111 

shapes, 111f 

symmetric density curve, midpoint balance, 112 


usage, 529f 
Dependent variables, 153—154 
Diameter at breast height (DBH), 652-653 


Diamonds 
pricing, 186 
SATs, connection, 154 
Die rolls, 33 
hypotheses/expected counts/chi-square statistic, 712—713 
return, P-value (finding), 716-717 
Discrete random variables, 361, 362—364 
analysis, 364—365, 370 
mean (expected value), 366-367 


median, finding, 367 
probability distribution, 362 
standard deviation, 368-371 


variance, 368-371 


Dissolved oxygen (DO) 
yt, significance test (performing), 592-593 


one-sample t test, Minitab output, 594f 


Distance, equation, 797 
Distracted driving, 274 
experiment, 274-275 
Distribution, 4 
appearance, interval selection (impact), 43 


comparison, 36—37 
boxplots, usage, 71—73 
description, 34—35, 102 
median, 92 
normal distributions, 90, 109, 113-116 
percentile, 92, 118 
shape, 100 
normal density curve display, 113f 
skewed to the left, 32 
skewed to the right, 32 


variability, 98 
Dotplots, 5, 30-32 
comparative dotplots (PVC pipe length), 59f 
creation, process, 31—32 
interpretation, 31—32 
Old Faithful eruptions, display, 33f, 41f 
score dotplot, 91f 
soccer goals, 31f 
Double-blind experiment, 249 
Drink size, x 1-x 21 — £2, sampling distribution (description), 648 
Drivers, distraction, 274 


Drop height, 776 


E 


Earnings, variation, 392-393 


ebooks/print books, readership (tree diagrams/probability), 341 
Effect size, 602 
Elections 


confidence interval, 499 


poll, confidence level (interpretation), 501 


Empirical rule, 117 


English (speaking), homogeneity (chi-square test), 738—739 


Entertainment media, teen use, 220 


Environmental club membership, bar graph, 20 
Equal-areas point, 112 


Equal proportions, test, 718-719 
Equal-width intervals, selection, 41 
Error 
data, conversion, 100—101 
distribution, 99, 101 


margin. See Margin of error. 
standard error, 513 


Estimator 
biased estimators, 448—450 
bias/variability, 453f 


point estimator, 495 

selection, 452-453 

unbiased estimators, 448-449, 469 
Events, 314-315 

complement/intersection/union, 323f 


independent events, 335 

mutually exclusive events, 320f 
addition rule, 317 

occurrence, 316 

probability, 316 

relationship, 322f 


representation, 322f 


two-way tables, 320f 


Expected cell counts, impact, 750—751 


Expected counts 
calculation, 711 
two-way table, data basis, 730 


division, reason, 713 


observed counts, comparison, 711—714 
Expected value, 366-367 
Experimental design, principles, 253, 254 


Experimental unit, 245, 248 
Experiments, 241 


blocking, usage, 259 

design, 247-253 

explanatory variables, usage, 246-247 
inference, 272-275, 627-629 
language, 245-247 


observational studies 
contrast, 242—245 
relationship, 244 
usage, 269 
vocabulary, 246 
yield, worthlessness, 248 
Explanatory variables, 153-154, 242 
change, 773 


effect, observational studies, 243 


response variables, contrast, 168 
usage, 246-247 


values 


association, 277 
increase, 799 
variability, increase, 775 


Exponential models, 803-806 
Extrapolation, 178 
Extrasensory perception (ESP), testing, 403 


FE 


Facebook, financial incentives, 244 
Facebook/Instagram 
conditional probability, calculation, 333 


general addition rule, 321 
Venn diagrams/probability, 324 
Factor, 246 
Factoring, 631 
Federalist Papers, authors (identification), 222—223 
Feet/height 
least-squares regression line, calculation, 195 


relationship, scatterplot, 196f 
Feet, usage, 192-193 


Female president (vote), hypotheses/expected counts/chi-square test statistic, 731—732 


Fertility rate/average income (relationship), scatterplot/ 
residual plot (usage), 185f 

First quartile, 64 

Fisher, Ronald A., 560 

Fishing (transformation) 


logarithms (usage), power models, 800-801 
powers, usage, 797—798 
roots, usage, 799 

Fit, summary, 192f, 194 


Five-number summary, 68 


determination, 69 

dotplot, combination, 68f 
Five-second rule, 246—247 
Fixed significance level, usage, 559 


Football players, weighing (sampling), 271, 275 
Ford F-150s 

analysis, 187 

data, 181 


regression line, addition (scatterplots), 183f 


least-squares regression line, 184, 192f 
price/miles 

residual plot, 186f 

scatterplot, 177f, 186f 


price prediction, regression line (usage), 177f 


scatterplot, regression line (addition), 188f 
value 
loss, 176, 177f 


slope, confidence interval, 779-780 


Form (measurement), correlation (impact), 164 
Free throws 

confidence interval, 552—553 

proportion, 553f 

P-value, 556 
Frequency histogram (sales tax rate), 43f 
Frequency table, 9, 92-93 
Fuel economy 


distribution, description, 35 
ratings, dotplot, 58f 


G 


Galton, Francis, 196—197 


Gapminder, 795 
Gaps, display (absence), 70 


Gasoline prices, variation, 440 

Gauss, Carl Friedrich, 116 

Gender/handedness, independence (investigation), 336 
General addition rule, 318-321 

General multiplication rule, 337-338 

General Social Survey (GSS), 656 

Geographic region, indication, 795f 


Geometric distributions, 423 
Geometric probability 
calculation, 423-425 
formula, 423 
Geometric random variable(s), 361, 402, 422-426 


mean (expected value), 426 
probability distribution, histogram, 425f 
Geometric setting, 422 
Gesell Adaptive Score, 198 
least-squares regression lines, 199f 
data, changes, 199-200 
linear model, residual plot, 199f 
scatterplot, 199f 
Gladwell, Malcolm, 718 
Goals scored 
dotplot, 58f 


mean, calculation, 55 


median, calculation, 57 

outliers, identification, 67 
Golden ticket parking lottery, 306-308 
Goldfish, pA1—pA2P1 P2 sampling distribution (description), 624 
Goodness of fit, chi-square test, 708, 709, 718 

expected counts, calculation, 711 


follow-up analysis, 720-721 
performing, 719-720 
conditions, 717 
P-value, 715f 
Goodness of fit, test, 733 
Grade Point Average (GPA) 


margin of error, factors, 505 


normal/large sample condition, 533-534 
Graphs, 12—13 
usage, 30 


Guesses 
constant, multiplying/dividing (effect), 100—101 


dotplots/summary statistics, 99f, 101f 


Hand span, scatterplot, 154f, 180, 182, 191 
Harris Interactive survey, 625, 626 


Hawaii Pineapple Company 
confidence intervals, 595-596 


hypotheses, stating, 555 
two-sided tests, 595-596 


Head circumference data, stemplots, 39 


Heart attacks, p; — po significance test (performing), 635-636 
Height 
distribution, example, 119 


prediction, feet (usage), 192-193 
Height/feet 
least-squares regression line, calculation, 195 


scatterplot, 196 
Height/strides, correlation, 169 


Helicopter experiment 
conditions, checking, 775—7 


76 
parameters, estimation, 777—778 
Helsinki Heart Study, 635-637 
treatment groups, Pp; — P2 values, 637f 


Hiring discrimination, 6 
Histograms, 40-44, 771f 
axes, scaling, 41 
bar graphs, contrast, 45 
bars, drawing, 41 
creation process, 41-44 
equal-width intervals, selection, 41 
interpretation, 42 
intervals, selection, 43 
Old Faithful eruptions, 41f 
table, creation, 41 
usage, 45-46 
word lengths, comparison, 45f 
HIV testing, probability (finding), 345-346 
Hockey birthdays, equal proportions (test), 718—719 


Homogeneity 
chi-square test, 728, 738-740 


follow-up analysis, 739-740 


performing, conditions, 733 
tests, 708 
conditions/P-values, 733-737 


hypotheses, stating, 727-728 


Household size, distributions (comparison), 36 
Hypotheses, stating, 554, 630, 655, 710 
Hypothesis test, 552 


Identical twins (pairs), IQ scores 
dotplot, 674f 
parallel dotplots, 673f 


Identical twins (intelligence), mean difference (confidence interval), 677-678 


iMac, introduction/survey, 12—13 
Independence 
absence, problem, 346-347 
chi-square test, 741—746 
performing, 726 


performing, conditions, 743 


conditional probability, relationship, 330, 334—337 
investigation, 336 
tests 
conditions/calculations, 743-744 
expected counts, 742 
hypotheses, stating, 741—742 
Independent events, 335 
multiplication rule, 344—347 
Independent random samples, 647 
data, 625 
Independent random variables, 390 
difference, standard deviation, 3923 
sum, standard deviation, 391 
Independent SRSs, selection, 624f, 648f 
Independent variables, 153-154 
Individuals, 2 


random assignment, 276 
Inference, 6—7, 270-271 

scope, 275-277 

usage, 272-275 
Influential points, usage, 200-201 


Informed consent, 279 


Instagram/Facebook 
conditional probability, calculation, 333 


general addition rule, 321 
Venn diagrams/probability, 324 


Institutional review boards, 279 

Intensity, equation, 797 

Interest, outcomes, 403 

Interquartile range (IQR), 63-65 
constancy, 98 


determination, 64-65 


Intersection, 323 


Interval 
confidence intervals, 495, 496-499 


estimate, 496 
selection, 43 
usage, 497 


Interviewer gender (impact), conditions/P-value/conclusion, 734—735 
Inverse t (usage), 531 
Iowa Test of Basic Skills (TBS) vocabulary scores, 113-117 
calculation, 121—122 
distribution, 114-116, 119-120 
description, 68—95—99.7 rule (usage), 118 


percentile, estimation, 127 


i 


histogram, 113f 

level, proportion (normal distribution estimation), 121f 
90th percentile (display), normal distribution (usage), 127f 
normal curve, usage, 120f 

obtaining, 118 

quantitative data sets, 115 

seventh-grade scores, 123, 125-126 


normal distribution approximation, 125f 
proportion, normal distribution estimation, 124f 
test score data, 120 
IQ scores 
crying (relationship), beta (significance test), 783—784 
dotplots/normal probability, 134f 
normal distribution, 132-133 


J 


Jeep tours, probability distribution (display), 365 
JMP statistical software, 192f 
Joint relative frequency, 15 


calculation, 15-16 


K 


Kepler, Johannes, 808 


L 


meeting, 570, 632 


satisfaction, 462 


usage, 573 


Law of large numbers, 301-302 
Least-squares regression, 176 


computation, 778 
lines, 770f 
dotplot, 771f 


resistance, absence, 197—201 


Least-squares regression line, 183-184, 191 
calculation, 184, 186, 192f 
feet/height relationship, 195 


calculation process, summary statistics (usage), 194 


point, addition, 198 
properties, investigation, 198 
Left skew, function, 112 
Levels, 246 
Life expectancy 


church attendance, relationship, 277 
scatterplot, 795f 


log(income per person), contrast, 796f 


Linear association, 164 
measurement, 160—162 


Linear model, appropriateness (determination), 185-188 


Linear regression, 768 
inference, 769 

Linear relationship 
description 


correlation/regression lines, usage, 197 


correlation, usage, 164 
direction, 160 


strength/direction measurement, correlation (usage), 161f 
Linear transformations, 796 
Linearity (achievement), transformation (usage), 795, 809 


LINER (Linear Independent Normal Equal SD Random), 774—775 
LinRegTTest, p > 0, 785 
Location 


distribution description, 91 
measurement 
percentiles, usage, 91-92 
z-scores, usage, 95-97 
Logarithms, usage, 800-802 
Lower back pain treatment, p; — pz confidence interval, 628 
Luck, 422 


Lucky Day game, geometric probabilities (calculation), 423—42 


M 


Magnets, impact, 250 


Malaria, prevention, 246, 248 


Mammograms, tree diagrams/conditional probability, 342 
Margin of error, 271, 498, 538 

capability, 518 

decrease, 504 


examination, Confidence intervals applet (usage), 503-504 
factors, 502-505 


limitations, 505 


sample size, 519 


pL estimation, 539 
size, 651 
usage, 539 


Marginal relative frequency, 14 
calculation, 15-16 
Matched pairs design, 259-261 
Math (impact), paired data (analysis), 674-675 
Mean, 54-56 
balance point, 56 


calculation, 55, 469 


confidence interval, construction (conditions), 533 
discrete random variable, 366—367 
display, 114 
finding/interpretation, 367 
median 
comparison/contrast, 58-59, 112 
relationship, dotplots (usage), 58f 
mystery mean, 494—495 


one-sample t test, 592 


performing, 594 
population mean, estimation, 525 
regression, 194—197 


standard deviation, 119 


values, regression, 196—197 


Mean difference 
confidence interval, 677-678 


construction, conditions, 676 


one-sample t interval, 677 
one-sample t test, 679, 680 


paired t interval, 677 
paired t test, 679, 680 


significance test, performing (conditions), 679 


Mean price, usage, 190f 
Means 


addition rules, usage, 470 
comparison, 645, 673 

Means, differences 
confidence interval, construction, 654 


conditions, 649 


dotplots, usage, 273f 


sampling distribution, 645-648 
significance test, performing, 656, 660-661 


two-sample t interval, 650, 651 


two-sample t test, 658 
Median 
calculation, 57 
finding, 367 
marking, 69 
mean, comparison/contrast, 58-59, 112 
mean (relationship), dotplots (usage), 58f 


measurement, 57 


Medium/large drink, x 1-x 2%1 ~ ©2, sampling distribution (description), 648 


Mileage (improvement), matched pairs design, 260 

Minitab statistical software, 192f 

M&M’S®, color distribution, 708—717 
observed/expected counts, bar graph, 712 

Monkeys (counting), sample size (determination), 539 

Movie attendance, mean/standard deviation, 470 


iv 
confidence interval, 99% level, 596 


confidence interval, construction, 534-538 


ke 


estimation, conditions, 532-53 


one-sample t interval, 535-536 


one-sample t test, 591-595 


significance test, 585-591 


significance test, performing, 592-593 


1 — Ha confidence intervals, 649-655 


Hl — Ho significance tests, 655-658 


calculations, 657 


hypotheses (stating)/conditions (checking), 655-656 
standardized test statistic/ 
P-value, 657 
1 — Hz two-sample t-test, 658-66 


a 


Ke) 


Lgiff confidence intervals, 676-67 


Lgift Significance tests, 679-682 
Multiple analyses, usage (limitations), 604 
Multiplication rule, 344-347 

general multiplication rule, 337-338 


Music 
background music, types, 730 


conditions (entrée distributions), relative frequency (bar graphs), 727f 


impact, paired data (analysis), 674-675 


Mutually exclusive, 317 

Mutually exclusive events, 319 
addition rule, 317 

Mystery mean, 494—495 


N 

NASCAR cards, cereal boxes (relationship), 305-306 

National Health and Nutrition Examination Survey (NHANES), 645-646 
National Highway Traffic Safety Administration (NHTSA), seat belt report, 220 
Nausea drug interaction, test power (factors), 602 


Negative correlation, 168 

Negative linear association, 177 

Nonlinear relationships, strength (measurement), 164 

Non-normal distributions, 131, 135 

Non-normal population, sample mean (sampling distribution), 475 


Nonresponse, 233 
Nonsmokers, two-sided test, 577-578 
Normal approximation, usage, 464-465 


Normal curve, 113 
mean/standard deviation, display, 114f 
sketching, 115, 119 


usage, 120f 


Normal density curve, 125 
display, 113f 
usage, 116-117 


approximation, 125f 
areas, finding, 119-127 
process, 122 
values, determination, 128 
drawing, 122, 123, 129, 373 
estimation, 124f 
facts, 114 


importance, 116 


mean, standard deviation, 123 


mean/variance, 396 

modeling, 645-646 

sampling distribution models, 571f 
shape, 114 

sketching, 115 

specialness, 116-117 

standard normal distribution, 120 


symmetry, 118 
usage, 127f 
values, areas (determination), 121-122, 129 


Normal/Large Sample condition, 532 


allowance, 591 
checking, 533-534 
Normal population, sampling, 471—474 


distribution, examination, 471-472 

sample mean, sampling distribution, 472 
Normal probability distributions, 373, 487f 
Normal probability plots, 133-136 


creation, 136 


interpretation, 135 


linear/curved form, 134 
nonlinear forms, 134 
usage, 135, 136 
Normal quantile plot, 133 
Normal random variables 
combining, 394-397 
histograms, 395f 
Normality, assessment, 131-13 


process, normal probability plot (usage), 135 
Null hypothesis, 554—558, 570, 573 
deviation, 603 


general form, 630 
homogeneity usage, 727 


restaurant experiment, 729 


Null value, alternative parameter value (difference), 600 


Numbers, usage, 54 
Numerical summaries 
calculation, 109 


computation, 66 


technology, usage, 65-66 


O 


Obama, Barack, age (percentile), 94 
Observational studies, 241—242 
experiments 
contrast, 242—245 
relationship, 2434 


Observed counts, expected counts (comparison), 711—714 


Observed values, standardization, 97 
Occurrence, indication, 68 
Oil changes, CLT (usage), 477-478 
Old Faithful eruptions 
durations 
dotplots, 33f, 41f 
histogram, 41f 
scatterplot, 769f 
interval, scatterplot, 769f 
sampling, 768—769 


SRS, scatterplots/least-squares regression line, 770f 


subpopulation, 773-774 
“1 in 6 wins” game, 298, 305 
One-sample t interval, 677 
One-sample t test, 591-595 
Minitab output, 594f 


One-sample z test for a proportion, 572-577 
One-sided alternative hypothesis, 554—555 
One-sided tests, confidence intervals (connection), 597 
Outcomes, 372 

collection, 315 

likelihood, 315 


outcome of interest, 403 

probability, 316 
Outliers, 34 

effect, 58 

identification, 66-69 


influence, 198 
usage, 200-201 
Outliers (Gladwell), 718 


p 


P1 — P2 confidence intervals, 625-630 
endpoints, 629 


formula, usage, 626 


lower back pain treatment, 628 


P1 — P2 significance tests, 630-634 
calculations, 631-633 
P1 — Pz two-sample z test, 634-637 


Pain (repulsion), magnets (usage), 250 
Paired data 
analysis, 673-676 


means, comparison, 619, 673 


two samples, contrast, 683-685 
Paired t interval, 677 
Parameters, 442-443 

estimation, 776—778 


Part-time jobs, standardized test statistic/P-value (calculation), 571-572 
Pass the Pigs (game), 403 
Payroll 


wins, correlation, 162 
Peaks, display (absence), 70 
Penny 

date, proportion, 440-441 

distribution, 443f 
Percentiles, 90 

finding/interpreting, 92 

usage, 91—92 


z-scores, connection, 121 


Period, equation, 797 


Physicians’ Health Study, experimental design (principles), 254—255 
Pictograph, 12-13 
Pie chart, 10 


example, 10f, 11f 
Pierced ears (user identification) 
probability example, 331 
two-way tables/probability, usage, 319 
Placebo, 244 
effect, 249-250 
Planet, model (selection), 807-808 


Plants, mean number, 230 
Plausible, term (meaning), 497 
Point estimate, 495, 498 
Point estimator, 495 

usage, 495-496 


Political polls, response rates, 233 


Pooled two-sample t procedures, avoidance, 663 
Pooled two-sample t statistic, 663 

Pooling, function, 663-664 

Popcorn, blocking (usage), 259 


Population, 221 
distribution, 446 
shape, 469 
normal population, sampling, 471-474 
regression line, 770 
samples, relationship, 221—222 


shapes, sample sizes (sampling distributions), 476f 


slope, marking, 771f 


variance, biased estimator, 450 


Population mean, 585 
confidence interval construction, 537-538 


estimation, 525 


one-sample t interval, 529 


one-sample z interval, 526 


one-sided tests, confidence intervals (connection), 597 


Population parameter 
capture, confidence interval (usage), 501 


confidence intervals, simulation, 526, 528f, 532 


estimation, 451 
Population proportion 


confidence interval, construction, 518 


estimation, 510 
one-sample interval, 511 
tests, 568 

Potatoes, production 


proportion, significance test, 574-575 
Type I/II errors, 561 

Power, 598, 602 
calculation, 602 


increase, 601 
model, 796, 800-801 
finding, 802 
form, 800 
Type IJ error, relationship, 599 


Type I/II errors, connection, 601f 


Power of a test. See Tests 


Practical importance, statistical significance (relationship), 603-604 
Prediction, 177-179 

avoidance, 178 

candy grab, 178-179 


errors, addition, 183 


Price, reference, 177 
Probability, 298, 299, 318-321 


conditional probability, 330 
defining, 299-300 
determination, 344 

finding, 315, 345-346, 372 
idea, 299-304 
interpretation, 302 

models, 314-315, 361 

plot, 771f 

rules, 313, 316-318 


tree diagrams, relationship, 341 


usage, 319 

Venn diagrams, relationship, 322-325 
Probability distribution, 361, 362 

constant, adding/subtracting (effect), 383 


construction, 391 
continuous random variable usage, 372f 
display, 365 
histogram, 364f, 383, 425f 
Proportions 
comparison, 619, 621, 748-750 


conditions, significance test (performing), 569 


equal proportions, test, 718-719 
Large Counts condition, 568 
one-sample z test, 573, 576-577 


sample proportion, 622 
combination, 632 
significance test, 574-575 


Proportions, difference 
confidence interval, construction, 629 


conditions, 625 


sampling distribution, 621-624 


significance test, performing, 633-634 


conditions, 630 
two-sample z interval, 626 
two-sample z test, 634 
PROVE-IT experiment, 619 
pP sampling distribution, 459-464, 622-623 


binomial count, relationship, 463 
calculations, 464—465 


conditions, violation, 514-515 


confidence interval 
construction, 511—516, 517-518 


interpretation, 517-51 


description, 624 


estimation, conditions, 511-514 


mean, 622 


formulas, origin, 623 


normal approximation, usage, 464-465 

one-sample z test, 572—577 

significance test, performing, 568-572 

simulation, 622f 

standard deviation, formulas (origin), 623 
Pulse rate. See Standing/sitting pulse rates 

change, 252 

effect, absence, 273f 
Pumpkin selection, boxplots (usage), 69—70 
P-values, 631-633, 657 

calculations, 570-572, 587-590 

chi-square distributions, relationship, 714—717 


computation, t distributions (usage), 590-591 
display, 571f, 588f 

estimation, 637 

finding, 716—717 


limitations, 589 


interpretation, 556-560 


PVC pipe length, comparative dotplots, 59f 


Q 


Quantitative data 
category grouping, 750 


description, numbers (usage), 54 
display, graphs (usage), 30 
distribution, 102, 112 
description, 113 
sets, 115 
Quantitative table, frequency table, 92—93 
Quantitative variables, 3, 152 


distribution 
description process, 34 
skewness, 131 
requirements, 168 
values, 525 
Quartiles, 64 
identification process, 66 


Questions, wording, 234 


Quiz scores, 33 


R 


r2 values, 189-192 
interpretation, 191—192 
role, 188-192 

Random assignment, 251—253, 275 
process, 251 


purpose, 272 
Random condition, 512, 532, 573 
impact, 591 
Random Data Selector, 336 
Random digits, usage, 306 


Random numbers 
distribution, 111 
sequence, generation, 111 
Random reassignments (simulation), p; — pz values (dotplot), 637f 
Random sampling, 225-226, 275 
methods, 229-232 
work, inference methods, 636-637 
Random variables, 361, 362, 368 
combining, 381, 388-390 


transforming, contrast, 393-394 


defining, 361-362 
difference, mean (expected value), 389 


normal probability distribution, 387f 
normal random variables, combining, 394-397 
sum/difference 
mean (expected value), 388-390 
standard deviation, 390—394 
transforming, 381, 382-388 
Randomized block designs, 256—261 
outline, 257f 


Randomized experiments, random sampling work (inference methods), 636-637, 662 


Randomness 
investigation, 302-303 
understanding, 302—304 

Range, 59-60 


estimation, 449 


Regression 
analysis, 798-799, 801 
applet, usage, 163-1 


rm 


[op) 


inference, conditions, 773-776 
knowledge, 197-201 
least-squares regression, 176 
model, 773f 

output, interpretation, 192—1 


(ee) 


population regression line, 770 


sample regression line, 770 
standard error, 777 
Regression line, 176 
addition, 188f 
interpretation, 181-182 
scatterplot, 177f 


slope, interpretation, 182 
usage, 177f, 197-201 
Relationships, display, 154—15 


Relative frequency, 93 
table, 9, 10 
usage, 45f 
Replication, 253 
Residual plots 


creation, 187 


interpretation process, 186 


patterns, examination (reason), 187 


usage, 185-188 
fertility rate/average income, relationship, 185f 
Residual s, 188-192 
Residuals, 179-181, 780, 782 
addition, 183f 
equal variability, 775 


squared residuals, sum, 190f 
standard deviation, 188—1 


IO 


sum, minimization, 183 

Resistant, 56 

Response bias, 234, 242 

Response variables, 153-154, 243 
explanatory variables, contrast, 168 
standard deviations, 196—197 


value 
prediction, 177 
pth root, 799 


Responses, strength, 277 


Resting pulse rates, back-to-back stemplot, 39f 
Root mean squared error, 777 
Rough symmetry, 111 


Row number, 782 


S 


s value 


N 


interpretation, 191—1 


r2, relationship, 191 
role, 188-192 


standard deviation, 189 


Sales tax 
histograms, usage, 42 
rate, frequency histogram, 43f 
Sample, 221 
data, distribution, 446 
effectiveness, 225-228 
problems, 223-225 
regression line, 770 
probability plot/histogram, 771f 
simple random sample (SRS), 225-226 
standard deviation (calculation), division (reason), 450 
stratified random sample, 229 


Sample mean (x ©) 


mean/standard deviation formulas, origin, 470 
sampling distribution, 468-470, 468f 


examination, 471—472 


mean/standard deviation, 469 
shape, 477 


Sample means, 468 


Sample proportion, sampling distribution, 446, 461-462 


Sample proportions, 458, 622 
sampling distribution, 459, 459f 


Sample size, 600 
adjustment, 503-504 
determination, 520 


margin of error, usage, 539 


increase, 504 


sampling variability, relationship, 271 
selection, 519-529, 538-540 
Sample space, 314-315 


events, relationship, 322f 


example, 339f 


outcomes, likelihood, 316 


Sample survey 
idea, 222-223 
problems, 232-235 


Sample variance, 450 
Sampling, 221 
biased sampling methods, 224—225 


binomial distributions, relationship, 419-420 
convenience sampling, 223-224 

example, 229 

heights, 444, 449 

inference, 270-271 

methods, 231-23 


monitors, 221—222 


N 


random sampling, 225-226 
statistical sampling, binomial distributions, 418-420 
variability, 270-271, 443 


sample size, relationship, 271 


variability, margin of error (impact), 518 
voluntary response sampling, 224 
approximate sampling distribution, 444 


approximations, 461f 


creation, 444 
description, 447-453 
mean, 461, 646 
samples, usage, 446f 


standard deviation, 462 
factors, 772 
usage, 445 
Satterthwaite, F.E., 651 
Scary movies/fear, chi-square test (selection), 747—748 


Scatterplots 
axes, label/scale, 155 


correlation, 153 
creation, 155—156 


process, 155 


data values, plotting, 155 

description, 156—160 

linear relationship, 768 

usage, 154-156 
Scholastic Aptitude Tests (SATs), normal/large sample condition, 533-534 
School assembly, sampling, 231—232 


Scores 
adjusted scores/doubled scores, graphs/numerical summaries, 100 
dotplots, 91f, 96, 98f 
report, 95 


summary Statistics, 96, 98f 
Seat belt use, NHTSA observational study, 220 
Segmented bar graph, 29 


snowmobile distribution, 19f 


comparison, 461, 469 
description, 32-34, 364-365 


quiz scores/die rolls, 33 


determination, normal probability plot (usage), 136 


Shipments, timing (claim), 403 
Shoe ownership, JQR determination, 64—65 
Side-by-side bar graph, 19 

snowmobile distribution, 19f 
Significance level, 559-562, 600 
Significance test, 552, 554, 602 

conclusion, making, 558 

four-step process, 572 

pi usage, 585-591 


P1 — P2 significance tests, 630-634 
conditions, 630, 656, 679 
power, increase, 601 
P-value, finding, 570 
Silver, Nate, 233 
Simple random sample (SRS), 225-226 
distribution, 448f 
independent SRSs, selection, 624f 


least-squares regression lines, 770f 
mean height, sampling distribution, 474f 


sample means, simulated sampling distribution, 646f 


sample proportion, sampling distribution, 451f 
scatterplots, 770f 


selection, 621 
selection process, 226-227 


Table D, usage, 227-228 


technology, usage, 226—227 


simulation, rise, 450f 


software, usage, 646 


taking, simulation, 526 


Simulated sampling distribution, 646f 
dotplot, 553f 
Simulation, 298, 299, 304-308 


performing, 305-307 


process, 305 
Single-blind experiment, 249 
68—95-99.7 rule, 116-119 
defining, 117 
usage, 118-119, 118f 
avoidance, 120 
Skewness, 111, 425 
effect, 58 
Slope, 181 
confidence interval, construction, 778—782 


interpretation, 182 

sampling distribution, 772 
significance test, performing, 782-786 
standard error, 777 

t interval, 778 

t test, 783 


Smartphone experiment, 258 


randomized block design, outline, 257f 
Smoking 
ADHD, confounding, 243 
impact, 277 
study, confidence interval, 579f 
Snowmobiles (impact), independence (chi-square test), 744—745 
Soccer goals, dotplot, 31f 
South Africa household size, United Kingdom household size (distribution comparison), 36 
Spieth, Jordan, 124 
drives, distance (proportion), 125 


values, areas (determination), 126—127 


Spinners, probability, 315 
Spoken languages (distribution), dotplot (usage), 5f 
Squared residuals, sum, 190f 
percent reduction, calculation, 190 
Standard deviation, 368—371, 587 
calculation process, 61-62 


complexity, 62 
display, 114 
finding, 369 


formula, 623 


WO 


interpretation, 61—62, 36 


S—- lc 


number, interpretation, 96 
parameter, 776—777 
usage, 61-63, 469 

Standard error, 513, 587 

Standard normal distribution, 120, 121f 
usage, 121-122, 128 

Standardized score (z-score), 95 
scatterplot, 168f 


unstandardized normal distribution, usage, 122 
Standardized test statistic, 631-633, 657 

calculations, 570-572, 587-590 

equations, 587, 595 


Standing/sitting pulse rates 
comparison, 684 
parallel dotplots, 684f 
Statistical sampling, binomial distributions, 418—420 
Statistical significance, practical importance (relationship), 603-604 


Statistically significant, 272 
Statistics, 442-443 


definition, 2 


normal distributions, importance, 116 
probability distribution, 384f 


standard deviation, 511 
test, 96 

Stemplots, 37—40 
back-to-back stemplot, 39 
creation process, 38 
head circumference data, 39f 
interpretation, 38 

Strata, 229, 230 
clusters, difference, 231 


Stratified random sample, 229 


estimates, precision, 230 
Stream health, p (significance test), 592-593 
Strength 

description, 160 


resistant measure, correlation (relationship), 165 
Students, guesses (dotplots/summary statistics), 99f, 101f 
Studies, usage, 269 
Subjects, 245 

usage, 253 


Subtraction, order (importance), 389 
Success 

condition, 406 

proportions, 624f 
Sugar, amount, 396-397 

normal distribution, 397f 
Summary statistics, 97 

usage, 194 

values, influence, 68 


Sunflowers, sampling, 229 


Surveys, 221 
sample survey, idea, 222-223 


Survival times, normality, 135 


Symmetric density curve 
mean, 112f 
median, 112f 
midpoint balance, 112 


Symmetric distribution, 32 


- 


t* critical values 


calculation, 529-531 

finding, Table B (usage), 530 
t distributions, 588 

density curves, 529f, 588f 

modeling, 588 


symmetry, usage, 589 
usage, 590-591, 650-651 
t interval, 778 
t test, 783 
t values, 779 
Table A, usage, 121 
Table B, usage, 530-531 
Table C, usage, 734 
Table D, usage, 227-229 
Tablet quality (distribution comparison), boxplots (usage), 71—72 
Tattoos, bias project, 619-620, 627 
Tax rate (histogram), 45 


Teens 
debit cards 
binomial distribution, normal approximation, 421 
binomial distributions/sampling, 419-420 


social media, general multiplication rule, 338 


‘Temperatures 
parameters/statistics, 442—443 
tracking, 101-102 


allowance, 592 
Test of hypotheses, 552 
Test of significance, 552 


Test scores 
probability distribution, 384f 
scaling, constant adding/subtracting (effect), 384 
variation, 260 

Test statistic, 570, 57 

Tests, power, 597—60 


On 


\ee) 


factors, 599-603, 602 

interpretation, 598 
Tests, usage, 603-604 
Third quartile, 64 
Three-year-olds, height 

determination, 129-130 


normal distribution, mean, 131f 


Time (effect), cause (impact), 278 


Titanic 
travel classes, bar graph, 21f 


two-way tables/conditional probabilities, 332 
Titanic disaster 

conditional relative frequencies, 17-18 

conditional relative frequencies, association, 20-21 

marginal/joint relative frequencies (calculation), 15—16 
Transformations, 97—98, 795 

connecting, 102—103 

effects, analysis, 101-102, 387 


logarithms, usage 


exponential models, 803-806 


power models, 800-802 
powers/roots, usage, 796—799 
selection, 806-810 

Treatment, 245 
Tree diagrams, 342f 


conditional probability, relationship, 338-343 
going backward, 340 
possibilities, 340f 


probability, relationship, 341 
sample space example, 339f 
Trees (comparison), [1 — 2 confidence intervals, 652-653 
Trucks 
miles driven, 779 
price/miles, scatterplot (usage), 177f 
price prediction, regression line (usage), 177f 
value 
loss, 176, 177f 
slope, confidence interval, 779-780 
20/20 (ABC News program), blind taste test, 361 
Two-sample t interval, 650 


Two-sample t procedures, simulation studies, 651 
Two-sample z interval, 626 


Two-sample z procedures, 726 
Two-sample z test, 634-637 
Two-sided alternative hypothesis, 554—555 


Two-sided significance test, Minitab output, 579f 
Two-sided tests, 577-580, 589-590 
confidence intervals, relationship, 595-597 
usage, 595-596 


Two-variable data, 189 
Two-way tables, 14, 318-321, 62 
analysis, 22 


iS 


chi-square tests (performing), 735—736 


inference, 726 
Minitab output, 740f 
usage, 319, 332 

Type I errors, 560-562 
power, connection, 601f 
probability, 562 

Type II errors, 560—562 
power, relationship, 599, 601f 


Typing speed, 258 


Unbiased estimators, 448-449, 469 
Unconditional probabilities, 406 


Undercoverage, 233 

Uniform density curve, 110f 

Union, 323, 331-333 

United Kingdom household size, South Africa household size (distribution comparison), 36 
Unknown o, problem, 525-531 

Unrepresentative data, production, 223-224 


Unstandardized normal distribution, usage, 122, 128 


U.S. Census Bureau 
Current Population Survey (CPS), 232, 269, 440-441 
student job study, conditions (checking), 569 


U.S. presidents, age (cumulative relative frequency graph), 93f 


interpretation, 94 


V 


Values 
determination, 127-131 


area, usage, 129-130 


usage, 178 
comparison, 461, 469 
estimator usage, 453f 
resistant measurement, 63 
size, 451-452 
Variability, measurement, 368-371 


interquartile range (IQR), 63-65 

range, 59-60 

standard deviation, usage, 61—63 
Variables, 2 

control, 252 


means/standard deviation, 166 
Variance, 368-371 
addition rules, usage, 470 


constant multiplication, impact, 386 
formulas, origin, 417 


Vending machine problem, 130 


Venn diagrams 
events, representation, 322f 
probability, relationship, 322-325 


sample space, 322f 
Video screen tension, p! (confidence interval construction), 536-537 
Vitamin D, impact (study), 242—244 
Volume, 797 
Voluntary response 


sampling, 224 


term, misuse, 233 


Voters, sampling monitors, 221—222 


W 


Wade Tract Preserve, 652 
Weight/length (contrast), scatterplot (usage), 798f 
Welch, B. L., 651 
Whiskers, drawing, 69 
Women, height 
comparison, 474f 
normal population, sampling, 472-473 
normal probability distributions, 373 
Word lengths, histograms, 45f 
Work week length 
hypotheses (stating)/conditions (checking), 656 
standardized test statistic/ 


P-value, calculation, 657-658 


X 


x £, See Sample mean 
x 1-x 2%1 ~ “2, sampling distribution, 647 


description, 648 


mean/standard deviation formulas, origin, 647 


Y 


y intercept, 181 


interpretation, 182 
Yellowstone National Park, 744—745 


Z 


ZoomStat, usage, 160 


Z-Scores 
calculation, 97 
connecting, 102—103 


determination, 96, 124 


interpretation, 96 


multiplication, 167 


percentiles, connection, 121 


usage, 95-97 
z-scores distribution 
mean, 102-103 

shape, 102 


standard deviation, 103 


About the AP® Exam and AP® Exam 
Tips 


The AP® Statistics exam currently consists of two distinct sections: Multiple Choice and Free 
Response. Each section lasts 90 minutes and counts 50% of the total exam score. Periodically, 
The College Board!™ modifies the composition of the exam, so it is wise to check the AP® 


Statistics website (apcentral.collegeboard.org/courses/ap-statistics) to verify the breakdown 


before you begin your preparation for the exam. 


Formulas and tables like the ones on the next few pages are provided on both sections of 
the exam. You may use your calculator throughout the exam. See the AP® Exam Tips that 
follow for advice about how to maximize your score. 


Section I: Multiple Choice Your score is based only on the number of questions you 
answer correctly. So don’t leave any questions unanswered! 


Weighted Section I Score = Number of correct answers x 1.25 


Section II: Free Response Each free response question is scored holistically on a 0 to 4 
scale. The score categories represent different levels of quality in a student’s response across 
two dimensions: statistical knowledge and communication. 


4 Complete Shows complete statistical understanding of the problem; Provides a 
clear, organized, complete explanation 


3 Substantial Shows substantial statistical understanding of the problem; Provides 
a fairly clear and organized, almost complete explanation 


2 Developing Shows some statistical understanding of the problem; Provides 
somewhat clear and organized, incomplete explanation 


1 Minimal Shows limited statistical understanding of the problem; Provides 
minimal or unclear explanation 


0 Shows little or no statistical understanding of the problem; Provides 
no meaningful explanation 


Weighted Section II Score = (Sum of scores on Questions 1—5) x 1.875 + Question 6 score x 
3.125 
Composite Score = Weighted Section I Score + Weighted Section II Score 


Composite scores (on a 100-point scale) are converted to AP® Scores (on a 1 to 5 scale) 
using cutoffs determined each year based on statistical analysis of overall student 
performance on the exam. In 2012, for example, it took 33 points to earn a 2, 44 points to 


| earn a 3, 57 points to earn a 4, and 70 points to earn a 5. | 


Chapter 1 


e If you learn to distinguish categorical from quantitative variables now, it will pay big 
rewards later. You will be expected to analyze categorical and quantitative variables 
correctly on the AP® Statistics exam. 


e When comparing groups of different sizes, be sure to use relative frequencies (percents or 
proportions) instead of frequencies (counts) when analyzing categorical data. Comparing 
only the frequencies can be misleading, as in this setting (page 18). There are many more 
people who never use snowmobiles among the non-environmental club members in the 
sample (445) than among the environmental club members (212). However, the percentage 
of environmental club members who never use snowmobiles is much higher (69.5% to 
36.4%). Finally, make sure to avoid statements like “More club members never use 
snowmobiles” when you mean “A greater percentage of club members never use 
snowmobiles.” 


e Always be sure to include context when you are asked to describe a distribution. This means 
using the variable name, not just the units the variable is measured in. 


e When comparing distributions of quantitative data, it’s not enough just to list values for the 
center and variability of each distribution. You have to explicitly compare these values, 
using words like “greater than,” “less than,” or “about the same as.” 


e If you’re asked to make a graph on a free response question, be sure to label and scale your 
axes. Unless your calculator shows labels and scaling, don’t just transfer a calculator screen 
shot to your paper. 


e The formula sheet provided with the AP® Statistics exam gives the sample standard 


= Wor S- (x; — z)’. 


e You may be asked to determine whether a quantitative data set has any outliers. Be prepared 
to state and use the rule for identifying outliers. 


__ 8% 
deviation in the equivalent form sx=1n-1)> (xi-x )2. 


e Use statistical terms carefully and correctly on the AP® Statistics exam. Don’t say “mean” if 
you really mean “median.” Range is a single number; so are Q1,%1; Q3,@3; and IQR. Avoid 
poor use of language, like “the outlier skews the mean” or “the median is in the middle of the 
IQR.” Skewed is a shape and the IQR is a single number, not a region. If you misuse a term, 
expect to lose some credit. 


Chapter 2 


¢ Students often do not get full credit on the AP® Statistics exam because they only use 
option (ii) with “calculator-speak” to show their work on Normal calculation questions— 


for example, normalcdf(—1000,6,6.84,1.55). This is not considered clear communication. 
To get full credit, follow the two-step process (page 122), making sure to carefully label 
each of the inputs in the calculator command if you use technology in Step 2: 
normalcdf(lower: —-1000—1000 upper: 6, mean: 6.84, SD:1.55). 

¢ As noted previously, to make sure that you get full credit on the AP® Statistics exam, do 
not use “calculator-speak” alone—for example, invNorm(0.90,6.84,1.55). This is not 
considered clear communication. To get full credit, follow the two-step process (page 
129), making sure to carefully label each of the inputs in the calculator command if you 
use technology in Step 2: invNorm(area: 0.90, mean: 6.84, SD:1.55). 

e Never say that a distribution of quantitative data is Normal. Real-world data always show 
at least slight departures from a Normal distribution. The most you can say is that the 
distribution is “approximately Normal.” 

¢ Normal probability plots are not included on the AP® Statistics topic outline. However, 
these graphs are very useful for assessing Normality. You may use them on the AP® exam 
if you wish—just be sure that you know what you’re looking for. 


Chapter 3 


e When you are asked to describe the association shown in a scatterplot, you are expected to 
discuss the direction, form, and strength of the association, along with any unusual 
features, in the context of the problem. This means that you need to use both variable 
names in your description. 

e If you are asked to make a scatterplot, be sure to label and scale both axes. Don’t just copy 
an unlabeled calculator graph directly onto your paper. 

e When asked to interpret the slope or y intercept, it is very important to include the word 
predicted (or equivalent) in your response. Otherwise, it might appear that you believe the 
regression equation provides actual values of y. 

e When displaying the equation of a least-squares regression line, the calculator will report 
the slope and intercept with much more precision than we need. There is no firm rule for 
how many decimal places to show for answers on the AP® Statistics exam. Our advice: 
decide how much to round based on the context of the problem you are working on. 


Chapter 4 


e If you’re asked to describe how the design of a sample survey leads to bias, you’re 
expected to do two things: (1) describe how the members of the sample might respond 
differently from the rest of the population, and (2) explain how this difference would lead 
to an underestimate or overestimate. Suppose you were asked to explain how using your 
Statistics class as a sample to estimate the proportion of all high school students who own 
a graphing calculator could result in bias. You might respond, “This is a convenience 
sample. It would probably include a much higher proportion of students with a graphing 


calculator than in the population at large because a graphing calculator is required for the 
statistics class. So this method would probably lead to an overestimate of the actual 
population proportion.” 

e If you are asked to identify a possible confounding variable in a given setting, you are 
expected to explain how the variable you choose (1) is associated with the explanatory 
variable and (2) is associated with the response variable. 

e If you are asked to describe a completely randomized design, stay away from flipping 
coins. For example, suppose we ask each student in the caffeine experiment (page 252) to 
toss a coin. If it’s heads, then the student will drink the cola with caffeine. If it’s tails, then 
the student will drink the caffeine-free cola. As long as all 20 students toss a coin, this is 
still a completely randomized design. Of course, the two groups are unlikely to contain 
exactly 10 students because it is unlikely that 20 coin tosses will result in a perfect 50-50 
split between heads and tails. 


The problem arises if we try to force the two groups to have equal sizes. Suppose we 
continue to have students toss coins until one of the groups has 10 students and then place 
the remaining students in the other group. In this case, the last two students in line are very 
likely to end up in the same group. However, in a completely randomized design, the last 
two subjects should only have a 50% chance of ending up in the same group. 

e Don’t mix the language of experiments and the language of sample surveys or other 
observational studies. You will lose credit for saying things like “use a randomized block 
design to select the sample for this survey” or “this experiment suffers from nonresponse 
because some subjects dropped out during the study.” 


Chapter 5 


e On the AP® Statistics exam, you may be asked to describe how to perform a simulation 
using rows of random digits. If so, provide a clear enough description of your process for 
the reader to get the same results from only your written explanation. Remember that 
every label needs to be the same length. In the golden ticket lottery example (page 306), 
the labels should be 01 to 95 (all two digits), not 1 to 95. When sampling without 
replacement, be sure to mention that repeated numbers should be ignored. 

e Many probability problems involve simple computations that you can do on your 
calculator. It may be tempting to just write down your final answer without showing the 
supporting work. Don’t do it! A “naked answer,” even if it’s correct, will usually be 
penalized on a free response question. 


e You can write statements like P(A|B)? (AB) if events A and B are clearly defined in a 


problem. Otherwise, it’s probably easier to use contextual labels, like P(IIF)? (I/F) in the 


preceding example (page 333). Or you can just use words: P(Instagram|Facebook) 
P(Instagram| Facebook) 


Chapter 6 


e If the mean of a random variable has a non-integer value but you report it as an integer, 
your answer will not get full credit. 

e If you are asked to calculate the mean or standard deviation of a discrete random variable 
on a free response question, you must show numerical values substituted into the 
appropriate formula, as in the previous two examples (pages 367 and 369). Feel free to use 
ellipses (...) if there are many terms in the summation, as we did. You may then use the 
method described in Technology Corner 12 to perform the calculation with 1-Var Stats. 
Writing only 1-Var Stats L1, L2 and then giving the correct values of the mean and 
standard deviation will not earn credit for showing work. 

e Students often do not get full credit on the AP® Statistics exam because they only use 
option (ii) with “calculator-speak” to show their work on Normal calculation questions— 
for example, normalcdf(68,70,64,2.7). This is not considered clear communication. To get 
full credit, follow the two-step process (page 373), making sure to carefully label each of 
the inputs in the calculator command if you use technology in Step 2: normalcdf(lower:68, 
upper:70, mean:64, SD: 2.7). 

e Don’t rely on “calculator speak” when showing your work on free response questions. 
Writing binompdf (5,0.25,3)=0.08789(5; 0.25, 3) = 0.08789 will not earn you full credit 
for a binomial probability calculation. At the very least, you must indicate what each of 
those calculator inputs represents. For example, “binompdf(trials:5,p:0.25,x value:3) 
=0.08789.”= 0.08789. ” 


Chapter 7 


e Many students lose credit on the AP® Statistics exam when defining parameters because 
their description refers to the sample instead of the population or because the description 
isn’t clear about which group of individuals the parameter is describing. When defining a 
parameter, we suggest including the word all or the word true in your description to make 
it clear that you aren’t referring to a sample statistic. 

e Terminology matters. Never just say “the distribution.” Always say “the distribution of 
[blank],” being careful to distinguish the distribution of the population, the distribution of 
sample data, and the sampling distribution of a statistic. Likewise, don’t use ambiguous 
terms like “sample distribution,” which could refer to the distribution of sample data or to 
the sampling distribution of a statistic. You will lose credit on free response questions for 
misusing statistical terms. 

e Make sure to understand the difference between accuracy and precision when writing 
responses on the AP® Statistics exam. Many students use “accurate” when they really 
mean “precise.” For example, a response that says “increasing the sample size will make 
an estimate more accurate” is incorrect. It should say that increasing the sample size will 
make an estimate more precise. If you can’t remember which term to use, don’t use either 
of them. Instead, explain what you mean without using statistical vocabulary. 


e Notation matters. The symbols px ,D,p,}40,np,op\,x ,” 997s Ps By Fy B51 951 Mes and 


ox %= all have specific and different meanings. Either use notation correctly—or don’t 
use it at all. You can expect to lose credit if you use incorrect notation. 

e Many students lose credit on probability calculations involving x £ because they forget to 
divide the population standard deviation by n.V™: Remember that averages are less 
variable than individual observations! 


Chapter 8 


e When interpreting a confidence interval, make sure that you are describing the parameter 
and not the statistic. It’s wrong to say that we are 95% confident the interval from 0.613 to 
0.687 captures the proportion of U.S. adults who admitted they would experience financial 
difficulty. The “proportion who admitted they would experience financial difficulty” is the 
sample proportion, which is known to be 0.65. The interval gives plausible values for the 
proportion who would admit to experiencing some financial difficulty if asked. 

e Ona given problem, you may be asked to interpret the confidence interval, the confidence 
level, or both. Be sure you understand the difference: the confidence interval gives a set of 
plausible values for the parameter and the confidence level describes the overall capture 
rate of the method. 

e Ifa free response question asks you to construct and interpret a confidence interval, you 
are expected to do the entire four-step process. That includes clearly defining the 
parameter, identifying the procedure, and checking conditions. 

e You may use your calculator to compute a confidence interval on the AP® Statistics exam. 
But there’s a risk involved. If you just give the calculator answer with no work, you’ ll get 
either full credit for the “Do” step (if the interval is correct) or no credit (if it’s wrong). If 
you opt for the calculator-only method, be sure to complete the other three steps, including 
identifying the procedure (e.g., one-sample z interval for p) and give the interval in the Do 
step (e.g., 0.19997 to 0.26073). 

e Ifa question on the AP® Statistics exam asks you to construct and interpret a confidence 
interval, all the conditions should be met. However, you are still required to state the 
conditions and show evidence that they are met—including a graph if the sample size is 
small and the data are provided. 

e It is not enough just to make a graph of the data on your calculator when assessing 
Normality. You must sketch the graph on your paper to receive credit. You don’t have to 
draw multiple graphs—any appropriate graph will do. 


Chapter 9 


e Hypotheses always refer to a population, not to a sample. Be sure to state HOM and Ha 
Ha in terms of population parameters. It is never correct to write a hypothesis about a 
sample statistic, such as HO:p\=0.80/40 : P = 9.80 oy Ha:x 431.Ha : = # 31. 


e We recommend that you follow the two-sentence structure from the example (page 559) 
when writing the conclusion to a significance test. The first sentence should give a 
decision about the null hypothesis—reject HO/40 or fail to reject HO/40—based on an 
explicit comparison of the PP-value to a stated significance level. The second sentence 
should provide a statement about whether or not there is convincing evidence for Ha/e in 
the context of the problem. 

° When a significance test leads to a fail to reject HO0 decision, as in the preceding 
example (page 574), be sure to interpret the results as “We don’t have convincing 
evidence for Ha.” a.” Saying anything that sounds like you believe HO/0 is (or might 
be) true will lead to a loss of credit. For instance, it would be wrong to conclude, “There is 
convincing evidence that the true proportion of blemished potatoes is 0.08.” And don’t 
write responses as text messages, like “FTR the H0.”40.” 

e You can use your calculator to carry out the mechanics of a significance test on the AP® 
Statistics exam. But there’s a risk involved. If you give just the calculator answer with no 
work, and one or more of your values are incorrect, you will probably get no credit for the 
“Do” step. If you opt for the calculator-only method, be sure to name the procedure (one- 
sample z test for a proportion) and to report the test statistic (z)=1.15)(2 ) = 1.15) and PP- 
value (0.1243). 

e When making a conclusion in a significance test, be sure that you are describing the 
parameter and not the statistic. In the preceding example (page 578), it’s wrong to say that 
we have convincing evidence that the proportion of students at Yanhong’s school who 
said they have never smoked differs from the CDC’s claim of 0.68. The “proportion who 
said they have never smoked” is the sample proportion, which is known to be 0.60. The 
test gives convincing evidence that the proportion of all students at Yanhong’s school who 
would say they have never smoked a cigarette differs from 0.68. 

e It is not enough just to make a graph of the data on your calculator when assessing 
Normality. You must sketch the graph on your paper and make an appropriate comment 
about it to receive credit. You don’t have to draw more than one graph—a single 
appropriate graph will do. 

e Remember: If you give just calculator results with no work, and one or more values are 
wrong, you probably won’t get any credit for the “Do” step. If you opt for the calculator- 
only method, name the procedure (one-sample t test for up) and report the test statistic 
(t)=-0.94)(¢) ~ —0.94) degrees of freedom df=14,4f = 14, and PP-value (0.1809). 


Chapter 10 


e The formula for the two-sample z interval for p1—p2?! ~ P2 often leads to calculation 
errors by students. As a result, your teacher may recommend using the calculator’s 2- 
PropZInt feature to compute the confidence interval on the AP® Statistics exam. Be sure 
to name the procedure (two-sample z interval for pl1—p2?!1 ~ P2) in the “Plan” step and 
give the interval (-0.311,0.116)| —0.311, 0.116) in the “Do” step. 


e The formula for the two-sample z statistic for a test about pl—p2?1 ~ P2 often leads to 
calculation errors by students. As a result, your teacher may recommend using the 
calculator’s 2-PropZTest feature to perform calculations on the AP® Statistics exam. Be 
sure to name the procedure (two-sample z interval for p1—p2?!1 ~ P2) in the “Plan” step 
and report the standardized test statistic (z=1.17(2 = 1.17 and P-value (0.2427) in the 
“Do” step. 

e The formula for the two-sample t interval for p1-p21 ~ #2 often leads to calculation 
errors by students. Also, the interval produced by technology is narrower than the one 
calculated using the conservative method. As a result, your teacher may recommend using 
the calculator’s 2-SampTInt feature to compute the confidence interval. Be sure to name 
the procedure (two-sample t interval for 11-p2#! ~ #2) in the “Plan” step and give the 
interval (3.9362, 17.724) and df (55.728) in the “Do” step. 

e The formula for the two-sample t statistic for a test about p1-21 ~ #2 often leads to 
calculation errors by students. Also, the P-value from technology is smaller and more 
accurate than the one obtained using the conservative method. As a result, your teacher 
may recommend using the calculator’s 2-SampTTest feature to perform calculations. Be 
sure to name the procedure (two-sample t test for p1-p2/1 ~ #2) in the “Plan” step and to 
report the standardized test statistic (t=1.60(¢ = 1.60 p_yalue (0.0644), and df (15.59) in 
the “Do” step. 


Chapter 11 


e The formula for the chi-square test statistic is included on the formula sheet that is 
provided on the AP® Statistics exam. However, it doesn’t include the word count: 


. (Observed- Expected)? 


x? 
x2=y ( Observed-Expected)2Expected Expected 


We included the word count to emphasize that you must use the observed and expected 
counts—not the observed and expected proportions—when calculating the chi-square test 
statistic. 


e When checking the Large Counts condition, be sure to examine the expected counts, not 
the observed counts. And make sure to write and label the expected counts on your paper 
or you won’t receive credit. 

e You can use your calculator to carry out the mechanics of a significance test on the AP® 
Statistics exam. But there’s a risk involved. If you just give the calculator answer with no 
work, and one or more of your values is incorrect, you will likely get no credit for the 
“Do” step. We recommend writing out the first few terms of the chi-square calculation 
followed by “.. .” . This approach might help you earn partial credit if you enter a number 
incorrectly. Be sure to name the procedure (chi-square test for goodness of fit) and to 


report the test statistic (y2=11.2),0€ = 11.2), degrees of freedom (df=3)(4£ = 3) and P- 


value (0.011). 

e As with chi-square tests for goodness of fit, the expected counts should not be rounded to 
the nearest whole number. While an observed count of entrées ordered must be a whole 
number, an expected count need not be a whole number. The expected count gives the 
average number of entrées ordered if HO/0 is true and the random assignment process is 
repeated many times. 

e Inthe “Do” step, you aren’t required to show every term in the chi-square test statistic. 
Writing the first few terms of the sum followed by “. . .” is considered as “showing work.” 
We suggest that you do this and then let your calculator tackle the computations. 

e You can use your calculator to carry out the mechanics of a significance test on the AP® 
Statistics exam. But there’s a risk involved. If you just give the calculator answer without 
showing work, and one or more of your entries is incorrect, you will likely get no credit 
for the “Do” step. We recommend writing out the first few terms of the chi-square 
calculation followed by “. . .”. This approach may help you earn partial credit if you enter 
a number incorrectly. Be sure to name the procedure (y2=test for homogeneity) 

(x° = test for homogeneity) and to report the test statistic (y2=18.279), (X° = 18.279) , 
degrees of freedom (af=4), (df = 4), and P-value (0.0011). 

¢ Many students lose credit on the AP® Statistics exam because they don’t write down and 
label the expected counts in their response. It isn’t enough to claim that all the expected 
counts are at least 5. You must provide clear evidence. 

e When the P-value is very small, the calculator will report it using scientific notation. 
Remember that P-values are probabilities and must be between 0 and 1. If your calculator 
reports the P-value with a number that appears to be greater than 1, look to the right, and 
you will see that the P-value is being expressed in scientific notation. If you claim that the 
P-value is 4.82, you will certainly lose credit. 


Chapter 12 


° We use the same notation as the AP® Statistics exam formula sheet for the equation of the 
sample regression line yA=b0+b1x.¥ = 0 + 612. However, your graphing calculator 
probably uses the notation y\=a+bx.¥ = @ + 52. Just remember: the slope is always the 
coefficient of x, no matter what symbol is used. 

e The AP® Statistics exam formula sheet gives the formula for the standard error of the 
slope as 


Sh, — 


sb 1=y (yi-yi)2n- 2(xi-x ) y) V (x, —z 


The numerator is just a fancy way of writing the standard deviation of the residuals s. Can 
you show that the denominator of this formula is the same as ours? 


e When you see a list of data values on an exam question, wait a moment before typing the 
data into your calculator. Read the question through first. Often, information is provided 
that makes it unnecessary for you to enter the data at all. This can save you valuable time 
on the AP® Statistics exam. 


Formulas for the AP® Statistics Exam 


Students are provided with the following formulas on both the multiple choice and free- 
response sections of the AP® Statistics exam. 
I. Descriptive Statistics 


x =y xin? = a 


/ 1 =A 9 
_ 8s =,/— — 
sx=1n-1)'(xi-x 2” V n-1 Ye z) 


| (m4 —1)s7+(ng—1)s3 


3s, = 4/ -—-— 
sp=(n1-1)s12+(n2-1)s22(n1-1)+(n2-1) . \ (ny —1)+(n9—1) 
yA=b0+b1x¥ = bo + biz 


S()-2)-(vi-9) 
b1=)\(xi-x )-(yi-y )¥(xi-x )2 y- (2-2) 


b0=y —b1x %o = 7 —b= 


r=In-1.(xi-x sx)(yi-y sy) "~ ~( Sz )( Sy 


s 
by = r— 
Sz 


b1=rsysx 


Sb; — 


sb1=)(yi-y/i)2n—-2) (xi-x )2 y > (zi at 


ll. Probability 
P(AUB)=P(A)+P(B)-P(AnB)? (AUB) = P(A)+ P(B) — P(ANMB) 


_ P(AMB) 
P(AIB)=P(ANB)P(B) ~ P(B) 


E(X)=pX=PxipiZ (X) = wx =Deips 


Var(X)oX2=Y (xipX)2piV at (X) oy =Le(wiax) ps 


If XX has a binomial distribution with parameters n” and p,?> then: 


P(X =k) = (7) ok(1— p)"* 


P(X=k)=(nk)pk(1-p)n-k 
uxX=npHx = nip 

oX=np(1-p)?* = Jf np (1 ~ p) 
pp=piy ~ P 


/ p(l—p) 


op\=p(1-p)n?? ~ Va 


If x © is the mean of a random sample of size nv from an infinite population with mean pp 
and standard deviation o,7%> then: 


ux =pee = 


—_ _& 


= OF 
ox =on “ 


lll. Inferential Statistics 


Standardized test statistic=statistic-parameterstandard deviation of statistic 
statistic— parameter 
Standardized test statistic = ee 
standard deviation of statistic 
Confidence interval: statistic+(critical value)-(std. deviation of statistic) 
Confidence interval : statistic + (critical value) - (std. deviation of statistic) 


Single-Sample 
Statistic Standard Deviation of Statistic 
oO 
Sample Mean Pe W 
| ( 1 a ) 
Sample Proportion / ee 
p(l-pynV on 
Two-Sample 
Statistic Standard Deviation of Statistic 
oO 2 Co 2 
Difference of sample means 4 | de 


o12nl1+o022n2\"1 "2 


Special case when 


0; — 09 
1 
o,/—+ Ry 
Special case whenol=o2o01n1+1n2 “2 
Difference of sample Pi (1—p1) fi p2 (1—p2) 
proportions p1(1-pi)n1 +p2(1—p2yn2V ny ny 


Special case whenp1=p2p(1-p)1n1+1n2 
Special case when 


Pi = P2 


Chi-square test statistic=)'(observed—expected)2expected 

observed—expected)? 

Chi-square test statistic = 2 Gein tsi’ 
expected 


Probability 


Table entry for z is the area under the standard Normal curve to the left of z. 
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Table entry for p and C is the point t* with probability p lying to its right and probability C 
lying between -t* and ¢t* 


Tail probability p 


df .25 .20 15 .10 .05 .025 .02 01 005 .0025 .001 
1.000 1.376 1.963 3.078 6.314 12.71 15.89 31.82 63.66 127.3 318.3 
0.816 1.061 1.386 1.886 2.920 4.303 4.849 6.965 9.925 14.09 22.33 
0.765 0.978 1.250 1.638 2.353 3.182 3.482 4.541 5.841 7.453 10.21 
O741 0.941 1.190° 1533 2.132 2.776 2.999 3.747 4.604 5.598 7.173 
0.727 0.920 1.156 1.476 2.015 2.571 2.757 3.365 4.032 4.773 5.893 
0.718 0.906 1.134 1.440 1.943 2.447 2.612 3.143 3.707 4.317 5.208 
0.711 0.896 1.119 1.415 1.895 2.365 2.517 2.998 3.499 4.029 4.785 
0.706 0.889 1.108 1.397 1.860 2.306 2.449 2.896 3.355 3.833 4.501 
0.703 0.883 1.100 1.383 1.833 2.262 2.398 2.821 3.250 3.690 4.297 
0.700 0.879 1.093 1.372 1.812 2.228 2.359 2.764 3.169 3.581 4.144 
0.697 0.876 1.088 1.363 1.796 2.201 2.328 2.718 3.106 3.497 4.025 
0.695 0.873 1.083 1.356 1.782 2.179 2.303 2.681 3.055 3.428 3.930 
0.694 0.870 1079 1350 1.771 2.160 2.282 2650 3.012 3.372 3.852 
0.692 0.868 1.076 1.345 1.761 2.145 2.264 2.624 2.977 3.326 3.787 
0.691 0.866 1.074 1.341 1.753 2.131 2.249 2.602 2.947 3.286 3.733 
0.690 0.865 1.071 1.337 1.746 2.120 2.235 2.583 2.921 3.252 3.686 
0.689 0.863 1.069 1.333 1.740 2.110 2.224 2.567 2.898 3.222 3.646 
0.688 0.862 1.067 1.330 1.734 2.101 2.214 2.552 2.878 3.197 3.611 
0.688 0.861 1.066 1.328 1.729 2.093 2.205 2.539 2.861 3.174 3.579 
0.687 0.860 1.064 1.325 1.725 2.086 2.197 2.528 2.845 3.153 3.552 
0.686 0.859 1,063 1.323 1.721 2.080 2189 2518 2831 3.135 3.527 
0.686 0.858 1.061 1.321 1.717 2.074 2.183 2.508 2.819 3.119 3.505 


O ON OO BWDN BF 


NNNPPRP RP RP RP RP RB RP B 
NUPOMOANODAUOA WHR O 


23 0.685 0.858 1.060 1.319 1.714 2.069 2.177 2.500 2.807 3.104 3.485 
24 0.685 0.857 1.059 1.318 1.711 2.064 2.172 2.492 2.797 3.091 3.467 
25 0.684 0.856 1.058 1.316 1.708 2.060 2.167 2.485 2.787 3.078 3.450 
26 0.684 0.856 1.058 1.315 1.706 2.056 2.162 2.479 2.779 3.067 3.435 
27 0.684 0.855 1.057 1.314 1.703 2.052 2.158 2.473 2.771 3.057 3.421 
28 0.683 0.855 1.056 1.313 1.701 2.048 2.154 2.467 2.763 3.047 3.408 
29 0.683 0.854 1.055 1.311 1.699 2.045 2.150 2.462 2.756 3.038 3.396 
30 0.683 0.854 1.055 1.310 1.697 2.042 2.147 2.457 2.750 3.030 3.385 
40 0.681 0.851 1.050 1.303 1.684 2.021 2.123 2.423 2.704 2.971 3.307 
50 0.679 0.849 1.047 1.299 1.676 2.009 2.109 2.403 2.678 2.937 3.261 
60 0.679 0.848 1.045 1.296 1.671 2.000 2.099 2.390 2.660 2.915 3.232 
80 0.678 0.846 1.043 1.292 1.664 1.990 2.088 2.374 2.639 2.887 3.195 
100 0.677 0.845 1.042 1.290 1.660 1.984 2.081 2.364 2.626 2.871 3.174 
1000 0.675 0.842 1.037 1.282 1.646 1.962 2.056 2.330 2.581 2.813 3.098 
co 0.674 0.841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 
50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 
Confidence level C 


Probability 
P 


x2 


Table entry for p is the point y? with probability p lying to its right. 


Tail probability p 


df 25 .20 15 10 05 .025 .02 01 005 .0025 .001 
1 132 164 207 271 384 502 541 663 7.88 9.14 10.83 
2 24/ 322 379 461 399 738 762 921 1060 1198 13:02 
3 411 464 532 625 781 935 984 11.34 12.84 14.32 16.27 
4 539 599 674 7.78 949 11.14 11.67 13.28 1486 16.42 18.47 
a 6.63 “29 G12 924 1107 12:33 13.39 15:09 16.75 168.39 20:51 
6 7.84 856 945 1064 12.59 1445 15.03 16.81 18.55 20.25 22.46 
7 9.04 9.80 10.75 12.02 14.07 16.01 16.62 18.48 20.28 22.04 24.32 
8 10.22 11.03 12.03 13.36 15.51 17.53 18.17 20.09 21.95 23.77 26.12 
9 11.39 12.24 13.29 1468 16.92 19.02 19.68 21.67 23.59 25.46 27.88 

10 1255 13.44 14.53 15.99 18.31 2048 21.16 23.21 25.19 27.11 29.59 

11 13.70 1463 15.77 17.28 19.68 21.92 22.62 24.72 26.76 28.73 31.26 

12 1485 15.81 1699 1855 21.03 23:34 2405 26.22 28.30 30.32 32.91 
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Inference Summary 


Confidence intervals 
(Cls) Significance tests 


State the parameter you State the hypotheses you want to test and 
want to estimate andthe __ the significance level, and define any 
confidence level. parameters you use. 


Identify the appropriate Identify the appropriate inference method 
inference method and and check conditions. 
check conditions. 


If the conditions are met, If the conditions are met, perform 
perform calculations. calculations. 


e Give the sample statistic(s). 

e Calculate the standardized test 
statistic. 

e Find the PP-value. 


CONCLUDE: Interpret your interval in the Make a conclusion about the hypotheses 
context of the problem. in the context of the problem. 


Standardized test 

statistic=statistic- 

CI: statistic+(critical value)-(standard deviation of statistic) parameterstandard 
CI: statistic + (critical value) - (standard deviation of statistic) deviation of statistic 


Standardized test statistic 
statistic-parameter 


na standard deviation of statistic 
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One-sample z2 test for p? 
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Two-sample z2 interval for p1-p2P?1 ~ P2 
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Interval (2-PropZInt) 


(10.1) 


. /P1-Pi) , P2(1-P2) 


Pp, — Py) +z \ ny no 
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(2-PropZTest) 
Test see 
(10.1) [e (i-p) _p G-p) 
z=(p"1-p%2)-Op’ (1-p*)n1+p*(1-pyn2 Vom 
where p“=total successestotal sample size=X1+X2n1+n2 
' total successes 1+X9 
where p =———_______ = ———_ 
total sample size nm ,+n9 
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One-sample té interval for u 
Interval (TInterval) 
(8.3) , 
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x +t*sxn df=n-1 Vv 


One-sample tf test for uM (T-Test) 


tp 
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Interval 
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(TInterval) 
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data 
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Chi-square test for goodness of fit (X2GOF-Test) (:cor-test) 


Test X2=) (observed-expected)2expecteddf=number of categories—1 
: (11.1) ‘ Pp (observed—expected)? 
a i expected 
df number of categories — 1 
Distribution 
ofa 
categorical _ F 
variable Chi-square test for homogeneity 
2 
(x2-Test) (X -Test) 
2 or Test 
more (11.2) 
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: ; 2 
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Chi-square test for independence 
2 
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tt interval for the slope 
Interval LinRegTint 
12.1 ( on 
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Relationship 
between 2 
quantitative 
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Test (LinRegTTest) 
(12.1) . 
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= SE, 
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TI-Nspire and other technology instructions are on the Student Site at 
highschool.bfwpub.com/tps6e. 


1. Analyzing two-way tables page 22 
2. Making histograms page 43 
3. Computing numerical summaries page 66 
4. Making boxplots page 73 
5. Finding areas from values in a Normal distribution page 121 
6. Finding values from areas in a Normal distribution page 128 
7. Making Normal probability plots page 136 
8. Making scatterplots page 159 
9. Calculating least-squares regression lines page 184 
10. Making residual plots page 187 
11.Choosing an SRS page 226 
12. Analyzing discrete random variables page 370 
13. Calculating binomial coefficients page 408 
14. Calculating binomial probabilities page 410 
15. Graphing binomial probability distributions page 413 
16. Calculating geometric probabilities page 424 
17. Constructing a confidence interval for a population proportion page 518 
18. Using inverse t page 531 
19. Constructing a confidence interval for a population mean page 537 
20. Performing a one-sample z test for a proportion page 576 
21.Computing P-values from f distributions page 590 
22.Performing a one-sample t¢ test for a mean page 594 
23. Constructing a confidence interval for a difference in proportions page 629 
24.Performing a significance test for a difference in proportions page 633 
25. Constructing a confidence interval for a difference in means page 654 
26. Performing a significance test for a difference in means page 660 
27.Finding P-values for chi-square tests page 716 
28.Performing a chi-square test for goodness of fit page 719 
29.Performing chi-square tests for two-way tables page 735 
30. Constructing a confidence interval for slope page 781 
31. Performing a significance test for slope page 785 
32. Transforming to achieve linearity page 809 


Written to help you start soaring to success in your statistics course and on the AP® 
Statistics exam, The Practice of Statistics for the AP® Exam, Sixth Edition, has the 
support you want and need built into the book. Here are a few of the features designed to 
give you access to the science and art of statistics: 


¢ Clear explanations adhere to the language and terminology used on the AP® Statistics 
exam. 


¢ Paired Examples and Exercises supported by video tutorials, and featuring 
experienced AP® Statistics teachers, offer guidance when you want help, so that you 
don’t have to circle to make a landing. 


° New to this edition, Chapter Review Exercise videos offer more chances to master 
the big ideas in each chapter by taking another look at statistical concepts with in-depth 
guidance. 


e AP® Exam Tips offer “insider information” to help you develop the habits necessary for 
SUCCESS. 


e Chapter AP® and Cumulative AP® Tests offer high-quality support so that you can 
practice, practice, and practice for the AP® Statistics exam—all built right into the book, 
all of them written in the authors’ unique engaging style. 


¢ Graphing Calculator instruction with new video tutorials for the TI-83/84 and TI- 
Nspire leads you step by step to using your “device” more effectively. Additional print 
instructions are available online. 


* New Chapter Projects and integrated Activities give you a chance to think like a 
statistician. 


Stay ahead of the flock with additional, easy-to-access help: 


¢ Our new digital platform with e-Book and a comprehensive online homework 
program offers the ease of access you want and the flexibility you expect, with video 
tutorials and other support available at a simple click. 


¢ Strive for 5: Preparing for the AP® Statistics Examination by Jason Molesky and 
Michael Legacy combines a study guide and an AP® prep guide—it’s the perfect 
companion to help you get the most out of the text. 


For information about e-Book and package options and to access open-source student 
resources, visit: www.highschool.bfwpub.com/tps6e 
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